Computer-Assisted Corpus Analysis: An Introduction to Concepts, Processes, and Decisions
Document Type
Article
Publication Date
3-1-2023
Abstract
Problem: This tutorial aims to guide readers through key concepts, basic processes, and common decision points that inform computer-assisted corpus-based research in technical, professional, and scientific communication (TPSC). Key concepts: Based on our collaborative experiences and an example developed for this tutorial, key concepts of corpus analysis useful to TPSC researchers and practitioners include the following: corpus location, text preparation, and programming language and software selection. Key lessons: These key concepts can be used to establish basic processes and decision points that, in turn, yield lessons related to the usefulness of lexicogrammatical language models and the significance of multidisciplinarity. Implications: Although corpus research is a growing and important part of the field of TPSC, challenges remain in terms of language model variety and ethical considerations. At least in part, these challenges can be met, respectively, by alignment between corpus and analytic tools and reference to the Common Rule and related international standards.
Identifier
85149170164 (Scopus)
Publication Title
IEEE Transactions on Professional Communication
External Full Text Location
https://doi.org/10.1109/TPC.2022.3228026
e-ISSN
15581500
ISSN
03611434
First Page
94
Last Page
113
Issue
1
Volume
66
Recommended Citation
Lang, Susan; Buell, Duncan A.; and Elliot, Norbert, "Computer-Assisted Corpus Analysis: An Introduction to Concepts, Processes, and Decisions" (2023). Faculty Publications. 1876.
https://digitalcommons.njit.edu/fac_pubs/1876