Topic Modeling: Latent Semantic Analysis for the Social Sciences

abstract

  • Objective: Topic modeling (TM) refers to a group of methods for mathematically identifying latent topics in large corpora of data. Although TM shows promise as a tool for social science research, most researchers lack awareness of the tool's utility. Therefore, this article provides a brief overview of TM's logic and processes, offers a simple example, and suggests several possible uses in social sciences. Methods: Using latent semantic analysis in our example, we analyzed transcripts of the 2016 U.S. presidential debates between Hillary Clinton and Donald Trump. Results: Resulting topics paralleled the most frequent policy-related Internet searches at the time. When divided by candidate, changes in emergent topics reflected individual policy stances, with nuanced differences between the two. Conclusion: Findings underscored the utility of TM to identify thematic patterns embedded in large quantities of text. TM, therefore, represents a valuable addition to the social scientist's methodological tool set.

published proceedings

  • SOCIAL SCIENCE QUARTERLY

author list (cited authors)

  • Valdez, D., Pickett, A. C., & Goodson, P.

citation count

  • 58

complete list of authors

  • Valdez, Danny; Pickett, Andrew C; Goodson, Patricia

publication date

  • November 2018

publisher