Journal article

Cross-Corpora Comparisons of Topics and Topic Trends


Authors listBystrov, V; Naboka, V; Staszewska-Bystrova, A; Winker, P

Publication year2022

Pages433-469

JournalJournal of Economics and Statistics

Volume number242

Issue number4

ISSN0021-4027

eISSN2366-049X

Open access statusBronze

DOI Linkhttps://doi.org/10.1515/jbnst-2022-0024

PublisherDe Gruyter Brill


Abstract
Textual data gained relevance as a novel source of information for applied economic research. When considering longer periods or international comparisons, often different text corpora have to be used and combined for the analysis. A methods pipeline is presented for identifying topics in different corpora, matching these topics across corpora and comparing the resulting time series of topic importance. The relative importance of topics over time in a text corpus is used as an additional indicator in econometric models and for forecasting as well as for identifying changing foci of economic studies. The methods pipeline is illustrated using scientific publications from Poland and Germany in English and German for the period 1984-2020. As methodological contributions, a novel tool for data based model selection, sBIC, is impelemented, and approaches for mapping of topics of different corpora (including different languages) are presented.



Authors/Editors




Citation Styles

Harvard Citation styleBystrov, V., Naboka, V., Staszewska-Bystrova, A. and Winker, P. (2022) Cross-Corpora Comparisons of Topics and Topic Trends, Journal of Economics and Statistics, 242(4), pp. 433-469. https://doi.org/10.1515/jbnst-2022-0024

APA Citation styleBystrov, V., Naboka, V., Staszewska-Bystrova, A., & Winker, P. (2022). Cross-Corpora Comparisons of Topics and Topic Trends. Journal of Economics and Statistics. 242(4), 433-469. https://doi.org/10.1515/jbnst-2022-0024


Last updated on 2025-16-06 at 11:12