Working paper/research report
Authors list: Lenz, D.; Winker, P.
Publication year: 2018
URL: http://hdl.handle.net/10419/200671
Title of series: MAGKS Joint discussion paper series in economics
Number in series: 2018, 15
Topic modeling became an intensively researched area in economics, mainly due to the ever increasing availability of huge digital text information and the improvements in methods to analyze these datasets. In natural language processing, topic modeling describes a set of methods to extract the latent topics from a collection of documents. Several new methods have recently been proposed to improve the topic generation process. However, examination of the generated topics is still mostly based on unsatisfactory practices, for example by looking only at the list of most frequent words for a topic. Our contribution is threefold: 1) We present a topic modeling approach based on neural embeddings and Gaussian mixture modeling, which is shown to generate coherent and meaningful topics. 2) We propose a novel "topic report" based on dimensionality reduction techniques and model generated document vector features which helps to easily identify topics and significantly reduces the required mental overhead. 3) Lastly, we demonstrate on a technology related newsticker corpus how our approach could be used by economists to tackle economic problems, for example to measure the diffusion of innovations.
Abstract:
Citation Styles
Harvard Citation style: Lenz, D. and Winker, P. (2018) Measuring the diffusion of innovations with paragraph vector topic models. (MAGKS Joint discussion paper series in economics, 2018, 15). Marburg: Philipps-University Marburg, School of Business and Economics. http://hdl.handle.net/10419/200671
APA Citation style: Lenz, D., & Winker, P. (2018). Measuring the diffusion of innovations with paragraph vector topic models. (MAGKS Joint discussion paper series in economics, 2018, 15). Philipps-University Marburg, School of Business and Economics. http://hdl.handle.net/10419/200671