Latent Dirichlet Markov allocation for sentiment analysis

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

In recent years probabilistic topic models have gained tremendous attention in
data mining and natural language processing research areas. In the field of information retrieval for text mining, a variety of probabilistic topic models have been used to analyse content of documents. A topic model is a generative model for documents, it specifies a probabilistic procedure by which documents can be generated. All topic models share the idea that documents are mixture of topics, where a topic is a probability distribution over words. In this paper we describe Latent Dirichlet Markov Allocation Model (LDMA), a new generative probabilistic topic model, based on Latent Dirichlet Allocation (LDA) and Hidden Markov Model (HMM), which emphasizes on extracting multiword topics from text data. LDMA is a four-level hierarchical Bayesian model where topics are associated with documents, words are associated with topics and topics in the model can be presented with single- or multi-word terms. To evaluate performance of LDMA, we report results in the field of aspect detection in sentiment analysis, comparing to the basic LDA model.
Original languageEnglish
Title of host publicationIn Proceeding of the Fifth European Conference on Intelligent Management Systems in Operations
Pages90-96
Number of pages6
Publication statusPublished - 2013

Fingerprint

Dive into the research topics of 'Latent Dirichlet Markov allocation for sentiment analysis'. Together they form a unique fingerprint.

Cite this