LDA BASED TOPIC MODELING OF JOURNAL ABSTRACTS

dc.contributor.authorAnupriya P
dc.contributor.authorKarpagavalli S
dc.date.accessioned2020-10-13T07:42:31Z
dc.date.available2020-10-13T07:42:31Z
dc.date.issued2015-01-05
dc.description.abstractTopic modeling is a powerful technique for unsupervised analysis of large document collections. Topic models conceive latent topics in text using hidden random variables, and discover that structure with posterior inference. Topic models have a wide range of applications like tag recommendation, text categorization, keyword extraction and similarity search in the broad fields of text mining, information retrieval, statistical language modeling. In this work, a dataset with 200 abstracts fall under four topics are collected from two different domain journals for tagging journal abstracts. The document models are built using LDA (Latent Dirichlet Allocation) with Collapsed Variational Bayes and Gibbs sampling. Then the built model is used to extract appropriate tags for abstracts. The performance of the built models are analyzed by the evaluation measure perplexity and observed that Gibbs sampling outperforms CV B0 sampling. Tags extracted by two algorithms remains almost the same.en_US
dc.identifier.isbn978-1-4799-6438-3
dc.identifier.urihttps://ieeexplore.ieee.org/document/7324058
dc.identifier.urihttps://dspace.psgrkcw.com/handle/123456789/2196
dc.language.isoenen_US
dc.publisherInternational Conference on Advanced Computing & Communication Systems, held at Sri Eshwar College of Engineering, Coimbatore during 5-7 January 2015 and published in the conference proceedings, indexed in IEEE Xplore Digital Library.en_US
dc.subjectTopic modelingen_US
dc.subjectLatent Dirichlet Allocationen_US
dc.subjectGibbs samplingen_US
dc.subjectTag Recommendationen_US
dc.titleLDA BASED TOPIC MODELING OF JOURNAL ABSTRACTSen_US
dc.typeBooken_US

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
LDA BASED TOPIC MODELING OF JOURNAL ABSTRACTS.docx
Size:
10.49 KB
Format:
Microsoft Word XML
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: