AUTOMATIC TAG RECOMMENDATION FOR JOURNAL ABSTRACTS USING STATISTICAL TOPIC MODELING

Date

2015

Publisher

Springer Advances in Intelligent Systems and Computing (AISC Series)

Abstract

Topic modeling is a powerful technique for unsupervised analysis of large document collections. Topic models represent latent topics in text as hidden random variables and discover that structure through posterior inference. They have a wide range of applications, such as tag recommendation, text categorization, keyword extraction, and similarity search, across the broad fields of text mining, information retrieval, and statistical language modeling. In this work, a dataset of 200 abstracts falling under four topics is collected from journals in two different domains for the task of tagging journal abstracts. The document model is built using LDA (Latent Dirichlet Allocation) with Collapsed Variational Bayes (CVB0) and Gibbs sampling. The built model is then used to find an appropriate tag for a given abstract. An interface is designed to extract and recommend the tag for a given abstract.
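
The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes abstracts are already tokenized, covers only collapsed Gibbs sampling (not CVB0), and uses a simple heuristic for tagging: choose the topic under which a new abstract's known words are most likely, then return that topic's highest-count words. The function names `gibbs_lda` and `recommend_tags`, the hyperparameter values, and the toy documents are all hypothetical.

```python
import math
import random


def gibbs_lda(docs, num_topics, alpha=0.1, beta=0.01, iters=200, seed=0):
    """Fit LDA by collapsed Gibbs sampling; docs is a list of token lists."""
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    V = len(vocab)
    doc_topic = [[0] * num_topics for _ in docs]       # tokens per (doc, topic)
    topic_word = [[0] * V for _ in range(num_topics)]  # tokens per (topic, word)
    topic_total = [0] * num_topics                     # tokens per topic
    assign = []                                        # topic of every token

    def update(d, v, z, delta):
        doc_topic[d][z] += delta
        topic_word[z][v] += delta
        topic_total[z] += delta

    # Start from a random topic assignment for every token.
    for d, doc in enumerate(docs):
        assign.append([rng.randrange(num_topics) for _ in doc])
        for w, z in zip(doc, assign[d]):
            update(d, word_id[w], z, +1)

    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                v = word_id[w]
                update(d, v, assign[d][i], -1)  # remove the token's current topic
                # Conditional p(z = k | all other assignments), up to a constant.
                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_word[k][v] + beta)
                    / (topic_total[k] + V * beta)
                    for k in range(num_topics)
                ]
                z = rng.choices(range(num_topics), weights=weights)[0]
                assign[d][i] = z
                update(d, v, z, +1)
    return vocab, doc_topic, topic_word


def recommend_tags(tokens, vocab, topic_word, beta=0.01, top_n=3):
    """Heuristic (an assumption, not from the paper): pick the most likely
    topic for the tokens and return its top_n highest-count words."""
    word_id = {w: i for i, w in enumerate(vocab)}
    V = len(vocab)

    def phi(k, v):  # smoothed probability of word v under topic k
        return (topic_word[k][v] + beta) / (sum(topic_word[k]) + V * beta)

    known = [word_id[w] for w in tokens if w in word_id]  # skip unseen words
    scores = [sum(math.log(phi(k, v)) for v in known)
              for k in range(len(topic_word))]
    best = max(range(len(scores)), key=scores.__getitem__)
    ranked = sorted(range(V), key=lambda v: topic_word[best][v], reverse=True)
    return [vocab[v] for v in ranked[:top_n]]


if __name__ == "__main__":
    # Toy, made-up documents; not the 200-abstract dataset from the paper.
    toy_docs = [
        ["topic", "model", "inference", "topic", "posterior"],
        ["gene", "protein", "cell", "gene", "expression"],
        ["model", "posterior", "inference", "sampling"],
        ["cell", "protein", "expression", "gene"],
    ]
    vocab, doc_topic, topic_word = gibbs_lda(toy_docs, num_topics=2)
    print(recommend_tags(["posterior", "sampling"], vocab, topic_word))
```

Because the sampler is stochastic, the learned topics depend on the seed and on the number of iterations. A practical system would also need proper inference for unseen abstracts (for example, running additional sampling sweeps over the new document with the topic-word counts held fixed) rather than the likelihood heuristic used here.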

Keywords

Topic modeling, Latent Dirichlet Allocation, Gibbs sampling, Tag Recommendation
