Obtaining Technology Insights from Large and Heterogeneous Document Collections
Title | Obtaining Technology Insights from Large and Heterogeneous Document Collections |
Publication Type | Conference Paper |
Year of Publication | 2014 |
Authors | Dey, L., Mahajan, D., Gupta, H. |
Conference Name | Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2014 IEEE/WIC/ACM International Joint Conferences on |
Date Published | Aug |
Keywords | academic research, analyzing research trends, automated topical analysis, Context, Data analysis, data mining, Data visualization, heterogeneous document collections, Hidden Markov models, indexing, information retrieval, insight generation, large heterogeneous text collections, Market research, mining patent databases, mining publications, patenting profiles, patents, publications, text analysis, topic evolution |
Abstract | Keeping up with rapid advances in research in various fields of Engineering and Technology is a challenging task. Decision makers including academics, program managers, venture capital investors, industry leaders and funding agencies not only need to be abreast of latest developments but also be able to assess the effect of growth in certain areas on their core business. Though analyst agencies like Gartner, McKinsey etc. Provide such reports for some areas, thought leaders of all organisations still need to amass data from heterogeneous collections like research publications, analyst reports, patent applications, competitor information etc. To help them finalize their own strategies. Text mining and data analytics researchers have been looking at integrating statistics, text analytics and information visualization to aid the process of retrieval and analytics. In this paper, we present our work on automated topical analysis and insight generation from large heterogeneous text collections of publications and patents. While most of the earlier work in this area provides search-based platforms, ours is an integrated platform for search and analysis. We have presented several methods and techniques that help in analysis and better comprehension of search results. We have also presented methods for generating insights about emerging and popular trends in research along with contextual differences between academic research and patenting profiles. We also present novel techniques to present topic evolution that helps users understand how a particular area has evolved over time. |
DOI | 10.1109/WI-IAT.2014.22 |
Citation Key | 6927531 |
- information retrieval
- topic evolution
- text analysis
- publications
- patents
- patenting profiles
- mining publications
- mining patent databases
- Market research
- large heterogeneous text collections
- insight generation
- academic research
- indexing
- Hidden Markov models
- heterogeneous document collections
- Data visualization
- Data mining
- data analysis
- Context
- automated topical analysis
- analyzing research trends