Title | A New Approach to Use Big Data Tools to Substitute Unstructured Data Warehouse |
Publication Type | Conference Paper |
Year of Publication | 2020 |
Authors | Baker, Oras, Thien, Chuong Nguyen |
Conference Name | 2020 IEEE Conference on Big Data and Analytics (ICBDA) |
Keywords | Big Data, Big Data tools, Buildings, Business, composability, data mining, Data Warehouse, data warehouses, Human Behavior, human factors, Metrics, PostgreSQL, pubcrawl, Scalability, text analytics, Tools, Warehousing |
Abstract | Data warehouse and big data have become the trend to help organise data effectively. Business data are originating in various kinds of sources with different forms from conventional structured data to unstructured data, it is the input for producing useful information essential for business sustainability. This research will navigate through the complicated designs of the common big data and data warehousing technologies to propose an effective approach to use these technologies for designing and building an unstructured textual data warehouse, a crucial and essential tool for most enterprises nowadays for decision making and gaining business competitive advantages. In this research, we utilised the IBM BigInsights Text Analytics, PostgreSQL, and Pentaho tools, an unstructured data warehouse is implemented and worked excellently with the unstructured text from Amazon review datasets, the new proposed approach creates a practical solution for building an unstructured data warehouse. |
DOI | 10.1109/ICBDA50157.2020.9289757 |
Citation Key | baker_new_2020 |