ABSTRACT

The term big data originates from the fact that the datasets used are so large that typical database systems are unable to capture, save, and analyze the datasets. While traditional database systems store only structured data, big data can include semistructured and unstructured data as well. The source of data for traditional database systems is stored transactions, whereas big data can be generated from web pages, social media, and sensors embedded in machinery and common devices. Storage and searches enabled by new technologies like Hadoop and No Structured Query Language (NoSQL) allow processing of enormous datasets. Big data impacts diverse industries such as healthcare, business, industry, and government. Big data has been enabled by cloud technology and is creating a need for new skill sets in the labor market.