Big Data has moved to the “Trough of Disillusionment” phase in the “Hype Cycle for Emerging Technologies” by Gartner released last month. Also, Data Science was added in the chart, bordering between the “Innovation Trigger” and “Peak of Inflated Expectations” stage. Despite the first-time appearance in the cycle, Data Science has a long history of evolution, arguably for more than 50 years. For example, the “Data Science Journal” and “Journal of Data Science” were launched over 10 years ago. It seems to me that a better term for this era is “Big Data Science” for the discipline expanded and applied in the Big Data space. To that, I’d have 3 auxiliary V-words for Big Data:
- Verbiage: There are a variety of manners or styles of expressing something in words. It is hard to analyze the unstructured data like tweets and blogs in the traditional way. Big Data solutions provide better lexical analysis techniques and tools to efficiently find useful information in a massive quantity of data, e.g. hot topics, likes, sentiments, trending, categorization, relation, clustering, patterns, etc.
- Voice: Voice-driven commands and searches are becoming norm, like Siri and Google Now. Speech recognition and interpretation are used in Watson, which beat the top champions in Jeopardy. Audio mining and speech analytics help analyze the customers in real time or recording to gather information, improve client interactions, and extract critical business intelligence buried in the call center. It is leveraged to identify unsatisfied customers or users in frustration, and also help allocate right-level resources with appropriate training or coaching.
- Visual: Big Data deals with data types beyond the traditional text, such as image and video data. Faces can be detected in photos and tagged to sort and group pictures. Image search engines are also available to find photos online. Big Data enables more effective video surveillance and tracking. Facial recognition systems are used in airports to monitor passing individuals, match passport holders, and run fully automated border controls.
For more information, please contact Tony Shan (blog@tonyshan.com) or leave your comments below.
©Tony Shan. All rights reserved. All standard disclaimers apply here.

No comments:
Post a Comment