Big Data: Challenges and roadblocks

Big data is the big buzzword these days. The term refers to collections of data sets too large and complex to be processed by standard tools. It is also the art and science of combining enterprise data, social data and machine data to derive new insights that would otherwise not be possible, and of combining past data with real-time data to predict or suggest outcomes for the current or future context.

The digital footprint is progressively expanding the world over, across fragmented mediums (blogs, tweets, reviews, etc.) and technologies (mobile, web, cloud/SaaS, etc.).

Digital landscape in India

India’s digital landscape, too, may be evolving quickly, but overall penetration remains low, with only 1 in 5 Indians using the Internet as of July 2014.

In India, enterprises and businesses have access to a veritable wealth of information. Though some of the larger organisations have made a start in harnessing it, most Indian companies are still learning how to collect and store big data.

Telecom providers, online travel agencies and online retail stores are among the industries using big data analytics to engage customers in some ways.

However, big data analytics is still in its infancy in India. Most companies are still learning to store the data they collect, and there are several challenges in collecting the data sets themselves. Both past and current data are required to make big data analytics really useful, but past data is scarce in both the public and private sectors in India. Some of the reasons for this lack of data are:

Yet to be fully computerised

Healthcare, economic and statistical data in both the private and public sectors in India is yet to be fully computerised. The main reason is the late adoption of IT in India. Unlike in the West, most industries in India made the transition from manual records to computerised information systems only during the last decade.

Over the years, the state and central ministries have moved towards e-governance, and efforts are being made to deliver public services and make them easier to access. While this is still a work in progress, huge amounts of data across many government sectors are yet to be digitised.

Quality of data

In big data analytics, data sufficiency plays a critical role when samples are run across different dimensions: each sample needs enough data points to support the analysis. Beyond quantity, the quality of the data being crunched also influences the quality of the insights. If the signal-to-noise ratio is low, the accuracy of results suffers, and it varies further when data samples are less than optimal. In a country like India, there is very little information about individuals, because Indians tend not to be very expressive, especially on public forums.

Public social media information available for most individuals in India lacks quality information about the users themselves. Random facts and figures in individual profiles, sharing of spam content, and fake social media accounts created for bots are very common in India.

Spam

Social media sites are becoming increasingly vulnerable to spam attacks. The time a captive audience spends on social media sites opens up windows of opportunity for online threats and spammers.

Again, social media spam adds noise, lowering the signal-to-noise ratio that defines the quality of big data. This reduces the reliability of results.
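To make the signal-to-noise idea concrete, here is a minimal sketch in Python. It assumes a hypothetical `Profile` record with `is_bot`, `is_spam` and `fields_filled` attributes; none of these names come from a real platform or from any tool described here, and the completeness threshold is an arbitrary illustration.

```python
from dataclasses import dataclass


@dataclass
class Profile:
    """Hypothetical social media profile; every field is an illustrative assumption."""
    user_id: str
    is_bot: bool
    is_spam: bool
    fields_filled: int  # number of profile fields with plausible values


def is_signal(profile: Profile, min_fields: int = 3) -> bool:
    """Count a profile as signal if it is not a bot, not spam, and reasonably complete."""
    return (
        not profile.is_bot
        and not profile.is_spam
        and profile.fields_filled >= min_fields
    )


def signal_to_noise_ratio(profiles: list[Profile]) -> float:
    """Ratio of usable profiles to unusable ones; infinite when there is no noise."""
    signal = sum(1 for p in profiles if is_signal(p))
    noise = len(profiles) - signal
    return float("inf") if noise == 0 else signal / noise


profiles = [
    Profile("u1", is_bot=False, is_spam=False, fields_filled=5),
    Profile("u2", is_bot=True, is_spam=False, fields_filled=4),
    Profile("u3", is_bot=False, is_spam=True, fields_filled=2),
    Profile("u4", is_bot=False, is_spam=False, fields_filled=1),
    Profile("u5", is_bot=False, is_spam=False, fields_filled=3),
]
ratio = signal_to_noise_ratio(profiles)
print(f"signal-to-noise ratio: {ratio:.2f}")
```

In this toy sample, bots, spam accounts and near-empty profiles outnumber usable ones, so the ratio falls below 1. That is the situation in which insights drawn from the data become unreliable.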

Cultural and social influences

In most Western markets, insights generated through big data can be applied across the whole consumer base. However, given the extensive cultural and linguistic variation across India, an insight generated for a consumer based in Chandigarh, for example, will not be directly applicable to a consumer based in Chennai. The problem is made worse by the fact that a lot of local data lives in regional publications, in different languages, and has very limited online visibility.

Unstructured data leads to mapping issues

Big data in India is not structured. Most transactional data in the healthcare and retail segments is stored purely for bookkeeping purposes. It carries very few of the attributes that would help big data analytics map enterprise-generated transactional data to public information.

In developed countries, user data is rich enough to provide demographic or group-level markers that can be used to generate customised insights while maintaining individual privacy. The lack of such standard identifiers in Indian consumer data is one of the biggest bottlenecks in mapping transactional and social records in India.
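The mapping problem can be illustrated with a minimal Python sketch. It assumes two hypothetical group-level markers, `city` and `age_band`, and a made-up table of public segment information; the field names and values are illustrative only and do not describe any real data set.

```python
# Hypothetical transactional records; marker fields may be missing (None).
transactions = [
    {"txn_id": 1, "amount": 1200, "city": "Chennai", "age_band": "25-34"},
    {"txn_id": 2, "amount": 450, "city": None, "age_band": None},
    {"txn_id": 3, "amount": 800, "city": "Chandigarh", "age_band": "35-44"},
    {"txn_id": 4, "amount": 300, "city": "Chennai", "age_band": None},
]

# Hypothetical group-level public information keyed by the same markers.
public_segments = {
    ("Chennai", "25-34"): {"top_interest": "travel"},
    ("Chandigarh", "35-44"): {"top_interest": "home appliances"},
}


def map_to_segments(txns, segments, markers=("city", "age_band")):
    """Attach group-level info to each transaction that carries every marker."""
    mapped, unmapped = [], []
    for txn in txns:
        key = tuple(txn.get(m) for m in markers)
        if None not in key and key in segments:
            # Only group-level attributes are attached; no individual is identified.
            mapped.append({**txn, **segments[key]})
        else:
            unmapped.append(txn)
    return mapped, unmapped


mapped, unmapped = map_to_segments(transactions, public_segments)
print(f"mapped: {len(mapped)}, unmapped: {len(unmapped)}")
```

Records stored purely for bookkeeping, like transactions 2 and 4 here, carry no complete set of markers and so cannot be joined to public information at all, however much of that information exists.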

Handsets and internet connectivity

Even though smartphones are driving the new handset market in India, feature phones still dominate everyday usage. Most connections in India are pre-paid, and fewer than 10% of users have access to 3G networks. In addition, internet connection speeds are amongst the lowest in Asia. As a result, consumer data, especially retail enterprise data, is limited.

As more people in India move to smartphones and internet connectivity improves, the amount of usable data generated will increase. Because big data analytics is still in its infancy in India, organisations and enterprises will need to make huge efforts to improve the quality of data. However, key contributors to the promise of big data analytics in India are steadily gaining ground. An increase in social media users, together with efforts by public and private enterprises to collect and store transactional enterprise data well, will produce better-quality data sets and allow better application of big data analytics.

Srikant Sastri

Srikant Sastri provides strategic leadership as the co-founder of Crayon Data, the ambitious Singapore-headquartered start-up that aims to build a global business around a Big Data platform and products. This is his third start-up. Srikant is a seasoned entrepreneur and bridge-builder. In 1995 he founded Solutions – Digitas, India and Southeast Asia’s largest CRM & Digital agency. The business achieved scale (2000 employees) and market leadership before being acquired by the Publicis Groupe. In his most recent role as VivaKi India Chairperson, Srikant led an ambitious M&A strategy to help the Publicis Groupe establish digital leadership in India through three cutting-edge acquisitions. Respected as a leading marketing & CRM practitioner, Srikant started his career at Unilever and McCann – Erickson, and was recently inducted into the DMAi ‘Hall of Fame’. He has been on the jury at Cannes Advertising Festival. At Crayon Data, Srikant loves being in the midst of crazy ideas, energetic people, and ambiguity. He works closely with and nurtures several tech start-ups, and is an active angel investor. Srikant holds an undergraduate degree in engineering and an MBA. His other interests include current affairs, contemporary history, and economic development.