fbpx
How Apache Spark Became A Dominant Force In Analytics

Launched in 2009, Apache Spark has become the dominating big data platform. Spark’s diverse portfolio ranges from assisting banks, telecommunications and gaming companies to serving the giants like Apple, Facebook, IBM, and Microsoft. Out of the box, Spark can run…

How Graph Processing Gets A Makeover With Hadoop

Graph analytics has been in use since decades to provide strength and direction of a relationship between objects in a graph. It has multiple pathways to function, some of which include clustering, cutting, partitioning, searching, shortest path, widest path and…

Data Engineering 101: Top Tools And Framework Resources

In today’s fast-paced world, data can be compared to DNA — with data, it is easy to understand the past, predict the future and also replicate what it contains. Back in the early 2000s, the amount of data collected was…

Will HarperDB Replace Hadoop In The Near Future?

There are a plenty of options when you want to switch over to Big Data from traditional data-handling software — for example Relational Database Management Systems (RDBMS) provided  by IBM, Oracle among others — but they lack the capability to…

Exploring the Analytics ecosystem within IBM with Arvind Shetty, Director at IBM Analytics

Arvind Shetty brings with him over three decades of IT industry experience, and has held diverse set of roles at IBM. Arvind joined IBM in 2003, and his assignments have included leadership of the IBM Java Technology Center, the ISL…

Document Classification using Apache Spark in Scala

Email Spam Identification, category classification of news and organization of web pages by search engines are the modern world examples for document classification. It is a technique to systematically classify a text document in one of the fixed category, or…

Over 100,000 people subscribe to our newsletter.

See stories of Analytics and AI in your inbox.