Apache Hadoop is an open-source, fast, and scalable framework that manages and processes exceptionally large volumes of data. We have already discussed Apache Hadoop and the Hadoop ecosystem in detail in a previous blog. Data scientists use Hadoop for offline or batch processing. The framework can be scaled out by adding nodes to the cluster.

Written by: Prashant Thomas

With several big data frameworks available in the market, choosing the most suitable one is a challenge. The traditional approach of comparing the strengths and weaknesses of each platform is of little help, as businesses should evaluate each framework with their own needs in mind.

Written by: Prashant Thomas

Trust is a very delicate concept. We trust instinctively, routinely and at times mindlessly in a wide variety of situations. It is the foundation on which our relationships, our society and, very much, our commerce are built. We have come to believe that to transact with you, I should trust you. To verify our trust, we use intermediaries. For example, banks ensure that we deal with the right parties with sufficient funds. We rely on lawyers to ensure that our products are not copied or distributed by unauthorised persons. In general, however, the use of intermediaries is complex, time-consuming and costly, and carries many security risks.

Written by: Prashant Thomas

The global COVID pandemic has compelled millions to stay cooped up at home and turn to the internet for work, entertainment and other necessities. This has put unprecedented stress on internet infrastructure. Initial stats indicate a surge of 50-70% in internet hits. Platforms like YouTube and Netflix have announced that they will lower video quality to reduce traffic on mobile and broadband networks.

Written by: Prashant Thomas

“The number all of us have to really pay attention to is, there will be 50 billion connected devices by 2025, that means we will have 50 billion end points that we get to really harness. Then comes the data part, these devices are expected to generate 175 zettabytes of data, a quadrupled growth, from the current 45 zettabytes,” - Satya Nadella

Written by: Prashant Thomas