Kumar Chinnakali – dataottam

↧

10 Key Features in Apache Storm 1.0.0

April 26, 2016, 7:06 am

10 Key Features in Apache Storm 1.0.0 The Apache Storm community recently announced the release of Apache Storm 1.0.0 stable. This is a noteworthy release that delivers several powerful features that...

View Article

11 Key Tuning Checklists for Apache Hadoop!

April 27, 2016, 8:02 pm

11 Key Tuning Checklists for Apache Hadoop! Apache Hadoop is a well know and de-facto framework for processing large big data sets through distributed & parallel computing. YARN(Yet Another...

View Article

Image may be NSFW.
Clik here to view.

Self-Learn Yourself Scala in 21 Blogs – #5

April 30, 2016, 8:52 pm

Self-Learn Yourself Scala in 21 Blogs – #5 Blog 5 – Does functional programming matters and what are monads? Missed the previous blogs have a quick look with Self-Learn Yourself Scala in 21 Blogs (#1,...

View Article

Image may be NSFW.
Clik here to view.

Self-Learn Yourself Apache Spark in 21 Blogs – #8

May 8, 2016, 4:09 am

Self-Learn Yourself Apache Spark in 21 Blogs – #8 In this blog let us discuss on How to loading data, what is Lambdas, How to do Transforming Data and more on Transformations. And want to have quick...

View Article

The 8th Habit of Highly Effective Big Data Programmers !

June 1, 2016, 6:56 pm

The 8th Habit of Highly Effective Big Data Programmers ! Last week I read a book called “The Seven Habits of Highly Effective Big Data Programmers” by Rekha Joshi which is interesting. Happy to share...

View Article

Image may be NSFW.
Clik here to view.

Relationship between MapReduce, Spark, YARN, and HDFS!

June 2, 2016, 10:33 am

Relationship between MapReduce, Spark, YARN, and HDFS ! In Big Data era Hadoop is the de facto standard for developing of big data applications by using MapReduce framework. And Hadoop is composed of...

View Article

Image may be NSFW.
Clik here to view.

Scalable Apache Spark Solution to Big Data Secondary Sort Problem! – Part 1

June 3, 2016, 6:56 pm

Scalable Apache Spark Solution to Big Data Secondary Sort Problem! – Part 1 In Big Data era the secondary sort problem is relates to sorting values associated with a key in the reduce phase. It can be...

View Article

Image may be NSFW.
Clik here to view.

Scalable Apache Spark Solution to Big Data Secondary Sort Problem! – Part 2

June 7, 2016, 5:13 am

Scalable Apache Spark Solution to Big Data Secondary Sort Problem! – Part 2 In Part -1, we have discussed about the Spark solution to Secondary for larger data sets. Now let’s deep dive in Choice #2...

View Article

Image may be NSFW.
Clik here to view.

The Pyramid of Internet of Things (IoT)

July 9, 2016, 12:59 pm

The Pyramid of Internet of Things (IoT) Alright, what is Internet of Things (IoT) ? How does it differ from Internet of Everything? What is M2M ? All the above queries would be running in your mind if...

View Article

Top 11 Apache Hadoop YARN Frameworks

July 10, 2016, 10:26 am

Top 11 Apache Hadoop YARN Frameworks Part of the core Hadoop project, YARN is the architectural center of Hadoop that allows multiple data processing engines such as interactive SQL, real-time...

View Article

Self-Learn Yourself Scala in 21 Blogs – #6

August 8, 2016, 8:44 pm

Self-Learn Yourself Scala in 21 Blogs – #6 Blog 6 – Recursion and Tail Recursion in Functional Programming. Missed the previous blogs have a quick look with Self-Learn Yourself Scala in 21 Blogs (#1,...

View Article

Image may be NSFW.
Clik here to view.

Tuning Handbook of Apache Kafka!

August 11, 2016, 7:21 pm

Tuning Handbook of Apache Kafka! We all know the power and advantages of Apache Kafka. It is publish-subscribe messaging system which basically has three major components Apache Kafka Consumer Apache...

View Article

Image may be NSFW.
Clik here to view.

The 9 Key steps to implement Big Data DevOps

August 18, 2016, 8:13 am

The 9 Key steps to implement Big Data DevOps ! Per WiKi Definition: DevOps (a clipped compound of development and operations) is a culture, movement or practice that emphasizes the collaboration and...

View Article

Top 16 Hadoop Built-in Ingress and Egress Tools !

August 29, 2016, 11:02 am

Top 16 Hadoop Built-in Ingress and Egress Tools ! Hadoop has revolutionized data ingestion, data processing and enterprise data warehousing, but its explosive growth has come with a large amount of...

View Article

3 Solutions for Big Data’s Small Files Problem !

September 9, 2016, 1:56 am

3 Solutions for Big Data’s Small Files Problem ! In this we will be discussion on the efficient solutions to the “small files” problem. And what is a small file in a Big Data Hadoop environment? In the...

View Article

Self-Learn Yourself Scala in 21 Blogs – #7

September 9, 2016, 2:31 am

Self-Learn Yourself Scala in 21 Blogs – #7 Missed the previous blogs have a quick look with Self-Learn Yourself Scala in 21 Blogs (#1, #2, #3, #4, #5, #6). In this blog let’s understand evaluation...

View Article

Image may be NSFW.
Clik here to view.

A First Look at Big Data Apache Flink!

September 17, 2016, 8:55 pm

A First Look at Big Data Apache Flink! There is abundance of interest in learning how to analyze streaming data in large-scale systems, partly because there are situations in which the time-value of...

View Article

Image may be NSFW.
Clik here to view.

Big Data Splunk’s Best & Better Practices !

October 9, 2016, 1:40 am

Big Data Splunk’s Best & Better Practices ! Introduction to Splunk We see servers, devices, apps, logs, traffic, and clouds. We see data, big data, and fat data everywhere. Splunk offers the...

View Article

Image may be NSFW.
Clik here to view.

The 7 Habits Of Successful Big Data and NoSQL Projects by Ben Lorica

October 9, 2016, 10:29 am

The 7 Habits Of Successful Big Data and NoSQL Projects by Ben Lorica ! Let’s have coffee@dataottam.com

View Article

Image may be NSFW.
Clik here to view.

What is Beyond Classic Hadoop? Is it Spark and Flink?

October 10, 2016, 12:50 pm

What is Beyond Classic Hadoop? Is it Spark and Flink? In this blog, we will explore the two new big data friends to Hadoop, and they are Apache Spark and Apache Flink. MapRedcue, Tez, Spark, and Flink...

View Article