Skip to content
Head to Head…

Head to Head…

Hadoop, Bigdata, Robotics, Databases, Python, Shell Script

  • Home
  • Blog Site
  • Projects/Demo
  • Robotics Blog
  • About
  • JRuby code to purge data on Hbase over Hive table…
Proudly powered by WordPress | Theme: Blogpecos by Profoxstudio.

Tag: hadoop

Analytics, Apache Spark, Apache Storm, Bigdata, Hadoop

Apache Storm key takeaways…

Hadoop moves the code to the data, Storm moves the data to the code. This […]

Mukesh Kumar - August 23, 2018September 13, 2018 on Apache Storm key takeaways… Apache Storm, bigdata, hadoop, StreamProcessing
Analytics, Data Science, Exploratory Data Analysis, Hadoop

Approach to execute Machine Learning project, “Halt the Hate”…

Disclaimer: The analysis was done in this project touches a sensitive issue in India. So […]

Mukesh Kumar - July 26, 2018July 31, 2018 on Approach to execute Machine Learning project, “Halt the Hate”… big data, Data Science, Exploratory Data Analysis, hadoop, Machine Learning
Hadoop

Fundamantals of Apache Spark…

You can view my other articles on Spark RDD at below links… Apache Spark RDD […]

Mukesh Kumar - July 9, 2018August 23, 2018 on Fundamantals of Apache Spark… Apache Spark, big data, hadoop, spark
Analytics, Hadoop

Advertisement attributes or Ad Attributes…An Idea!!!

Some time ago i was working on an idea called as Ad Attributes or Advertisement attributes. I’d […]

Mukesh Kumar - October 12, 2017March 14, 20202 Comments on Advertisement attributes or Ad Attributes…An Idea!!! analytics, bigdata, hadoop
Bigdata, HDP Search, Solr, SolrCloud

SolrCloud vs HDPSearch…

Let us start to remove some confusion we have related to SolrCloud and HDPSearch. First […]

Mukesh Kumar - July 19, 2017 on SolrCloud vs HDPSearch… bigdata, database, hadoop, HDP Search, HDP2.5, HDP2.6, Solr, Solr Cloud
Apache Spark, Hbase

Multiple WAL in Apache HBase 1.3 and performance enhancements!!!

Apache HBase 1.3.0 was released mid-January 2017 and ships with support for date-based tiered compaction […]

Mukesh Kumar - March 14, 2017March 14, 2017 on Multiple WAL in Apache HBase 1.3 and performance enhancements!!! bigdata, database, hadoop, Hbase
Analytics, Bigdata, Framework, Hadoop, RHadoop

Install and smoketest R and RHadoop on Hortonworks Data Platform (HDP25-CentOS7)

Before going to Installation steps i’d like to give a small introduction on RHADOOP. What […]

Mukesh Kumar - November 21, 2016 on Install and smoketest R and RHadoop on Hortonworks Data Platform (HDP25-CentOS7) hadoop, R, RHadoop
Hadoop, Hbase, Hive

JRuby code to purge data on Hbase over Hive table…

Problem to Solve:- How to delete/update/query Binary format stored values in a HBase column family column. Hive […]

Mukesh Kumar - October 9, 2016 on JRuby code to purge data on Hbase over Hive table… hadoop, Hbase, Java, Jruby
Hadoop, Hive, Java, Pig, Python

Python and Python bites

Python and Python bites “lambda”    Hi everyone, this article show you one powerful function […]

Mukesh Kumar - October 9, 2016 on Python and Python bites bigdata, hadoop, python
Hadoop, Kylin, Security

Past and Future of Apache Kylin!!!

Short Description: Apache Kylin (Chinese: Kirin) appears, can solve the problems based on Hadoop. Article […]

Mukesh Kumar - July 21, 2016October 11, 20162 Comments on Past and Future of Apache Kylin!!! hadoop, Kylin, security
Hadoop, HDFS

Heterogeneous Storage in HDFS(Part-1)…

An Introduction of heterogeneous storage type, and the flexible configuration of heterogeneous storage! Heterogeneous Storage […]

Mukesh Kumar - June 9, 2016October 11, 2016 on Heterogeneous Storage in HDFS(Part-1)… hadoop, hdfs
Cloudera, encryption, HDFS, Security

A Step-by-Step Guide to HDFS Data Protection Solution for Your Organization on Cloudera CHD

  An enterprise-ready encryption solution should provide the following Comprehensive encryption offering wherever it resides, […]

Mukesh Kumar - June 4, 2016October 11, 2016 on A Step-by-Step Guide to HDFS Data Protection Solution for Your Organization on Cloudera CHD Cloudera, encryption, hadoop, hdfs, security
Best Practices, Hadoop, Hive

Performance utilities in Hive

Before taking you in details of utilities provided by Hive, let me explain few components […]

Mukesh Kumar - May 10, 2016October 10, 2016 on Performance utilities in Hive best practice, hadoop, hive
Best Practices, Database, Hive

Best Practices for Hive Authorization when using connector to HiveServer2

Recently we are in process of working with Presto and configuring Hive Connector to it. […]

Mukesh Kumar - April 6, 2016October 11, 2016 on Best Practices for Hive Authorization when using connector to HiveServer2 hadoop, hive, presto
Database, HPL

HPL/SQL Make SQL-on-Hadoop More Dynamic

Think about the old days when we solved many business problems using Dynamic SQL, exception […]

Mukesh Kumar - February 13, 2016October 11, 2016 on HPL/SQL Make SQL-on-Hadoop More Dynamic bigdata, database, hadoop, hpl
Hadoop, Hive, Oozie

Coding Tips and Best Practice in Hive and Oozie…

Many time during the code review found some common mistakes done by the developer. Here […]

Mukesh Kumar - January 25, 2016October 11, 2016 on Coding Tips and Best Practice in Hive and Oozie… hadoop, hive, oozie
Hadoop, HDFS

HDFS is really not designed for many small files!!!

Few of my friends new to Hadoop ask frequently what the good file size is […]

Mukesh Kumar - January 4, 2016October 11, 2016 on HDFS is really not designed for many small files!!! big data, hadoop, hdfs, hive
Hadoop, HDFS, Hive

HBase Replication and comparison with popular online backup programs…

Short Description: HBase Replication: Hbase Replication solution can solve the cluster security, data security, read […]

Mukesh Kumar - December 30, 2015October 11, 2016 on HBase Replication and comparison with popular online backup programs… hadoop, Hbase, hive
Apache Spark, Spark

Introduction to Spark

Introduction to Apache Spark:- Spark As a Unified Stack and Computational Engine is responsible for […]

Mukesh Kumar - November 10, 2015October 11, 2016 on Introduction to Spark hadoop, memcached, spark
Hadoop, Kafka

Kafka: A detail introduction

I’ll cover Kafka in detail with introduction to programmability and will try to cover almost […]

Mukesh Kumar - September 10, 2015October 11, 2016 on Kafka: A detail introduction bigdata, hadoop, kafka
Bigdata, Hadoop, NoSql

The ACID properties and the CAP theorem are two concepts in data management to distributed system.

Started working on HBase again!! Thought why not refresh few concepts before proceeding to actual […]

Mukesh Kumar - August 10, 2015October 11, 2016 on The ACID properties and the CAP theorem are two concepts in data management to distributed system. bigdata, database, hadoop, nosql
Analytics, Hadoop

Data Analysis Approach to a successful outcome

I have done data analysis for one of my project using below approach and hopefully […]

Mukesh Kumar - May 10, 2015October 11, 2016 on Data Analysis Approach to a successful outcome analytics, bigdata, hadoop

Search here and lets see if I have done something on it…

Sorted based on Categories

  • Administration (1)
  • Analytics (20)
  • Apache Spark (9)
  • Apache Storm (1)
  • Best Practices (8)
  • Bigdata (28)
  • Cloudera (1)
  • Data Science (6)
  • Database (10)
  • encryption (1)
  • Exploratory Data Analysis (6)
  • Framework (4)
  • Free Software (1)
  • Fun (2)
  • GPU (1)
  • Hadoop (51)
  • Hbase (4)
  • HbaseFcsk (1)
  • HDFS (4)
  • HDP Search (1)
  • Health (1)
  • Hive (7)
  • HPL (1)
  • Java (1)
  • Kafka (7)
  • Kylin (1)
  • Love what you like (2)
  • Machine Learning (5)
  • Messaging System (2)
  • NoSql (1)
  • Notebook (1)
  • Oozie (1)
  • open source (2)
  • Pig (1)
  • PostgreSQL (1)
  • Python (11)
  • RHadoop (1)
  • Security (3)
  • Shiro (1)
  • Solr (1)
  • SolrCloud (1)
  • Spark (2)
  • Tephra (1)
  • Tesseract (2)
  • Uncategorized (2)

Recent Posts

  • Demo Delta Lake on big data workloads…
  • My Big Data solution using AWS services…
  • Part -2: Operators teach Kubernetes how to simplify stateful application…
  • Operators teach Kubernetes how to simplify the stateful application…
  • LAMP stack in Cloud: Building a Scalable, Secure and Highly Available architecture using AWS

Recent Comments

    Archives

    • June 2020 (2)
    • May 2020 (4)
    • March 2020 (8)
    • February 2020 (1)
    • January 2020 (1)
    • November 2019 (1)
    • October 2019 (1)
    • September 2018 (1)
    • August 2018 (1)
    • July 2018 (2)
    • June 2018 (1)
    • May 2018 (1)
    • March 2018 (3)
    • February 2018 (3)
    • January 2018 (4)
    • December 2017 (4)
    • November 2017 (3)
    • October 2017 (3)
    • September 2017 (2)
    • August 2017 (4)
    • July 2017 (4)
    • April 2017 (2)
    • March 2017 (2)
    • February 2017 (3)
    • January 2017 (3)
    • November 2016 (4)
    • October 2016 (3)
    • July 2016 (2)
    • June 2016 (2)
    • May 2016 (2)
    • April 2016 (1)
    • March 2016 (1)
    • February 2016 (1)
    • January 2016 (3)
    • December 2015 (1)
    • November 2015 (2)
    • September 2015 (3)
    • August 2015 (2)
    • May 2015 (3)