Sign In
Get Clay Free →

Suggestions

    Mohit Babbar

    Senior Data Engineer

    Mohit Babbar is an IT professional with over 12 years of experience in Big Data and Hadoop, specializing in providing analytical solutions to business challenges.

    He excels in integrating various frameworks to construct data pipelines connecting RDBMS databases like Oracle, MySQL, PostgreSQL, and Snowflake/BigQuery.

    Mohit has expertise in data analysis using BigQuery, Snowflake, and HIVE, along with writing custom UDF in Scala for enhanced HIVE and Scala/Python functionalities.

    His skill set includes implementing business rules using PySpark and Python, with a profound understanding of Python for data structure implementations like Arrays and Lists.

    He is proficient in implementing Streams and Snowpipe for Continuous Data Ingestion and automating data quality processes using Spark/Scala and shell scripting in Big Data projects.

    Mohit has extensive experience in processing Big data on the Hortonworks and Apache Hadoop framework using Spark/Scala.

    He is well-versed in developing Apache Spark programs using Scala for large-scale data processing, leveraging in-memory computing capabilities for faster data processing with Spark core and Spark SQL.

    Additionally, he has experience in installing, configuring, supporting, and monitoring Hadoop clusters using Apache, Airflow-DAG, and GCP Cloud Composer.

    His expertise extends to working in GCP DataProc, Bigquery, and deploying applications in GCP environments.

    Mohit has practical experience in Confluent KSQL and Kafka Data Streaming, focusing on developing Apache Kafka Streaming Applications, Kafka Avro/Parquet Schema, Dockers, and integrating Kafka with Hortonworks Hadoop.

    Furthermore, he serves as a Business Data Analyst, involved in gathering ETL/BI requirements and converting them into functional requirements in Scala and Spark.

    He demonstrates a solid understanding of Data Warehouse concepts and excels in writing complex SQL queries, dimension modeling, and performing data processing through various data types like Map, Struct, and Array.

    With a knack for data profiling to validate data quality, Mohit extensively creates Test Plans, Test Scenarios, Test Scripts, and Test Execution to align with business and functional specifications.

    His experience includes extracting data from diverse sources like oracle tables and flat files, verifying ETL Mapping Rules against source data, running SQL queries to validate loaded data, and checking reports & dashboards against fact & dimensional data for reporting compliance.

    Mohit Babbar
    Add to my network

    Location

    Dallas-Fort Worth Metroplex