Srinivasan Hariharan
Srinivasan Hariharan
Srinivasan Hariharan is a seasoned data engineer with over 9 years of experience in the information technology industry. He has a strong background in big data implementation both in the cloud and on premises. With a Cloudera Certified Developer for Apache Hadoop (CCDH) certification, Srinivasan has worked on projects involving data lake implementation in the Azure cloud and Hadoop ecosystem. He has extensive experience in data engineering projects and has developed MapReduce and Spark programs to process data, including the processing of over 100TB of data. He has also worked with different file formats, such as Avro and Parquet, and developed complex ETL applications in Spark. Srinivasan has experience in workflow orchestration using Airflow, query optimization, and performance tuning. With a strong interest in machine learning, he has developed ML models using Spark MLlib and worked on small-scale machine learning projects using Spark and H2O.