Hi There,
I'm Prashant Yadav

i am into Data Engineering|

About Me
Prashant Pramod Yadav

About Me




I'm Prashant Yadav

Data Engineer | Data Pipeline & Cloud Specialist


Hello, My name is Prashant Yadav, I’m an ambitious individual who is deeply passionate about Data Engineering, cloud platforms, scalable data pipelines and everything related to modern data infrastructure.
"My love for cloud was inspired by my passion for learning new things. As a kid I’ve always observed things differently and always looked at life as an opportunity to learn and experience, innovate in ways to help others. The reason why I chose to learn data is because it is a euphoric feeling knowing that you can create something from literally nothing. When I create, I feel like I’m the most powerful person in the world. I love when I stumble upon a task to be quite challenging and difficult to figure out because situations like that keeps my brain sharp and only makes me stronger. I just look at coding like one huge puzzle, just putting the pieces together until they all fit" - Prashant Pramod Yadav

E-mail : prashantyadav03082000@gmail.com

Place : United Kingdom, London

Education

MBA | Master of Business Administration in Data Analytics (Distinction)

University Of Hertfordshire, London

2024-2025

B.Tech | Computer Science & Engineering

MIT ADT University, India

2017-2021

Projects

Data Pipeline

Data Pipeline

YouTube-Scrapping-Challenge

YouTube-Scrapping-Challenge

Data Processing Workflow

Data Processing Workflow

Image Scraping

End-to-End Diamond Price Prediction

Image Scraping

TechRevenue-Profits

Image Scraping

End-to-End-Image-Scraping

Operation pipeline

Operation Pipeline

Sign Language Recognition

Sign Language Recognition

Prashant & Co

Prashant & Co

Experience

SCALE IT UK LTD, Greater London, United Kingdom

Data Engineer – AWS (Contract)

October 2025 - January 2026


• Architected and operated batch data pipelines on AWS, building a scalable data lake architecture on Amazon S3 for structured and curated datasets exceeding 50+ TB.

• Designed and implemented automated data validation checks within ETL pipelines to improve data consistency and quality.

• Optimized Amazon Redshift schemas, distribution keys, and query execution plans, improving analytical query performance by 22%.

• Orchestrated scheduled workflows using Apache Airflow and implemented monitoring and alerting using AWS CloudWatch, improving pipeline reliability and reducing operational downtime by 25%.

Skills: AWS · Amazon S3 · Redshift · SQL · Python · Apache Airflow · Spark · Data Modeling · ETL/ELT

Centiro Solutions Pvt Ltd, India

Associate Engineer – Data Systems

December 2021 - December 2023


• Engineered structured data flows between application services and relational databases, supporting high-volume transactional systems processing 5M+ monthly records.

• Authored and optimized 100+ complex SQL queries, improving response times by 40% through indexing strategies and execution plan tuning.

• Implemented transformation logic and reconciliation routines to ensure accuracy and consistency across interconnected data systems.

• Led root cause analysis for production-level data issues, improving system stability and reducing database downtime.

Skills: SQL · Database Optimization · Data Transformation · Data Integrity · Data Systems


Articles & Blogs

Statistics

Your First Tableau Dashboard

Python

Pandas/Matplotlib vs R

Flask

Why Docker

Life Cycle of Machine Learning

CeX is always a thrill for tech enthusiasts

Feature Engineering?

Hyperparameter Tuning

K Nearest Neighbors

Statistical Methods in Machine Learning

Spam E-mail

Automatically detect toxic messages and content

Mojo

What is Tox?

Abilities & Skills


AWS (S3, Redshift, Glue, Lambda, EC2, IAM, CloudWatch)
Cloud-Native Architecture
Data Lake Architecture (Amazon S3)
ETL / ELT Development
Batch Data Processing
Data Pipelines
Data Modeling (Star / Layered Models)
Data Quality & Validation
Pipeline Monitoring & Reliability
Apache Airflow
Apache Spark
PySpark
SQL
SQL Performance Tuning
Python
Amazon Redshift Optimization
Amazon RDS
MySQL
Docker
Git & GitHub