π Data Engineer | Cloud & Big Data Enthusiast | Automation & AI Explorer
- π Currently working as a Founding Data Engineer at Blooprint.in
- β‘ Passionate about building scalable ETL pipelines, data lakes, and automation solutions
- βοΈ Skilled in AWS, Azure, Databricks for data engineering and analytics
- π€ Exploring LLMs, LangChain, and Generative AI to bring intelligence into data workflows
- π Love cloud cost optimization β reduced infra costs by 85% in production
- π± Constant learner of cutting-edge data engineering, AI, and ML solutions
Languages: Python, SQL, PySpark, Bash
Cloud & Data Platforms: AWS (EC2, S3, Lambda, SQS), Microsoft Azure (ADF), Databricks
ETL & Orchestration: Prefect, Apache Airflow, Celery
Frameworks & Tools: Django, Streamlit, LangChain, LangGraph
Analytics & Visualization: Apache Superset, Power BI, Pandas
Databases: PostgreSQL, DuckDB, Delta Tables
DevOps: Docker, Docker Compose, Git
- ποΈ Chat with PDF: LangChain-powered Q&A over PDFs with vector search
- ποΈ Data Lake & Analytics Platform: S3-based data lake (20GB daily), fast analytics with DuckDB, cost-optimized AWS infra
- π Report Center Automation: Reduced report latency from 6β24 hrs to 10 secβ5 min, empowering cross-platform insights
- πΌ LinkedIn
- π§ Email: [email protected]
- π₯οΈ GitHub
βοΈ Always excited to collaborate on open-source data engineering and AI projects!