Abhishek abhikashyap

Hi there, I'm Abhishek Kumar 👋

🚀 Data Engineer | Cloud & Big Data Enthusiast | Automation & AI Explorer

👨‍💻 About Me

🔭 Currently working as a Founding Data Engineer at Blooprint.in
⚡ Passionate about building scalable ETL pipelines, data lakes, and automation solutions
☁️ Skilled in AWS, Azure, Databricks for data engineering and analytics
🤖 Exploring LLMs, LangChain, and Generative AI to bring intelligence into data workflows
📈 Love cloud cost optimization – reduced infra costs by 85% in production
🌱 Constant learner of cutting-edge data engineering, AI, and ML solutions

🛠️ Tech Stack

Languages: Python, SQL, PySpark, Bash
Cloud & Data Platforms: AWS (EC2, S3, Lambda, SQS), Microsoft Azure (ADF), Databricks
ETL & Orchestration: Prefect, Apache Airflow, Celery
Frameworks & Tools: Django, Streamlit, LangChain, LangGraph
Analytics & Visualization: Apache Superset, Power BI, Pandas
Databases: PostgreSQL, DuckDB, Delta Tables
DevOps: Docker, Docker Compose, Git

📈 GitHub Stats

🔥 Featured Projects

🗂️ Chat with PDF: LangChain-powered Q&A over PDFs with vector search
🏗️ Data Lake & Analytics Platform: S3-based data lake (20GB daily), fast analytics with DuckDB, cost-optimized AWS infra
📊 Report Center Automation: Reduced report latency from 6–24 hrs to 10 sec–5 min, empowering cross-platform insights

🤝 Let's Connect

💼 LinkedIn
📧 Email: [email protected]
🖥️ GitHub

⭐️ Always excited to collaborate on open-source data engineering and AI projects!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly