Skip to content
View jerinsam's full-sized avatar

Block or report jerinsam

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
jerinsam/README.md

πŸ“Œ Jerin - Data Engineering Practitioner & Solutions Manager

πŸ‘‹ About Me

Hi there! I'm Jerin Sam with 13 years of experience as a Data Engineering Practitioner, I am currently working as a Sr. Solutions Manager at Aligned Automation. I specialize in designing data architectures, workflows, and solutions that drive efficiency and innovation.

πŸ’Ό Expertise

  • Data Engineering & Architecture
    • SQL Server, PostgreSQL
    • Spark, Databricks, Kafka, ksqlDB
    • Delta Lake
  • Big Data & Cloud Technologies
    • Azure Synapse Analytics, Azure SQL, Azure Data Lake Storage (ADLS), Azure Data Factory (ADF)
  • On-Prem Data Technologies
    • SQL Server Integration Services (SSIS), Airflow, DBT
  • Visualization Technologies
    • Power BI
  • Programming Languages
    • Python
  • Customer Engagement & Project Management
    • Collaborating with business leaders, technical teams, and customers to ensure alignment on project goals.
    • Engaging customers to understand their challenges and propose tailored data solutions.
    • Driving efficiency in project execution through best practices, automation, and streamlined workflows.
    • Leveraging data-driven insights and business acumen to make strategic recommendations.

πŸš€ Current Work

  • Building a scalable and dynamic Data Quality Solution for batch & streaming datasets.
  • Implementing an Analytical Framework to enhance data insights and decision-making processes.
  • Enhancing Spark Hive Metastore Integration with PostgreSQL on open-source big data platforms.
  • Developing a Lakehouse Interoperability Layer to support multiple lakehouses (Delta, Iceberg, Hudi) on open-source big data platforms.

πŸ“š Learning & Interests

  • LLM (Large Language Models): Deepening expertise in LLM in data engineering space by implementing LLM based applications.
  • Docker & Kubernetes: Deepening expertise in containerization for data engineering
  • Data Lakehouse and Processing Engines: Exploring interoperability layer for multiple lakehouses and processing engines.
  • Data Governance & Compliance: Exploring data governance frameworks and compliance regulations.
  • Open-Source Data Tech Stack: Deepening expertise in open-source data technologies such as Apache Spark, Delta Lake, Minio, Trino and Kafka.
  • Leadership & Business Skills: Stakeholder Management, Competitive Analysis, Storytelling, and Decision-Making.

πŸ’‘ Get in Touch

πŸ“§ Email: [email protected]
πŸ’Ό LinkedIn: https://www.linkedin.com/in/jerinjsam
πŸ”— GitHub: https://github.com/jerinsam


Let's build scalable, efficient, and innovative data solutions together! πŸš€

Pinned Loading

  1. integration-spark-catalog-lakehouse integration-spark-catalog-lakehouse Public

    Integration of Spark with different catalogs and lakehouses

    Jupyter Notebook

  2. kafka-learn-hands-on kafka-learn-hands-on Public

    Learn Kafka

    Python 3

  3. sql-server-to-redis-using-kafka-connect sql-server-to-redis-using-kafka-connect Public

    DATA ENGINEERING SOLUTIONS - STREAM SQL SERVER DATA TO REDIS

    HTML 4

  4. sql-server-to-redis-using-kafka sql-server-to-redis-using-kafka Public

    Data Engineering Solutions - Push SQL Server data to Kafka

    Python 3 1

  5. airflow-learn-hands-on airflow-learn-hands-on Public

    Data Engineering - Airflow

    Python 2

  6. dbt-learn-hands-on dbt-learn-hands-on Public

    Data Engineering - DBT

    HTML