Harish Padmanabhan

Data Engineer & ML/AI Engineer
delivering data-driven solutions.

Hi! I'm Harish - A Passionate

ABOUT ME

Engineering robust data & AI solutions for real-world impact 🚀

Hi, I’m Harish Padmanabhan — a Data Engineer, Machine Learning Engineer, and aspiring AI Engineer. As an M.S. Data Analytics Engineering student at Northeastern University (’26), I design scalable data architectures, develop machine learning models, and integrate AI-driven solutions into production. My toolkit includes Airflow, AWS, SQL/NoSQL, and Python ML frameworks. I’m passionate about building pipelines, predictive systems, and AI applications that deliver measurable results.

Harish Padmanabhan portrait
Northeastern University — MS DAE ’ 26

My Data · ML · AI Stack

Tools I Use

Degrees Received

Northeastern University logo

Northeastern University

M.S. in Data Analytics Engineering (GPA: 3.81/4.00)

Sep 2024 – May 2026
Boston, MA, USA
  • Coursework: Foundations of Data Analytics, Data Management for Analytics, Computation & Visualization, Data Mining in Engineering, Large Language Models.

  • Built batch & streaming data pipelines with Airflow/AWS Glue and modeled data in PostgreSQL/Redshift.

  • Applied ML for NL2SQL, predictive analytics, and dashboarding in Python (Pandas, scikit‑learn) and Tableau/Power BI.

Anna University logo

Anna University

B.E. in Mechanical Engineering (GPA: 8.21/10.00)

Aug 2018 – Apr 2022
Chennai, TN, India
  • Coursework: Python Programming, Statistics & Numerical Methods; foundations in data analysis and computation.

  • Hands‑on projects emphasizing problem solving, quantitative analysis, and technical communication.

  • Transitioned interests toward data engineering and ML applications through electives and projects.

Work Experience

  • Data Engineer

    Infosys Limited

    Aug 2022 – Aug 2024
    Chennai, India
    • Built scalable data pipelines with Apache Airflow, AWS Glue, and S3—processing 200M+ records monthly with 40% lower ingestion latency.

    • Optimized SQL models in PostgreSQL/Redshift with indexing and partitioning, reducing query time by 55%.

    • Developed Tableau dashboards integrated with ETL workflows, improving decision-making speed by 25%.

    • Automated anomaly detection workflows with PySpark and AWS Lambda, reducing manual validation time by 70%.

    AirflowAWS GlueS3PostgreSQLRedshiftPySparkTableau
  • Data Engineer Intern

    Infosys Limited

    Aug 2022 – Jan 2023
    Chennai, India
    • Enhanced and maintained extbf{ETL pipelines} handling lease, asset, and payment data from financial systems, eliminating manual Excel errors and cutting data latency by 70%

    • Developed Python automation scripts to process multi-format API data (CSV, JSON, XML) into PostgreSQL, reducing upload time from 4 hours to 15 minutes.

    • Implemented data validation pipelines with Pandas and Great Expectations, ensuring 98% accuracy before production loads.

    AWS GlueSnowflakePostgreSQLPythonPandasGreat Expectations

Featured Projects

My Resume

Resume
Loading resume…

Get in Touch

Have a role (Data/ML/AI), project, or collab in mind? Ping me—happy to chat.