Sathyavarthan Balachandar

Data Engineer

Summary

Data Engineer with 3+ years of experience building automated data systems that help companies make better business decisions. I specialize in creating reliable data pipelines, organizing large datasets, and building dashboards that track important business metrics.

Certifications

Work Experience

Data Engineer

Fidelity Investments - Boston, MA

May 2025 - Present

  • Built automated systems that process large financial datasets, making data available for business analysis faster and more reliably
  • Designed organized data storage systems that make it easier to find and analyze financial information across the company
  • Implemented quality checks that catch data errors automatically, reducing manual fixes by 30%
  • Created dashboards using Tableau, Power BI, and Looker that track key business metrics, helping teams make data-driven decisions 20% faster

Data Engineer Intern

Castor Health Institute - Sterling, IL

Jan 2025 - Apr 2025

  • Built automated workflows that move healthcare data from various sources into organized storage, reducing processing time by 55%
  • Created real-time compliance dashboards that help healthcare teams monitor regulatory requirements and identify risks early
  • Improved data organization systems, making queries run 60% faster for analysts tracking 20+ health metrics
  • Implemented AI-powered tools that automatically detect data anomalies, reducing manual investigation work by 65%

Teaching Assistant - Business Intelligence and Analytics, Database Design

Northeastern University College of Engineering - Boston, MA · Hybrid

Apr 2024 - Dec 2024

  • Supported graduate-level coursework in database design by mentoring students on normalization, schema optimization, and transactional integrity using Oracle SQL and PL/SQL
  • Guided students through assignments and projects, helping translate database theory into practical, scalable design decisions
  • Assisted in end-to-end business intelligence coursework, covering data profiling, cleaning, transformation, dimensional modeling, and Slowly Changing Dimensions (SCD)
  • Prepared and reviewed course materials including assignments, quizzes, and capstone BI projects to reinforce real-world ELT and analytics workflows

Data and Software Engineering Intern

Protectt.ai - India

Jul 2023 - Aug 2023

  • Developed high-performance backend services using Java Spring Boot for real-time data processing pipelines, achieving 100ms latency at scale with reactive programming and asynchronous processing patterns
  • Led the design of a custom JWT authentication flow in Spring Security, including credential validation, token generation, request filtering, and role-based authorization for protected APIs
  • Optimized application performance through Spring Boot Actuator monitoring, implementing custom metrics, health checks, and profiling critical endpoints, reducing response times by 35% and memory footprint by 40%

Data Analyst

Capgemini - India

Jun 2021 - Jul 2023

  • Analyzed large healthcare datasets to identify trends in patient outcomes and care efficiency, supporting AI research initiatives
  • Built 15+ interactive Power BI dashboards that translated complex healthcare data into clear visual insights for stakeholders
  • Processed and organized large datasets to enable scalable analytics for digital health transformation projects
  • Created standardized business metrics that helped track healthcare performance and transformation readiness

Featured Projects

FRED Economic Data Pipeline

Real-time financial analytics platform that processes Federal Reserve Economic Data. Features automated data processing, treasury spread analytics, and interactive dashboards for economic forecasting.

FRED Economic Data Pipeline Architecture
PythonSnowflakeAWS LambdaGCP StorageStreamlitPlotly

SEC Financial Data Pipeline

Automated system that extracts financial data from SEC reports and transforms it into organized, queryable format. Built with modern cloud technologies including Snowflake, dbt, Apache Airflow, and AWS.

SEC Financial Data Pipeline Architecture
PythonApache AirflowdbtSnowflakeAWSFastAPIStreamlit

NVIDIA Earnings Intelligence Platform

AI-powered system that processes and analyzes NVIDIA quarterly financial reports. Uses advanced document processing and vector storage to enable intelligent search across multiple quarters of financial data.

NVIDIA Earnings Intelligence Platform Architecture
Azure DatabricksApache AirflowSeleniumPineconeChromaDBRedisFastAPIDocker

Education

Master of Science in Data Architecture & Management

Northeastern University, Boston

Bachelor of Engineering in Computer Science

Anna University, Chennai