Currently
Senior Software Engineer @ PG&E
Based in
Santa Clara, CA

Ajay
Kannan.

I build production data platforms, ML pipelines, and generative AI systems — where reliability matters as much as intelligence.

Portrait of Ajay Kannan
Role
Senior Software Engineer
01 / Summary
/ Profile

In one paragraph

Senior software engineer with a strong data-engineering and machine-learning background. Delivered a production data platform for a major utility — boosting processing throughput by 20% and cutting system errors by 15% through automated pipelines and monitoring. Built image-analysis models hitting 94% accuracy on medical data, with deep experience in Python, PyTorch, and scalable backend services. Looking to apply this blend of data-platform and AI work to drive reliable, high-performance solutions.

02 / About
/ Introduction

The short version

Biography

Born in Chennai. Educated at Anna University and Arizona State. Now building in the Bay Area.

The thread that connects IoT sensors, gesture recognition, medical imaging, and utility-scale data platforms is the same one — curiosity about systems that have to work in the real world.

I grew up in Chennai, where I first fell in love with building things — wiring Arduino boards, training accelerometers to read sign language gestures, and leading an IoT research group before I had a degree to show for it.

That curiosity carried me to two summers at Microsoft, where I shipped sensor-driven ML for agriculture and helped integrate Cortana with a shopping platform — my first taste of building at scale, and of the gap between a model that works in a notebook and one that works in production.

I crossed continents for grad school at Arizona State, researching few-shot learning for immunology — a problem where data is scarce and the stakes are real. That tension between limited signal and high consequence shaped how I think about ML systems today.

Now at Pacific Gas and Electric in the Bay Area, I design the data platform and ML infrastructure behind utility operations — pipelines, anomaly detection, observability, and increasingly, generative AI that has to earn its place in production. The work is less glamorous than a chatbot demo, but reliability is the product.

The journey from Chennai to Santa Clara was not a straight line. It was an odyssey — shaped by curiosity, cross-cultural leaps, and the stubborn belief that the most interesting problems live where engineering meets the real world.

03 / Papers
/ Publications

Papers & posters

04 / Career
/ Work history

Where I've built

Jul 2023 — Present
Santa Clara, CA

Pacific Gas & Electric

Senior Software Engineer
  • Designed and deployed a production data platform (Electric Productivity Tracker) for large-scale utility datasets — improved processing throughput ~20% and reduced system errors ~15% through backend automation, monitoring, and reliability engineering across distributed systems.
  • Engineered scalable, fault-tolerant data pipelines integrating heterogeneous sources (GIS, PostgreSQL, internal services) with consistent low-latency access and continuous stream processing.
  • Built automated backend validation and testing infrastructure using Python services and REST interfaces — cut deployment verification time by 50% via comprehensive unit, integration, and end-to-end suites.
  • Implemented end-to-end ETL, validation, and transformation workflows across distributed systems, improving data integrity and reducing manual ops overhead.
  • Automated extraction and processing of the Distribution Operations Toolset (DOT) — a daily-updated, macro-enabled Excel system containing nested map-creation data — using Python and JavaScript to filter new maps, extract the relevant fields, and feed them into the map-creation pipeline, eliminating manual data entry and reducing processing errors.
  • Unified three separate map-creation processes into a single streamlined workflow — consolidating fragmented operations and improving overall efficiency, consistency, and maintainability across the team.
  • Applied ML and generative AI in production to improve anomaly detection reliability — reduced false positives ~30% via model-assisted signal validation, statistical inference, and system-level monitoring.
  • Shipped observability and alerting (health checks, dashboards, centralized logging) across servers and databases — cut issue detection and resolution time by ~50%.
  • Collaborated with cross-functional teams in design and code reviews, and produced technical documentation — architecture diagrams, API specifications, and runbooks — in Confluence, accelerating feature rollout and reducing onboarding time.
Jul 2021 — Dec 2021
Bengaluru, India

Sigtuple Inc.

Data Scientist
  • Applied image processing techniques to biological data using PyTorch, implementing Siamese Networks, GANs, and Neural Style Transfer — enabling more accurate medical image analysis of cellular structures.
  • Used alpha blending to recreate histology slide images, accurately representing biological particles and membranes.
  • Achieved 94% accuracy in image blending efficiency through innovative generative architectures.
  • Built production software in Python with object-oriented design and comprehensive unit and integration tests, resulting in a stable system that reduced deployment errors.
2017 — 2018
Hyderabad, India

Microsoft

Software Engineering
  • Utilised ML algorithms for the FarmBeats agricultural platform using Python and PyTorch — identified sensor-data patterns to enhance crop-yield predictions.
  • Built a GIS-based heat-map generation system for FarmBeats — real-time spatial visualization of multi-sensor data (soil moisture, temperature, humidity).
  • Contributed to the Shopping on Cortana project, integrating Cortana with a shopping platform to improve user experience.
  • Developed backend services with Cassandra DB and C# as part of the Foundry Team in MS-IDC, building RESTful APIs for distributed data storage.
05 / Projects
/ Projects

Everything on GitHub

A complete index of my public repositories — from production-style LLM systems to graduate coursework and the occasional weekend experiment. Most recent first.

View on GitHub ↗
06 / Tools
/ What I reach for

Tools of the trade

Programming Languages

  • Python (Expert)
  • C++
  • Java
  • C#
  • JavaScript
  • SQL
  • Bash

ML / DL Frameworks

  • PyTorch
  • TensorFlow
  • Hugging Face
  • Keras
  • NumPy
  • Scikit-learn
  • Pandas

LLM & AI Tools

  • LangChain
  • LLM Integration
  • Prompt Engineering
  • RAG Pipelines

Backend Development

  • REST APIs
  • FastAPI
  • Node.js
  • Express.js
  • Spring Boot
  • Microservices
  • GraphQL

Databases

  • PostgreSQL
  • MongoDB
  • MySQL
  • Cassandra
  • NoSQL

Cloud & DevOps

  • AWS (EC2, S3, Lambda)
  • Azure
  • Docker
  • Kubernetes
  • Git
  • CI/CD

Computer Science

  • Data Structures
  • Algorithms
  • System Design
  • Distributed Systems
  • OOP
Currently Learning

JAX · AWS Bedrock · Vector Databases (Pinecone) · CUDA Programming

07 / Education
/ Schools

Where I studied

Jan 2021 — May 2023

Arizona State University, Tempe

M.S. Computer Science
Relevant Coursework
Machine LearningDeep LearningDistributed SystemsData Structures & Algorithms
Aug 2014 — May 2019

Anna University, Chennai

B.S. Computer Science
08 / Contact

Let's
build
something.

Open to conversations about ML infrastructure, generative AI in production, or anything in the neighborhood.