SUBJECT IDENTIFICATION // DATA ENGINEERING OPERATIVE
SANGARSHAN
REDDY KARRA
// SR. SOFTWARE ENGINEER → DATA ENGINEER · 8+ YEARS IN THE FIELD
High-reliability operative specializing in data-intensive Python backend systems and large-scale pipeline architecture. Deployed in safety-critical environments — real-time telemetry monitoring of 2,500+ active locomotives. Deep expertise in Kafka, PySpark, ETL workflows, and AWS infrastructure. Now directing all capabilities toward Data Engineering.
CAPABILITY MATRIX
// PROFICIENCY LEVELS · VERIFIED ACROSS ACTIVE DEPLOYMENTS
MODULE_01 // CORE LANGUAGES
MODULE_02 // DATA & PIPELINES
MODULE_03 // CLOUD & INFRA
MODULE_04 // SYSTEMS
MISSION LOG
// OPERATIONAL HISTORY · FIELD DEPLOYMENTS
CSX TECHNOLOGY · JACKSONVILLE, FL · OPERATION: IRON FLEET
- Architected Python backend services powering real-time telemetry monitoring across CSX's fleet of 2,500+ active locomotives — core of federal PTC compliance, 24/7 uptime.
- Built Pandas and PySpark ETL pipelines processing hundreds of thousands of locomotive telemetry records daily, serving 8 downstream reporting and monitoring systems.
- Cut data retrieval latency by 25% — 3 minutes down to under 2 — by rewriting stored procedures and optimizing SQL across high-volume pipelines.
- Led Python 2.6 → 3.12 migration across 100K+ line codebase in under 6 months. Zero production incidents. Retired end-of-life runtime, unblocked modern tooling.
- Engineered multithreaded data ingestion system concurrently tracking health metrics across 2,500+ locomotives for near real-time anomaly detection.
- Defined XSD message contracts adopted as the standard across 8 cross-functional teams.
- Raised unit test coverage from below 60% to 80%+, measurably reducing production defects.
- Implemented AWS KMS-based encryption for data at rest and in transit. Primary responder for security certificate failures in production.
- Mentored 2 engineers from internship through full-time conversion — code reviews, design sessions, pair debugging.
GUARDIAN LIFE INSURANCE · BETHLEHEM, PA · OPERATION: SENTINEL
- Automated EC2 inventory reconciliation across 1,000+ instances using boto3, eliminating 5 hours/week of manual audit work.
- Unified Zenoss, CMDB, and AWS data into a single reconciliation pipeline — proactive identification of configuration drift before escalation.
- Designed infrastructure gap reporting workflows delivering actionable dashboards to operations teams daily.
- Built Python automation suite replacing recurring manual tasks across dev, staging, and production environments.
ACTIVE PROJECTS
// FIELD OPERATIONS · BUILDING IN PUBLIC
CLASSIFIED DOSSIER — INTERACTIVE RESUME
This very page. A sci-fi mission-dossier-themed personal portfolio. Features a boot sequence, glitch FX, animated HUD corners, capability matrix with scroll-triggered bars, and a live mission log. Hosted on GitHub Pages.
LLM-POWERED DATA PIPELINE
An AI-assisted data ingestion and transformation pipeline using large language models, Kafka, and Airflow. Details redacted until deployment.
AI AGENT: DATA QUALITY MONITOR
An autonomous AI agent that monitors data pipelines for anomalies and quality drift — alerts, auto-remediates, and logs incidents. Architecture classified.
ACADEMIC RECORD
// VERIFIED CREDENTIALS
MASTER OF SCIENCE
Electrical Engineering
University of Missouri — Kansas City
JAN 2016 – JUL 2017 · KANSAS CITY, MO
BACHELOR OF TECHNOLOGY
Electronics & Communication Engineering
Kakatiya Institute of Technology & Science
AUG 2011 – MAY 2015 · WARANGAL, INDIA
ESTABLISH CONTACT
Seeking Data Engineering missions. All transmissions encrypted. Response guaranteed within 24 hours.