surya@atx
$ whoami

Eight years shipping production systems — healthcare, then Meta.

Engineer in Austin, TX. Built a COVID warehouse spanning 46 hospitals at Prime, scaled NLP automation touching tens of millions of cases at Meta, and won Data4Good 2025 with a dual-transformer hallucination detector. Day job and research keep informing each other.

$View work$Get in touch

Selected work

// selected_work.md · 5 entries, sorted by weight
[01]2025 · Independent · National competition
Data4Good 2025 — AI Hallucination Detection
// First place, non-academic track. Dual-transformer system, 99.11% balanced accuracy.

Architected a dual-transformer system pairing DeBERTa-v3-Large (434M) and RoBERTa-Large (355M) with agreement-based routing; LLM arbitration triggered only when models disagree (~1% of queries), reframing the LLM's task from ternary to binary classification. Avoids the failure mode of confidence-based routing on overconfident model errors. Key finding: architecture matters more than scale, and inter-model disagreement is a better uncertainty signal than softmax confidence. Presented at Johns Hopkins Carey Business School national championship, February 2026.

roleLead — team Response Outliers (ResOut)outcome~62ms inference, ~$0.00026/query — 77× cheaper than LLM-only. Paper submitted to INFORMS Journal on Data Science.stackDeBERTa-v3RoBERTaPyTorchTransformers
[02]2026 · Independent research
The Scarcity Fallacy
// Whether expertise framing imposes an unnecessary scarcity constraint on LLMs.

Independent research investigating whether persona framing like "You are an expert in X" — a norm borrowed from human collaboration contexts — imposes an unnecessary scarcity constraint on LLMs. Core hypothesis: telling an LLM to be a specialist doesn't activate specialized knowledge; it artificially suppresses everything else, potentially narrowing the conceptual search space on generative ideation. Refined task-dependent hypothesis: expertise framing is neutral-to-noise on retrieval-heavy factual tasks, beneficial for structured reasoning, potentially harmful for creative/generative ideation. Measurement pipeline uses Shannon entropy over a 15-domain taxonomy.

roleSolo — research, engineering, writingoutcomePilot complete (n=50). 2,880-cell experimental design across 4 models × 4 areas × 4 tasks × 3 conditions × 15 generations.stackPythonGPT-4oClaude 3.5Llama 3.1Mistral
[03]2026 · Personal build
Prism — Multi-Agent AI Deliberation
// Edward de Bono's Six Thinking Hats, with the user as the Red Hat.

Structured decision-making tool: AI personas (optimist, pessimist, analyst, creative, synthesizer) offer distinct perspectives in parallel while the user provides emotional context as the Red Hat. The Blue Hat (synthesizer) integrates all inputs including the user's own. Underlying data model supports custom personas, custom panels, weighted ensembles, and alternate deliberation frameworks beyond Six Hats. Provider-abstracted AI layer supporting both Anthropic Claude and OpenAI GPT-4o.

roleSolo — design, full-stackoutcomeParallel streaming perspectives, on-demand Blue Hat synthesis with dissent flagging. Extensible to custom panels and weighted ensembles.stackNext.jsTypeScriptSupabaseAnthropicOpenAI
[04]2024 · Open source
DConfusion
// Confusion-matrix analysis + ML meta-research auditing library.

Open-source Python library for confusion-matrix analysis and ML meta-research auditing. Features include metric reconstruction, probabilistic inference, statistical testing, and cost-sensitive analysis. Designed to catch errors and implausible results in published ML evaluations. Reflects an applied interest in ML reliability and the integrity of published evaluation metrics.

roleAuthor, maintaineroutcomeCatches errors and implausible results in published ML evaluations. Complementary to mlscorecheck.stackPython
[05]2020 · Prime Healthcare
COVID Reporting Portal
// Cross-hospital data warehouse during the pandemic.

Built and maintained a COVID data warehouse aggregating data from all 46 Prime Healthcare hospitals to support corporate decision-making and government reporting. Developed automated report interfaces and data feeds; surfaced proactive savings opportunities and value analyses for contracts under review. Most of the original architectural work I did pre-pandemic was compressed into a few months when the warehouse became operational under time pressure.

roleBusiness Analyst → Architectoutcome46 hospitals consolidated into one queryable warehouse. Drove corporate decision-making and government reporting.stackPythonSQLTableau

Experience

// experience.log
2022 — Present
Austin, TX
Technical Analyst → Business Support Engineer @ Meta Platforms

NLP automation and end-to-end support journeys at platform scale.

  • Own end-to-end user support journeys for global developer + business surfaces; design intake funnels between self-service and assisted channels.
  • Drive reliability and DX metrics (CSAT, re-open rate, TTR) via debugging, proactive monitoring, and infra fixes.
  • Built NLP-based classification + automation in Hack and JavaScript that triage and auto-resolve cases at Meta scale; LLM-driven intent + issue detection.
  • Defined long-term automation strategy and roadmaps; partnered with PMs and analytics to size opportunities and instrument quality metrics.
HackJavaScriptPythonNLPLLMs
2018 — 2022
Ontario, CA
Business Analyst → Business Analyst II @ Prime Healthcare Management

46-hospital data systems, COVID warehouse, and purchasing automation.

  • Built and maintained a COVID data warehouse aggregating data from all 46 Prime hospitals; powered corporate decision-making and government reporting.
  • Designed a React portal for physician-preference-item contract administration with Python automation for purchase-approval workflows.
  • Automated purchasing across 45+ hospitals in Python — order feeds for recurring items meeting strict pricing criteria; materially reduced manual error.
  • Built dashboards surfacing proactive savings opportunities; item crosswalks and contract-analysis tooling for value analyses.
PythonReactSQLTableau
2014 — 2018
West Lafayette, IN
Teaching Assistant + Data Analyst @ Purdue University · Krannert

Predictive modeling, stochastic optimization, and four years TAing.

  • Predictive modeling and stochastic optimization across multiple stakeholder engagements.
  • Optimized data pipelines on cloud infrastructure — significant runtime reductions on production data models.
  • TA for analytics courses across four years; consistently top-3 of 16 in the program.
PythonRSASOptimization

Credentials

// degrees, patents, certifications
education/
MS, Business Analytics and Information Management · Purdue University · Krannert · 2018
BS, Industrial Management + Economics + Supply Chain Information & Analytics · Purdue University · Krannert · 2017
publications/
Detecting AI Hallucinations in Educational Contexts: A Transformer-Based Approach to Protecting Student Learning · Submitted · INFORMS Journal on Data Science · 2026
Role of Political Ideology in Friendship Social Networks · Midwest Data Science Conference · 2018
Effect of Forecast Accuracy on Inventory Optimization Model · Midwest Data Science Conference · 2018
Carrier Choice Optimization with a Tier-Based Rebate Program · Midwest Data Science Conference · 2018
talks/
AI Hallucination Detection: A Transformer-Based Approach to Protecting Student Learning · Johns Hopkins Carey · National Championship · 2026
Role of Political Ideology in Friendship Social Networks · Midwest DSI · Indianapolis (oral) · 2018
Effect of Forecast Accuracy on Inventory Optimization Model · INFORMS Analytics · Baltimore (poster) · 2018
Carrier Choice Optimization with a Tier-Based Rebate Program · INFORMS Analytics · Baltimore (poster) · 2018
awards/
Data4Good 2025 — National Competition Winner (Non-Academic Track, AI Hallucination Detection) · 2025
Best Machine Learning Algorithm — Identification of Clickers from Bookers · Online travel-booking competition · 2017
Krannert School of Management Scholarship · $7,500 · Purdue University · 2017

Projects

// ls projects/ · featured 1 of 8
LinkedIn Resume Optimizer
/linkedin-resume
Two-pass LLM pipeline: normalize a PDF resume into structured data, then rewrite it against a job description.
Domain
AI · Productivity
Year
2026
Stack
FastAPI · React · Vite · Anthropic
Status
· private

Now

// now.md · Apr '26
building/
Scaling a new generation of LLM support assistants at Meta.
Pilot testing Scarcity Fallacy research.
reading/
Hitler by Ian Kershaw
Life 3.0 by Max Tegmark

Uses

// /etc/uses
editor
Neovim + tmux, VS Code for React work
languages
Python, TypeScript, Go, Hack
infra
Postgres, Supabase, Airflow, Docker
hardware
MacBook Pro M3, ZSA Moonlander