InputConvDenseBottleneckReadout

Hi, 

I'm Arvind Narayan

Staff AI/ML Engineer 

I build AI systems that actually work in the real world — for biotech research labs, financial risk teams, and enterprise organizations. Currently at GATC Health, where the AI research tool I built is now the team's daily interface for scientific literature and drug discovery work. My ML models for drug property prediction achieve F1 ≈ 0.90, AUROC ≈ 0.92 — near state-of-the-art accuracy on biomedical benchmarks. Also founding Superscaled. Nine years shipping at Yahoo, upGrad, and Egen.ai.

0+Years Engineering
0M+Users Impacted
0+ML Models Trained & Deployed
Arvind Narayan — Staff AI/ML Engineer
Selected Work

Products & Case Studies

Capabilities

I architect AI systems that work in production.

Biotech research, financial risk, enterprise AI — the full stack, end to end.

RAG & Agentic AI Systems

An AI research assistant used daily by lab scientists — finds papers, looks up drug data, and reasons across millions of documents without losing context. Technically: hybrid search (keyword + semantic), long-context memory across sessions, and tool-calling so the AI can autonomously query ChEMBL, PubChem, and specialized ML models.

Molecular ML & GNNs

ML models that predict whether a drug compound will be toxic, absorbed correctly, or reach the brain — F1 ≈ 0.90, AUROC ≈ 0.92 (near state-of-the-art). Graph neural networks that read molecular structure as a graph, deployed from data curation to cloud inference APIs.

Clinical & Research Analytics

Investigated why a Phase 3 drug trial didn't show results. Discovered that how patients spoke during visits — not just what they reported — predicted who responded to placebo, pointing to a clinic habituation effect. Delivered recommendations that shaped future trial design.

LLM Infrastructure & MLOps

Deployed a 27 billion parameter language model on private cloud infrastructure — no external API dependency, full data control. Built quality evaluation pipelines, customized the model for scientific language, and designed the system so newer AI models can be swapped in without disrupting workflows.

Also Building

Superscaled

An early-stage AI venture for engineering teams that need to move faster than their organization's approval and procurement processes allow. There's a frustrating gap between “we have a working model” and “this is actually running in production, monitored, and trusted by the business.” Superscaled builds the tooling that closes it.

Building in public at superscaled.com.

Visit Superscaled
Tools

Tech Stack

From ML and data pipelines to infrastructure and product UI — organized by domain.

AI & LLMs

Python
PyTorchHugging FaceLangChain / LangGraphSGLangQwen / LlamaPrompt EngineeringRLHF / Fine-Tuning

ML Engineering

scikit-learnXGBoost / LightGBMGNNs (PyG)EDA & Feature AnalysisModel Evaluation & AblationSMOTE / Class Balancing

MLOps

Experiment TrackingModel Registry & VersioningCI/CD for MLModel Monitoring & DriftReproducible TrainingStaged Rollouts & CanaryFeature Store Patterns

Data Engineering

AWS GluePySparkAmazon RedshiftAmazon S3

Retrieval & Data Stores

BM25 / TF-IDFDense EmbeddingsCross-Encoder RerankingPineconepgvectorWeaviateChroma
PostgreSQL
Redis
MongoDB
SQS / Event Pipelines

Infrastructure & IaC

Infrastructure as CodeTerraformKubernetesHelmDockerELK Stack

Cloud & APIs

AWS
SageMakerFastAPIGolang APIsGraphQLGPU Inference

Application Development

React
TypeScript
Node.js
GolangFastAPITailwind CSSshadcn/ui

A Decade of Shipping

From BASIC programs to distributed systems at scale.

Apr 2025–

GATC Health

Senior AI/ML Engineer · GATC Health

Hybrid RAG + Agentic Research Platform

Built an enterprise AI research platform (think Perplexity, but for science) — used daily by lab teams to search the scientific literature, look up drug compounds, and explore hypotheses. Became the primary AI interface across the organization within weeks of launch.

Designed a hybrid search system that combines traditional keyword matching with modern semantic search — so it finds the right documents whether you phrase a query exactly or ask it conversationally. Achieves high recall across large chemical and biological literature corpora. (Technical: BM25 + dense embeddings, late fusion, cross-encoder reranking.)

Built a long-context memory system that lets the AI maintain coherence across multi-session research workflows — so scientists can return hours or days later and pick up exactly where they left off, with the AI retaining full context. Scales to millions of words without losing coherence.

Gave the AI the ability to autonomously use specialized tools — toxicity lookups, molecular property prediction, chemical structure searches, and live queries to scientific databases (ChEMBL, PubChem) — so it can answer complex drug discovery questions without a human in the loop.

Deployed a 27 billion parameter language model on our own AWS infrastructure — no reliance on external AI providers, full data control. The system is modular, so newer and more capable models can be swapped in as they become available without rebuilding the application.

RAGBM25Dense EmbeddingsRerankingAgentic AILangGraphQwen 3.5SGLangSageMakerLIGHT Memory

PAL-3 Post-Hoc Clinical Analytics · VistaGen Therapeutics

Investigated why a large Phase 3 clinical trial for social anxiety didn't show the expected improvement over placebo. Combined patient health scores over time, site visit records, enrollment timing, and speech patterns measured from recorded therapy sessions to find out what actually happened.

Discovered that how patients spoke during visits — how much they talked, sentence length, total talk time — was a reliable predictor of who responded to placebo. This pointed to a habituation effect: the act of repeatedly attending clinic visits was itself reducing anxiety, making it harder for the drug to show a measurable difference.

Also explored how recruitment channels, site scheduling patterns, and patient vs. clinician communication styles influenced outcomes — delivering concrete recommendations to help the sponsor design more robust future trials.

Clinical NLPSpeech FeaturesLongitudinal AnalysisPlacebo ModelingPythonPandas

Graph Neural Networks for Molecular Property Prediction

Built ML models that predict drug compound properties from molecular structure — specifically whether a compound is toxic, how it gets absorbed and metabolized, and whether it can cross into the brain. Used graph neural networks (GNNs) that “read” a molecule as a graph — atoms as nodes, bonds as edges — rather than treating it as a flat list of numbers. F1 ≈ 0.90 · AUROC ≈ 0.92.

Handled the full data pipeline: sourcing compound data from public scientific databases (ChEMBL, PubChem), cleaning and deduplicating records, and addressing the common problem of heavily imbalanced datasets — where toxic compounds are rare, making them harder to learn from without special handling.

Deployed the models as low-latency GPU-accelerated APIs on Kubernetes— available to internal teams via well-documented endpoints, ready to integrate into broader drug discovery workflows.

GNNsPyTorch GeometricADMETBBB PredictionSMOTEKubernetesChEMBLPubChem

Sept 2021–Apr 2025

Egen.ai

Senior ML Engineer · Egen.ai

Financial Risk & Pricing ML · DriveTime · Carvana

Led development of production ML systems for financial risk and pricing products used by DriveTime and Carvana — two of the largest used-car finance platforms in the US. These models directly influenced which loans get approved and at what rate, across hundreds of thousands of originations annually.

Built models that predict loan pricing, credit risk, vehicle value over time, and likelihood of default — achieving F1 scores close to 0.90 across use cases, validated through rigorous backtesting against historical loan performance.

Owned the full data pipeline — working closely with client teams on data quality, feature selection, and model validation. Also ran fairness and bias audits to ensure the models didn't discriminate in ways that would create regulatory exposure.

PythonXGBoostscikit-learnFeature EngineeringBacktestingFairness Auditing

Data Engineering · Warehousing & Lake Pipelines

Built and maintained batch data pipelines and analytics warehouses that feed underwriting and pricing models — landing raw data in Amazon S3, orchestrating transforms with AWS Glue and PySpark, and delivering curated features and reporting layers in Amazon Redshift for analysts and model consumers.

Owned schema evolution, partition strategies, job monitoring, and cost-aware ETL so daily scoring and backtests stayed reliable as data volume grew across client fleets.

AWS GluePySparkAmazon RedshiftAmazon S3ETLData Modeling

ML Inference Platform · Kubernetes · IaC

Led the team to build a scalable microservices inference platform on AWS Kubernetes — packaged with Helm for repeatable releases and provisioned with Terraform (infrastructure as code). Full observability via the ELK stack — structured logging, latency dashboards, and auto-scaling to absorb burst inference across multiple model families.

Authored custom Docker images with CUDA support and GPU-aware runtimes so deep learning models serve with predictable performance in production.

Inference platform dashboard
KubernetesHelmTerraformDockerCUDAELK StackAWSGPU InferenceAuto-scaling

Team Leadership & ML Best Practices

Managed and mentored a team of engineers, driving MLOps practices across data ingestion, experiment tracking, model training, and production deployment — standards for reproducibility, registry-backed model versioning, monitoring and drift checks, and staged rollout / canary releases that became the team's default workflow.

MLOpsExperiment TrackingModel RegistryCI/CD (ML)Canary ReleasesEngineering Leadership

Dec 2019–Sept 2021

upGrad

Lead Software Engineer · upGrad

LMS Rebuild · Full-Stack Platform [Case Study]

75% improvement in Core Web Vitals — led the rewrite of a large-scale Learning Management System serving 3M+ active learners, with a strong focus on PWA support, offline-first usage, and global responsiveness for low-bandwidth networks.

Architected the stack across React + TypeScript for learner UX, Node.js and Golang services for performance-critical APIs, and caching layers with Redis. API response times were pushed below 200ms for core learner journeys.

Implemented edge-aware delivery and multi-geo cache strategy (regional CDN + Redis invalidation patterns), so content stayed fast and consistent across markets. Combined code splitting, prefetching, and fallback offline flows so learners could continue progress even with unstable connectivity.

Worked closely with product, design, and infrastructure teams on rollout, SLO tracking, incident response, and platform guardrails; mentored engineers through the migration and established performance standards that persisted after launch.

upGrad LMS
ReactTypeScriptNode.jsGolangPWAOffline-firstEdge CachingCode SplittingCDNRedis<200ms APIsPerformance Engineering

Assessments & Test Platform

Revamped the online assessment and test platform used for proctored exams and skills validation at scale. Rebuilt critical paths for faster item load, more stable delivery under spike traffic, and clearer observability for the ops team — with up to ~40% performance gains on slow-device profiles in lab tests.

Same React / TypeScript / Node stack as the LMS program, extended to secure session handling, timer-critical UX, and coordination with backend teams on data consistency for grading pipelines.

upGrad test platform
ReactTypeScriptNode.jsLoad OptimizationObservability

ML Product: “Shorts” — Micro-Learning Feed

Engineered upGrad Shorts as a learning-science driven micro-learning product. Built the ranking and retention engine using the SM2 spaced repetition algorithm paired with a neural network classifier to predict optimal review intervals per learner, improving long-term knowledge retention.

Owned the end-to-end experience across personalization logic, experimentation loops, and React delivery surfaces for short-form lessons, balancing educational outcomes with feed engagement.

Partnered with Growth on acquisition, retargeting, and cross-sell — the feature drove a measurable lift in engagement and contributed roughly 15% improvement across retargeting and cross-sales experiments tied to the short-form experience.

upGrad Shorts micro-learning
PythonReactSM2 AlgorithmLearning Science MLFFNNSpaced RepetitionPersonalizationRecommendation Systems

Student Dropout & Failure Prediction

Built models that flag students at risk of failing or dropping out months in advance — before it's too late to intervene. Based on how often they logged in, how long they spent on content, attendance, and peer interaction patterns. Gave academic advisors an early warning system so they could reach out before outcomes became irreversible.

Production path: batch scoring jobs and lightweight service APIs so advisor dashboards stayed fresh without overloading OLTP systems — coordinated with data and security on retention policies for student data.

Pythonscikit-learnFeature EngineeringBehavioral DataEarly Warning SystemsBatch Scoring

Mar 2018–Dec 2019

Yahoo!

Software Engineer · Yahoo!

Yahoo Ad.com — Self-Serve SMB Ad Platform

Contributed to Yahoo Ad.com, a self-serve advertising platform enabling SMBs to create and manage ad campaigns across the Yahoo network with minimal setup friction.

Built smart onboarding flows — a small business just enters their website URL, name, and category, and the system automatically generates a ready-to-launch campaign. Measurably reduced the time for a new advertiser to get their first ad live.

Yahoo Ad.com platformYahoo advertising mobile
ML OnboardingCampaign AutomationNLPPython

Business Knowledge Graph

Built a structured knowledge base about businesses by automatically extracting information from company websites, metadata, and third-party sources — building a rich understanding of what each advertiser does, who their customers are, and what they want to achieve.

Used that knowledge to pre-fill campaign setup automatically — suggesting the right call-to-action, target audience, location targeting, and bid strategy — so new advertisers could launch a campaign in minutes instead of hours, measurably reducing drop-off during onboarding.

Knowledge GraphInformation ExtractionEntity ResolutionAudience TargetingGraph ML

2017

Fulfil.io

Engineering Intern · Fulfil.io

Built the CMS infrastructure for Fulfil's marketing team — enabling non-technical stakeholders to manage content, landing pages, and product documentation independently without engineering involvement.

Led a product-wide UI redesign, modernizing the visual language of the B2B ERP platform and improving consistency across core modules. First exposure to production engineering, shipping cadences, and real-user feedback loops at a venture-backed SaaS company.

Fulfil.io