Andi Shehu, PhD

LLM & AI Systems Architect
Principal Applied AI Specialist

I design and deliver AI systems that are reliable in production and aligned to business goals.

New York, NY

PhD in Physics (Quantum Information) Sectors: Healthcare, AdTech, Finance, Academia Specialties: LLM systems, evaluation, delivery

"The value of AI is not novelty. It is better decisions, repeated at scale."

Selected Work

NYU | Applied AI Enablement Design and build an end-to-end AI research hub for discovery, synthesis, and workflow support.
NYU | Research Adoption Train researchers to adopt AI in lab workflows and provide one-on-one advisory support across science and social science teams.
NYU | Program Delivery Lead a focused applied AI team and support grant strategy, technical writing, and AI team design.
Healthcare Build predictive models for Star Ratings and patient churn to support operational planning and patient-focused decisions.
AdTech Deliver audience modeling and campaign measurement systems at Microsoft and DeepIntent, with production-grade pipelines.
Cross-Industry Advise organizations on AI strategy, implementation, and governance.
Operating Standard Apply scientific rigor to model quality, reliability, and risk.

Define the decision, owner, and success metric before model work begins.

Use clear architecture, retrieval, and control layers to improve output quality and reduce failure modes.

Establish testing, monitoring, and guardrails before launch.

NYC / Remote | 2023 - Present

Designed AI pipelines for research automation, enterprise search, and assistant products.

New York, NY | Dec 2023 - Jul 2025

Built audience modeling and campaign measurement systems using Python, Spark, and AWS.

New York, NY | 2021 - 2023

Designed segmentation and predictive modeling frameworks and partnered with engineering to ship ML and NLP pipelines.

New York, NY | 2015 - 2021; 2023 - Present

Led end-to-end AI engagements from problem framing through deployment for startup and enterprise clients.

Focused on GenAI system design first, then reliable data, model, and deployment operations.

LangChain, LangGraph, LlamaIndex, MCP, tool calling, workflow orchestration

OpenAI, Anthropic, Hugging Face, open-source models (Llama, Mistral, Qwen, Gemma)

RAG pipelines, vector stores, Weaviate, Chroma, hybrid retrieval

Evals, tracing, observability, reliability testing, safety controls

GPU-enabled pipelines, HPC/distributed compute workflows, Docker, FastAPI

Python, R, SQL, PySpark, PyTorch, scikit-learn, AWS, Azure

PhD, Physics (Quantum Information Theory) - CUNY Graduate Center, New York, NY, 2015

MS, Mathematics - Hunter College, New York, NY, 2010

"In strong AI work, the model is only part of the answer. The rest is context, judgment, and care."