Faris Elshammouty AI Engineer

I build and ship production AI systems: large language models, retrieval pipelines, and autonomous agents. I like taking an idea from rough prototype to something people can actually rely on.

About
Faris Elshammouty at JPMorgan Chase

I build AI that holds up in production: measured, observable, and genuinely useful to the people who rely on it.

My focus is the full lifecycle of applied machine learning: framing the problem, choosing the right model strategy, and engineering the system around it, from data pipelines to evaluation, deployment, and monitoring. I care as much about latency, cost, and reliability as I do about accuracy.

Lately I work most with large language models: designing retrieval-augmented architectures, fine-tuning for narrow domains, and building tool-using agents that do real work safely. The part I enjoy most is when a rough prototype turns into dependable infrastructure.

Capabilities

What I work with.

01

Language models

  • LLMs
  • Prompt engineering
  • Fine-tuning
  • Agents & tool-use
  • Hugging Face
02

Retrieval & data

  • RAG
  • Embeddings
  • Vector databases
  • LangChain
03

Modeling

  • Python
  • PyTorch
  • TensorFlow
  • Evaluation
04

Deployment & MLOps

  • FastAPI
  • Docker
  • AWS / GCP
  • Model serving
How I work

Engineering discipline, applied to AI.

Evaluation first

I define what “good” means with measurable evals before building, so progress is real and regressions are visible.

Grounded & safe

Retrieval, citations, and guardrails keep model output anchored to sources and within safe bounds.

Cost & latency aware

The right model for the job, balancing accuracy against the budget and speed a product can actually afford.

Built to operate

Monitoring, tracing, and clean interfaces so systems stay reliable long after launch day.

Contact

Let’s work together.