Senior AI Engineer
Fulcrum Digital
- Architected production ML serving infrastructure on Azure Kubernetes Service (AKS), processing 50K+ inference requests daily with sub-200ms latency.
- Built full-stack GenAI apps with React, FastAPI, LangGraph agentic workflows, and RAG pipelines, reducing manual document processing by 70%.
- Designed event-driven microservices with Kafka and RabbitMQ for real-time data ingestion across distributed systems.
- Implemented observability with Grafana and Prometheus; optimized pipelines with Numba/CUDA for a 4x throughput improvement.