Hello, I'm
McLean, VA (Open to Relocate)
AI/ML Engineer with 9+ years of experience designing, developing, and deploying scalable machine learning, deep learning, and Generative AI solutions. Skilled across the end-to-end ML lifecycle — data engineering, feature engineering, model building, deployment, and monitoring. Hands-on with LLMs, prompt engineering, RAG pipelines, NLP, and computer vision, building production-grade systems using FastAPI, Docker, Kubernetes, and MLOps practices on AWS, Azure, and GCP.
I'm Harsha Vardhan Reddy, an AI/ML Engineer with 9+ years of experience designing, developing, and deploying scalable machine learning, deep learning, and Generative AI solutions. I work across the full ML lifecycle — data engineering, feature engineering, model building, deployment, and monitoring — applying supervised, unsupervised, and reinforcement learning to solve complex business problems.
I have strong expertise in deep learning architectures (CNNs, RNNs, LSTMs, Transformers) for NLP and computer vision, and hands-on experience with Generative AI — LLMs, prompt engineering, embeddings, and Retrieval-Augmented Generation (RAG). I build production-ready ML systems with FastAPI, Flask, Docker, and MLOps practices (CI/CD, model versioning, monitoring, automated retraining) on AWS, Azure, and GCP, working with large datasets using Python, SQL, Pandas, NumPy, and Spark.
Environment: Python, Scikit-learn, XGBoost, LightGBM, TensorFlow, PyTorch, Keras, FastAPI, Flask, Docker, Kubernetes, MLflow, Airflow, CI/CD (Jenkins/GitHub Actions).
Environment: Python, Hugging Face Transformers, TensorFlow, PyTorch, Keras, OpenCV, FastAPI, Flask, Docker, Kubernetes, MLflow, Airflow, ONNX, Optuna, CI/CD (Jenkins/GitHub Actions).
Environment: Python, R, Scikit-learn, XGBoost, LightGBM, TensorFlow, PyTorch, Keras, FastAPI, Flask, Docker, Kubernetes, MLflow, Airflow, Optuna, SHAP, LIME, CI/CD (Jenkins/GitHub Actions).
Environment: Python, R, Scikit-learn, XGBoost, LightGBM, TensorFlow, PyTorch, Keras, Spark, Statsmodels, FastAPI, Docker, Kubernetes, MLflow, Airflow, Optuna, SHAP, LIME.
Environment: Python, R, Scikit-learn, XGBoost, LightGBM, TensorFlow, PyTorch, Keras, Spark, Statsmodels, FastAPI, Docker, Kubernetes, MLflow, Airflow, Optuna, SHAP, LIME, DVC.
Built a scalable chatbot framework handling multi-turn conversations with memory and contextual awareness using LLMs and Retrieval-Augmented Generation. Combined vector-database semantic search (FAISS/Pinecone) with fine-tuned transformer models for domain-specific relevance, served through a fault-tolerant FastAPI + Kubernetes layer supporting high concurrent request loads.
Developed a personalization engine leveraging collaborative filtering, matrix factorization, and deep-learning embeddings to power real-time recommendations. Backed by automated feature engineering pipelines and an end-to-end MLOps workflow (CI/CT/CD), with model monitoring for data and concept drift to maintain accuracy in production.
Engineered unsupervised anomaly detection for fraud and predictive maintenance alongside time-series forecasting for demand planning. Built scalable batch and real-time prediction systems on Spark and Kafka streaming pipelines, with explainable-AI (SHAP, LIME) layers for transparency and automated drift-triggered retraining.
Served as a core committee member of IEEE Signal Processing Society, contributing to technical events and initiatives
Worked as Vice President of LEO Club, leading community service initiatives and organizational activities
AWS Certified Cloud Practitioner, demonstrating foundational knowledge of AWS cloud services and architecture
Certified Agent Force Specialist, showcasing expertise in specialized technical domains
I'm always interested in discussing new opportunities, innovative projects, and collaborative ventures in machine learning, Generative AI, LLM applications, and MLOps.