Open to AI Engineer roles · Dubai / UAE / Remote

hi, i'm mohammed zain rafeeque.

AI Engineer shipping production RAG systems, multi-modal document intelligence, and domain-specific LLM applications — with live demos to prove it.

Mohammed Zain Rafeeque

skills

Languages

Python TypeScript JavaScript SQL R

LLMs & RAG

LangChain vLLM pgvector ChromaDB BM25 + RRF Cross-encoder reranking Sentence-Transformers

AI Platforms

Claude (Anthropic) OpenAI Groq Gemini Vision Manus AI HuggingFace n8n

ML / DL

PyTorch TensorFlow / Keras Scikit-learn NumPy / Pandas OpenCV NLTK

Backend & Infra

FastAPI Flask Docker PostgreSQL Redis Nginx Linux / Bash

Frontend & Deploy

Next.js 14 React 18 Tailwind CSS Streamlit Gradio Vercel Render HuggingFace Spaces

work experience

2025-2026
MTA Investment LLC — Dubai
AI Engineer

Architecting production RAG systems and domain-specific LLM applications. Designed and shipped the bilingual Glass Industry Expert System — a fine-tuned Qwen2.5-14B with hybrid retrieval over 247K+ chunks — sustaining ~85% hit rate at 1–5 s latency. Also building autonomous retrieval pipelines and n8n workflow automations integrated with Gemini Vision.

2025
Khansaheb Sustainability — Dubai
AI Freelance

Architected n8n automated workflows to streamline sustainability operations, and delivered a RAG + LLM chatbot prototype with LangChain and Groq API to automate customer inquiries, provide intelligent product advice, and optimize support efficiency.

2025
Direct Axis Technologies — Dubai
AI Intern

Deployed real-world AI/ML solutions including an intelligent cheque processing system powered by neural networks and Python — automating data extraction, validation, and financial workflows for measurable productivity gains.

2024 - 2025
Meta Scifor Technologies
AI/ML Trainee

Completed intensive remote training across ML algorithms and deep learning frameworks. Strengthened production ML workflows, model deployment, and optimization strategies through hands-on Python projects.

2023
ReverTech
Python Intern

Enhanced Python programming through hands-on internship work focused on data structures, algorithms, and clean coding practices for scalable software.

projects

Live Demo DocIntel - Multi-Modal Document Intelligence

DocIntel — Multi-Modal Document Intelligence

Full-stack RAG application: drop any PDF, chat with it, and get answers grounded in the source with clickable page citations. Hybrid retrieval (sentence-transformers MiniLM dense vectors fused with BM25 lexical search via reciprocal rank fusion), one-shot structured extraction with downloadable CSVs, and a polished Next.js 14 + Tailwind workspace with embedded react-pdf viewer. FastAPI backend on Render + Next.js frontend on Vercel + Groq Llama-3.3-70B for inference.

FastAPI Next.js 14 RAG hybrid retrieval Groq TypeScript
Live Demo Malayalam Speech-to-Text

Malayalam Speech Translator

End-to-end Malayalam → English speech and text translator deployed on HuggingFace Spaces. Accepts mic recording or any audio format (OGG, MP3, WAV) via Gradio, transcribes with Google Speech API, translates with deep-translator. Includes an offline IndicTrans2 + ctranslate2 NMT pipeline as an alternative backend.

gradio huggingface speech recognition NLP python
Live Demo Loan Prediction System

Loan Approval Prediction System

End-to-end ML pipeline for automating loan approvals — data preprocessing, feature engineering, model selection (Random Forest / Logistic Regression), and a Streamlit web interface deployed on Streamlit Cloud. Tackles real-world challenges: missing data, outliers, and multicollinearity.

streamlit scikit-learn pandas flask machine learning
RAG-Powered Chatbot

Khansaheb Sustainability RAG Chatbot

Production RAG chatbot over Khansaheb's sustainability documents — LangChain + Groq for fast inference, vector retrieval over PDF corpus, and a Node.js front-end. Automates ESG inquiries with grounded, citation-backed answers. Built for Khansaheb Sustainability, Dubai; source code lives in a private client repository.

langchain groq RAG node.js vector DB
Glass Engineer AI

Glass Engineer AI — Bilingual RAG Expert System

Production-grade bilingual (English + Farsi) AI assistant for glass manufacturing engineers. Fine-tuned Qwen2.5-14B (vLLM, 4-bit LoRA), hybrid retrieval (pgvector HNSW + BM25 + Reciprocal Rank Fusion) over 247K+ chunks, cross-encoder reranking, Redis semantic caching, React 18 + TypeScript chat UI, deployed via Docker Compose — ~85% retrieval hit rate, 1–5s end-to-end latency. Built for MTA Investment LLC, Dubai; source code lives in a private company repository.

vLLM pgvector RAG react docker
Advertising Dashboard MVP

Advertising Dashboard MVP

Take-home assessment built in Flask + OpenAI. Full mini app with user authentication, campaign creation (product, keywords, targeting, banner upload), AI-generated ad copy via GPT-3.5-turbo with a template fallback, and simulated performance analytics. Demonstrates end-to-end web-app + LLM integration in a single self-contained codebase.

flask openai python bootstrap
Live Demo Image Processing

Image Preprocessing Toolkit

Interactive OpenCV + Streamlit demo covering all the foundational image-processing techniques: contrast enhancement (global HE, CLAHE), morphology (erosion, dilation, opening, closing), edge detection (Canny, Sobel, Laplacian, morphological gradient), and segmentation (thresholding, K-means, contours). Upload any image, drag the parameter sliders, download the result.

opencv streamlit computer vision python
Autoencoders Deep Dive

Autoencoders Deep Dive

Three-part practical guide to autoencoders: fundamentals, image denoising, and anomaly detection. Demonstrates dimensionality reduction, feature extraction, and reconstruction-loss analysis with TensorFlow/Keras.

tensorflow deep learning autoencoders jupyter
Live Demo AI Learning Style Predictor

AI Learning Style Predictor

End-to-end ML pipeline classifying primary students into VARK learning styles (Visual / Auditory / Reading-Writing / Kinesthetic) from a 15-question survey + study-habit features. Realistic synthetic dataset (5,000 students), feature engineering, and a Streamlit demo. Lifted accuracy from a 59% LogReg baseline to 84.5% with a Random Forest on engineered features (+25.3 pp).

scikit-learn streamlit feature engineering edtech python
Intelligent Cheque Processing

Intelligent Cheque Processing

Contributed to a neural network-based system for automated cheque extraction and validation at Direct Axis Technologies, Dubai. Streamlined financial workflows with high-accuracy OCR. Source code lives in a private team repository.

neural networks OCR python opencv

education

Vimal Jyothi Engineering College
Bachelor of Technology in Artificial Intelligence and Data Science
APJ Abdul Kalam Technological University | AICTE Approved | Dec 2020 - Sep 2024

certifications

One Million Prompters - AI Prompt Engineering
Dubai Centre for Artificial Intelligence (DCAI)
2025
Data Analytics Assessment
LearnTube (Career Ninja)
2024
Deep Learning - A Real-World Approach
Microsoft Research Scientist Mentorship
2024
Linux Shell Programming
IIT Bombay
2023
Python Programming
IIT Bombay
2023

get in touch

Shoot me a message on linkedin here

or email me at zainrafeeque@gmail.com