Passionate about building AI systems that combine Machine Learning, Computer Vision, and Large Language Models to solve real-world problems.
- Agentic RAG Pipelines
- AI Agents & Multi-Agent Systems
- LLM Orchestration
- Production-ready AI Applications
πΌοΈ Image Captioning Refinement System Using Deep learning and Computer Vision β M.Sc. Project
A six-stage multi-model fusion pipeline that generates, scores, and refines image captions using Florence-2-Large, BLIP ITM, Jina Reranker M0, Cosine Similarity, majority voting, and Qwen2.5-1.5B-Instruct β evaluated on the Flickr8k benchmark (8,091 images) and deployed as a live web app.
π GitHub Repo Β· Live Demo
NLP-based classification system using TF-IDF vectorization and supervised ML models to detect fake news articles. Compared Logistic Regression, SVM, Naive Bayes, and Decision Tree across labeled datasets.
π GitHub Repo
Analysis of how the Bitcoin Fear and Greed Index influences trader behavior and performance across 125,830 on-chain trades on Hyperliquid β covering win rate, position sizing, trade direction, and profitability across all five sentiment regimes with statistical significance testing.
π GitHub Repo
I am a recent M.Sc. Data Science graduate from Aligarh Muslim University (AMU) with a strong interest in Machine Learning, Computer Vision, Deep Learning, and Generative AI.
My work focuses on developing intelligent systems that can understand and generate meaningful insights from data, images, and text. I enjoy building end-to-end AI solutions, from data collection and preprocessing to model development, evaluation, and deployment.
Recently, I worked on a multimodal Image Captioning Refinement System that combines Florence-2, BLIP, Qwen, Jina Reranker, and embedding-based evaluation techniques to generate high-quality image descriptions. I am also interested in LLM evaluation, Vision-Language Models, and AI-powered applications.
Currently, I am seeking opportunities as a Data Science Intern, AI Intern, or Entry-Level Data Scientist where I can contribute, learn, and grow while solving impactful problems.
- Deep Learning β Neural Networks (ANN, CNN, RNN, LSTM), Transformers, LLMs, Vision-Language Models (BLIP, Florence-2, Qwen)
- Computer Vision & NLP β Image Captioning, Multimodal AI, TF-IDF, Tokenization, Text Classification, Embedding Extraction
- Machine Learning β Logistic Regression, SVM, Naive Bayes, Decision Trees
- Data β Exploratory Data Analysis, Feature Engineering, Statistical Analysis, Predictive Modeling
- Visualization β Matplotlib, Seaborn, Tableau, Plotly
- APIs β Gemini API, Jina API
LLM Post-Training Intern β Ethara.ai Feb 2026 β Mar 2026
- Evaluated and annotated LLM outputs for quality, coherence, and alignment with predefined benchmarks to support model post-training.
Data Scientist Intern β Evoastra Ventures Inc. Oct 2025 β Nov 2025
- Built a deep learningβbased image captioning system using CNN + LSTM
- Performed text preprocessing, tokenization, and vocabulary creation
- Conducted EDA and documented insights for real-world business use cases
- M.Sc. Data Science β Aligarh Muslim University (2024β2026)
- B.Sc. Mathematics β CCS University (2021β2024)
β Feel free to explore my repositories and connect with me for collaboration or opportunities in Data Science & Machine Learning.