Skip to content
View Afsha001's full-sized avatar
🎯
Code the noise. Decode the truth.
🎯
Code the noise. Decode the truth.

Block or report Afsha001

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Afsha001/README.md

Hi πŸ‘‹, I'm Afsha Anjum

Data Scientist β€’ Machine Learning Engineer β€’ Computer Vision & Generative AI Enthusiast

Passionate about building AI systems that combine Machine Learning, Computer Vision, and Large Language Models to solve real-world problems.


πŸ“« Connect With Me


πŸš€ Featured Projects

🚧 Currently Working On

  • Agentic RAG Pipelines
  • AI Agents & Multi-Agent Systems
  • LLM Orchestration
  • Production-ready AI Applications

πŸ–ΌοΈ Image Captioning Refinement System Using Deep learning and Computer Vision β€” M.Sc. Project

A six-stage multi-model fusion pipeline that generates, scores, and refines image captions using Florence-2-Large, BLIP ITM, Jina Reranker M0, Cosine Similarity, majority voting, and Qwen2.5-1.5B-Instruct β€” evaluated on the Flickr8k benchmark (8,091 images) and deployed as a live web app.

πŸ”— GitHub Repo Β· Live Demo


πŸ“° Fake News Detection using Machine Learning

NLP-based classification system using TF-IDF vectorization and supervised ML models to detect fake news articles. Compared Logistic Regression, SVM, Naive Bayes, and Decision Tree across labeled datasets.

πŸ”— GitHub Repo


πŸ“Š Hyperliquid Sentiment Analysis

Analysis of how the Bitcoin Fear and Greed Index influences trader behavior and performance across 125,830 on-chain trades on Hyperliquid β€” covering win rate, position sizing, trade direction, and profitability across all five sentiment regimes with statistical significance testing.

πŸ”— GitHub Repo


πŸš€ About Me

I am a recent M.Sc. Data Science graduate from Aligarh Muslim University (AMU) with a strong interest in Machine Learning, Computer Vision, Deep Learning, and Generative AI.

My work focuses on developing intelligent systems that can understand and generate meaningful insights from data, images, and text. I enjoy building end-to-end AI solutions, from data collection and preprocessing to model development, evaluation, and deployment.

Recently, I worked on a multimodal Image Captioning Refinement System that combines Florence-2, BLIP, Qwen, Jina Reranker, and embedding-based evaluation techniques to generate high-quality image descriptions. I am also interested in LLM evaluation, Vision-Language Models, and AI-powered applications.

Currently, I am seeking opportunities as a Data Science Intern, AI Intern, or Entry-Level Data Scientist where I can contribute, learn, and grow while solving impactful problems.


πŸ› οΈ Tech Stack

Programming Languages

Python R SQL

Machine Learning & Deep Learning

Scikit-Learn TensorFlow PyTorch

Generative AI & VLMs

HuggingFace Transformers BLIP Florence-2 Qwen

Data Analysis & Visualization

Pandas NumPy Matplotlib Plotly Tableau

Tools & Platforms

GitHub Google Colab Streamlit VS Code


πŸ’‘ Skills & Expertise

  • Deep Learning β€” Neural Networks (ANN, CNN, RNN, LSTM), Transformers, LLMs, Vision-Language Models (BLIP, Florence-2, Qwen)
  • Computer Vision & NLP β€” Image Captioning, Multimodal AI, TF-IDF, Tokenization, Text Classification, Embedding Extraction
  • Machine Learning β€” Logistic Regression, SVM, Naive Bayes, Decision Trees
  • Data β€” Exploratory Data Analysis, Feature Engineering, Statistical Analysis, Predictive Modeling
  • Visualization β€” Matplotlib, Seaborn, Tableau, Plotly
  • APIs β€” Gemini API, Jina API

πŸ§ͺ Experience

LLM Post-Training Intern β€” Ethara.ai Feb 2026 – Mar 2026

  • Evaluated and annotated LLM outputs for quality, coherence, and alignment with predefined benchmarks to support model post-training.

Data Scientist Intern β€” Evoastra Ventures Inc. Oct 2025 – Nov 2025

  • Built a deep learning–based image captioning system using CNN + LSTM
  • Performed text preprocessing, tokenization, and vocabulary creation
  • Conducted EDA and documented insights for real-world business use cases

πŸŽ“ Education

  • M.Sc. Data Science β€” Aligarh Muslim University (2024–2026)
  • B.Sc. Mathematics β€” CCS University (2021–2024)

⭐ Feel free to explore my repositories and connect with me for collaboration or opportunities in Data Science & Machine Learning.

Popular repositories Loading

  1. fake-news-detection-ML fake-news-detection-ML Public template

    Fake news detection using Logistic Regression, Naive Bayes, Decision Tree, and SVM in Python.

    Jupyter Notebook

  2. Afsha001 Afsha001 Public

    "The goal is to turn data into information, and information into insight."

  3. Image-Captioning-refinement-System Image-Captioning-refinement-System Public

    Image Captioning Refinement Using Deep Learning and computer-vision β€” Flickr8k dataset with BLIP, Jina, Qwen and Majority Voting

    Jupyter Notebook

  4. hyperliquid-sentiment-analysis hyperliquid-sentiment-analysis Public

    Analysis of how the Bitcoin Fear and Greed Index influences trader behavior and performance across 125,830 on-chain trades on Hyperliquid β€” covering win rate, position sizing, trade direction, and …

    Jupyter Notebook