hexsyro/README.md

Hi, I'm Heshan 👨‍💻

I build production web scraping platforms that deliver structured data at scale

Python | Playwright | FastAPI

Pulse Aggregator Social Intel FinPull


Featured Projects

Production news platform indexing 100,000+ articles per week from 1,000+ news sources (BBC, Reuters, Guardian, TechCrunch), updated hourly.

4-tier RSS fallback → Playwright scraper (paywalls) → hourly APScheduler → full-text search + REST API
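
The tiered pipeline above can be sketched as an ordered fallback chain: try the cheapest source first and move to the next tier only when one fails or returns nothing. This is a minimal illustration, not the platform's actual code. `fetch_with_fallback`, the tier names, and the stand-in fetchers are all hypothetical; real tiers would call an RSS parser or a Playwright scraper instead of returning canned values.

```python
from typing import Callable, Iterable, Optional

# A fetcher takes a source URL and returns a list of items, or None/[] on a miss.
Fetcher = Callable[[str], Optional[list[str]]]


def fetch_with_fallback(
    url: str, tiers: Iterable[tuple[str, Fetcher]]
) -> tuple[str, list[str]]:
    """Try each tier in order; return (tier_name, items) from the first success."""
    errors = []
    for name, fetch in tiers:
        try:
            items = fetch(url)
        except Exception as exc:  # a failing tier must not stop the chain
            errors.append(f"{name}: {exc}")
            continue
        if items:
            return name, items
        errors.append(f"{name}: no items")
    raise RuntimeError("all tiers failed: " + "; ".join(errors))


# Hypothetical stand-ins for real fetchers (no network access).
def broken_feed(url: str) -> list[str]:
    raise ConnectionError("feed unreachable")


def empty_feed(url: str) -> list[str]:
    return []


def browser_scrape(url: str) -> list[str]:
    return [f"headline from {url}"]


TIERS = [
    ("primary-rss", broken_feed),
    ("alternate-rss", empty_feed),
    ("browser", browser_scrape),
]
```

Calling `fetch_with_fallback("https://example.com/news", TIERS)` skips the two failing feed tiers and returns the result tagged `"browser"`. Returning the tier name alongside the items makes it easy to log how often the expensive browser tier is actually needed.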

Articles Sources Uptime

FastAPI PostgreSQL Playwright Next.js APScheduler

Demo


OSINT dataset marketplace: 150+ premium datasets across 75+ platforms (Reddit, YouTube, Facebook, Telegram, etc.), enriched with sentiment and topic analysis and updated daily.

Pre-processed: sentiment scores, topic tags, engagement signals. Drop-in ready for Python/Tableau/LLMs.

Records Datasets Free

FastAPI PostgreSQL Next.js Paddle AWS S3

Demo


Production Goodreads scraper → structured CSV/JSON datasets (quotes, authors, tags).

BeautifulSoup Pagination Data validation Multi-page
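
As a rough sketch of the validation and export step listed above, the snippet below cleans scraped quote records and writes them to CSV using only the standard library. The record shape (`text`, `author`, `tags`), the `|` tag separator, and the function names are assumptions made for illustration; the repository's real schema may differ.

```python
import csv
import io
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Quote:
    text: str
    author: str
    tags: tuple[str, ...]


def validate(raw: dict) -> Optional[Quote]:
    """Return a cleaned Quote, or None when a required field is missing."""
    text = (raw.get("text") or "").strip()
    author = (raw.get("author") or "").strip()
    if not text or not author:
        return None
    # Normalise tags: trim, lowercase, drop blanks and duplicates, sort.
    tags = tuple(sorted({t.strip().lower() for t in (raw.get("tags") or []) if t.strip()}))
    return Quote(text, author, tags)


def to_csv(rows: list[dict]) -> str:
    """Validate rows, drop duplicates across pages, and render CSV text."""
    seen = set()
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["quote", "author", "tags"])
    for raw in rows:
        quote = validate(raw)
        if quote is None or (quote.text, quote.author) in seen:
            continue
        seen.add((quote.text, quote.author))
        writer.writerow([quote.text, quote.author, "|".join(quote.tags)])
    return buf.getvalue()
```

Deduplicating on the `(text, author)` pair matters for multi-page scrapes, where the same quote can appear on more than one page.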


FinPull (Upcoming)

Financial data pipeline pulling OHLCV, earnings, P/E ratios, and analyst ratings into structured datasets.

Playwright + yfinance → FastAPI → PostgreSQL → REST API + Next.js dashboard
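
Since FinPull is still upcoming, the following is only a guess at what one normalisation step could look like before rows reach PostgreSQL: a sanity check on an OHLCV bar and a price-to-earnings calculation. `Bar`, `is_consistent`, and `pe_ratio` are hypothetical names, and the example does not call yfinance or Playwright.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Bar:
    symbol: str
    date: str  # ISO date string, e.g. "2025-01-02"
    open: float
    high: float
    low: float
    close: float
    volume: int


def is_consistent(bar: Bar) -> bool:
    """Check that low <= open/close <= high and volume is non-negative."""
    return (
        bar.low <= min(bar.open, bar.close)
        and max(bar.open, bar.close) <= bar.high
        and bar.volume >= 0
    )


def pe_ratio(price: float, eps: float) -> Optional[float]:
    """Price divided by earnings per share; None when EPS is zero or negative."""
    return price / eps if eps > 0 else None
```

Rejecting inconsistent bars at ingestion keeps bad scrapes out of the dashboard, and returning `None` for non-positive EPS avoids reporting a misleading ratio.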

Playwright yfinance FastAPI PostgreSQL Next.js

Target: Traders · Analysts · Portfolio dashboards


Production Tech Stack

| Layer | Tools |
| --- | --- |
| Scraping | Playwright · BeautifulSoup · Asyncio · Proxy rotation |
| Data | Pandas · NumPy · Parquet/JSONL exports |
| Backend | FastAPI · PostgreSQL · APScheduler · JWT |
| Frontend | Next.js 15 · Tailwind · TypeScript |
| Infra | Railway · Vercel · Supabase · Docker |

freeCodeCamp Certified: Responsive Web Design (Mar 2024) · Scientific Computing with Python (Nov 2025)


Hire Me → Fiverr

Custom scrapers · ETL pipelines · Data platforms · REST APIs

Pinned

  1. Goodreads-quote-scraper Public

    Scrapes quotes from goodreads.com/quotes and saves to CSV.

    Python 1

  2. Anti-Scraping-Playwright Public

    Web scraping framework that bypasses common anti-bot measures using advanced evasion techniques.

    Python