pretraining

Here are 286 public repositories matching this topic...

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

python machine-learning natural-language-processing ai deep-learning tokenizer transformers pytorch artificial-intelligence gpt language-model attention-mechanism from-scratch finetuning pretraining large-language-models llm generative-ai instruction-tuning

Updated May 19, 2026
Jupyter Notebook

LlamaChinese / Llama-Chinese

Star

Llama中文社区，实时汇总最新Llama学习资料，构建最好的中文Llama大模型开源生态，完全开源可商用

agent rl llama pretraining llm llama4

Updated Apr 6, 2025
Python

microsoft / LMOps

Star

General technology for enabling AI capabilities w/ LLMs and MLLMs

nlp prompt agi lm gpt language-model pretraining llm promptist x-prompt lmops

Updated May 20, 2026
Python

OFA-Sys / OFA

Star

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

prompt chinese image-captioning pretrained-models visual-question-answering multimodal text-to-image-synthesis vision-language pretraining referring-expression-comprehension prompt-tuning

Updated Apr 24, 2024
Python

X-PLUG / mPLUG-Owl

Star

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Updated Apr 2, 2025
Python

ChandlerBang / awesome-self-supervised-gnn

Star

Papers about pretraining and self-supervised learning on Graph Neural Networks (GNN).

machine-learning deep-learning graph-mining graph-neural-networks self-supervised-learning pre-training pretraining graph-self-supervised-learning

Updated Feb 2, 2024
Python

[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"

Updated Jan 23, 2024
Python

yuewang-cuhk / awesome-vision-language-pretraining-papers

Star

Recent Advances in Vision and Language PreTrained Models (VL-PTMs)

bert vision-and-language multimodal-deep-learning pretraining vl-ptms

Updated Aug 19, 2022

deepmodeling / Uni-Mol

Star

Official Repository for the Uni-Mol Series Methods

deep-learning molecular-modeling pre-trained-model pretraining

Updated May 29, 2025
Python

qqlu / Entity

Star

EntitySeg Toolbox: Towards Open-World and High-Quality Image Segmentation

computer-vision deep-learning cnn pytorch segmentation object-detection pretrained-models image-segmentation semantic-segmentation pretrained-weights instance-segmentation panoptic-segmentation fcos pretraining detectron2 condinst

Updated Nov 30, 2023
Jupyter Notebook

InternScience / GraphGen

Star

GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation

qa knowledge-graph data-generation question-answering data-synthesis sft pretrain pretraining graphgen ai4science llm llm-training qwen xtuner llama-factory sft-data

Updated May 19, 2026
Python

YehLi / xmodaler

Star

X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).

image-captioning video-captioning visual-question-answering vision-and-language cross-modal-retrieval pretraining tden

Updated Feb 27, 2023
Python

seal-rg / recurrent-pretraining

Star

Pretraining and inference code for a large-scale depth-recurrent language model

reasoning pretraining llms recurrent-depth

Updated Dec 29, 2025
Python

PKU-YuanGroup / LanguageBind

Star

【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

multi-modal zero-shot pretraining language-central

Updated Mar 25, 2024
Python

zubair-irshad / Awesome-Robotics-3D

Star

A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vision, including papers, codes, and related websites

computer-vision robotics navigation benchmarks simulations manipulation scene-graph grasping nerf 3d pointclouds vlm diffusion-models pretraining policy-learning foundation-models llm vision-language-model gaussian-splatting

Updated Dec 17, 2025

Alibaba-MIIL / ImageNet21K

Star

Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(NeurIPS, 2021) paper

mixer multi-label-classification downstream pretraining vision-transformer imagenet21k semantic-softmax single-label

Updated Jan 11, 2023
Python

AGI-Arena / MARS

Star

The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models

optimizer optimization-algorithms fine-tuning pretraining large-language-models

Updated Mar 26, 2026
Python

Coobiw / MPP-LLaVA

Star

Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train your own 8B/14B LLaVA-training-like MLLM on RTX3090/4090 24GB.

fine-tuning pipeline-parallelism pretraining model-parallel deepspeed mllm multimodal-large-language-models qwen video-large-language-models video-language-model

Updated Mar 10, 2025
Jupyter Notebook

alibaba / Megatron-LLaMA

Star

Best practice for training LLaMA models in Megatron-LM

pytorch llama distributed-training pretraining deepspeed megatron-lm llm

Updated Jan 2, 2024
Python

cxcscmu / Craw4LLM

Star

Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"

crawler web-crawler crawling web-crawling pre-training pretraining large-language-models llm

Updated Feb 24, 2025
Python

Improve this page

Add a description, image, and links to the pretraining topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the pretraining topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pretraining

Here are 286 public repositories matching this topic...

rasbt / LLMs-from-scratch

LlamaChinese / Llama-Chinese

microsoft / LMOps

OFA-Sys / OFA

X-PLUG / mPLUG-Owl

ChandlerBang / awesome-self-supervised-gnn

keyu-tian / SparK

yuewang-cuhk / awesome-vision-language-pretraining-papers

deepmodeling / Uni-Mol

qqlu / Entity

InternScience / GraphGen

YehLi / xmodaler

seal-rg / recurrent-pretraining

PKU-YuanGroup / LanguageBind

zubair-irshad / Awesome-Robotics-3D

Alibaba-MIIL / ImageNet21K

AGI-Arena / MARS

Coobiw / MPP-LLaVA

alibaba / Megatron-LLaMA

cxcscmu / Craw4LLM

Improve this page

Add this topic to your repo