Controllable Text-to-Image Generation Datasets
Noah-Wukong Dataset
Zero: Fine-tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Flickr 30k Dataset
Visual Genome Dataset
Conceptual Captions(CC) Dataset
YFCC100M Dataset
ALT200M Dataset
LAION-400M Dataset
LAION-5B Dataset
Wikipedia-based Image Text (WIT) Dataset
TaiSu (太素): a hundred-million-scale Chinese vision-language pre-training dataset
COYO-700M: Large-scale Image-Text Pair Dataset
WIT: Wikipedia-based Image Text Dataset
DiffusionDB
# Get this repo
git clone https://github.com/nightrome/cocostuff.git
cd cocostuff
# Download everything
wget --directory-prefix=downloads http://images.cocodataset.org/zips/train2017.zip
wget --directory-prefix=downloads http://images.cocodataset.org/zips/val2017.zip
wget --directory-prefix=downloads http://calvin.inf.ed.ac.uk/wp-content/uploads/data/cocostuffdataset/stuffthingmaps_trainval2017.zip
# Unpack everything
mkdir -p dataset/images
mkdir -p dataset/annotations
unzip downloads/train2017.zip -d dataset/images/
unzip downloads/val2017.zip -d dataset/images/
unzip downloads/stuffthingmaps_trainval2017.zip -d dataset/annotations/
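After unpacking, a quick sanity check can confirm that the images and annotation maps landed where expected. The sketch below is only illustrative: the helper name `count_files` is hypothetical, and it assumes each archive unpacks into `train2017` and `val2017` subfolders containing `.jpg` images and `.png` annotation maps.

```shell
# Hypothetical helper: count files with a given extension directly under a directory.
# Prints 0 if the directory does not exist.
count_files() {
    dir=$1
    ext=$2
    if [ ! -d "$dir" ]; then
        echo 0
        return 0
    fi
    find "$dir" -maxdepth 1 -type f -name "*.$ext" | wc -l | tr -d ' '
}

# Assumed layout: each archive unpacks into train2017/ and val2017/ subfolders.
for split in train2017 val2017; do
    echo "$split images:      $(count_files "dataset/images/$split" jpg)"
    echo "$split annotations: $(count_files "dataset/annotations/$split" png)"
done
```

If the counts for a split differ noticeably, a download or unzip step may have been interrupted and is worth repeating.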
1. Download hfd
wget https://hf-mirror.com/hfd/hfd.sh
chmod a+x hfd.sh
2. Set the environment variable
export HF_ENDPOINT=https://hf-mirror.com
3.1 Download a model
./hfd.sh gpt2 --tool aria2c -x 4
3.2 Download a dataset
./hfd.sh yuvalkirstain/pickapic_v1 --dataset --tool aria2c -x 4
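Because the hfd commands above pass `--tool aria2c`, aria2c has to be on the PATH before they are run. A minimal pre-flight check might look like the following; `require_tool` is a hypothetical helper name, not part of hfd.

```shell
# Hypothetical helper: succeed only if the named command is on PATH.
require_tool() {
    if command -v "$1" >/dev/null 2>&1; then
        return 0
    fi
    echo "missing required tool: $1" >&2
    return 1
}

# The hfd commands above use --tool aria2c, so check for aria2c first.
if require_tool aria2c; then
    echo "aria2c found; hfd.sh can be run with --tool aria2c"
else
    echo "install aria2c before running hfd.sh with --tool aria2c"
fi
```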
DeepFashion-MultiModal
DeepFashion
COCO(COCO Captions) Dataset
CUB-200-2011 Dataset
102 Category Flower Dataset
Flickr8k Dataset
Flickr8k_Dataset.zip https://github.com/jbrownlee/Datasets/releases/download/Flickr8k/Flickr8k_Dataset.zip
Flickr8k_text.zip https://github.com/jbrownlee/Datasets/releases/download/Flickr8k/Flickr8k_text.zip
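The two Flickr8k archives can be fetched in the same style as the COCO-Stuff steps earlier in this list. The following is a sketch under stated assumptions: the helper names `flickr8k_url` and `fetch_flickr8k` and the target directory `dataset/flickr8k/` are hypothetical, while the URLs are the two listed above.

```shell
# Release base URL shared by the two links listed above.
FLICKR8K_BASE=https://github.com/jbrownlee/Datasets/releases/download/Flickr8k

# Hypothetical helper: print the download URL for one archive name.
flickr8k_url() {
    echo "$FLICKR8K_BASE/$1"
}

# Hypothetical helper: download both archives into downloads/ and unpack them
# into dataset/flickr8k/ (an assumed directory name, mirroring the COCO-Stuff layout).
fetch_flickr8k() {
    mkdir -p downloads dataset/flickr8k
    for name in Flickr8k_Dataset.zip Flickr8k_text.zip; do
        wget --directory-prefix=downloads "$(flickr8k_url "$name")" || return 1
        # -n: never overwrite files that already exist, so unzip does not prompt.
        unzip -n "downloads/$name" -d dataset/flickr8k/ || return 1
    done
}
```

Nothing is downloaded until `fetch_flickr8k` is called explicitly.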
Nouns Dataset: Nouns dataset card with automatically generated captions
OxfordTVG-HIC Dataset: Large-scale humorous image-text dataset
Multi-Modal-CelebA-HQ: Large-scale face image-text dataset