Download PuMer - Download do código-fonte PuMer

PuMer

Outro código-fonte

1.0.0

Baixar

PuMer (ACL 2023)

Este repositório é a implementação oficial do artigo "PuMer: Pruning and Merging Tokens for Efficient Vision Language Models", artigo

Uso

Instalar

instale o miniforge (igual ao conda, mais portátil) crie um ambiente python: conda env create -f env.yaml , ative-o: conda activate pumer clone este repositório: [email protected]:csarron/pumer.git

teste cuda: python -c "import torch;print(torch.cuda.is_available())"

obtenha o ambiente da tocha: python -m torch.utils.collect_env

instalar: pip install -e .

para fins de desenvolvimento local: pip install -e ".[dev]"

O env-frozen.yaml é gerado via conda env export | grep -v "^prefix: | pumer==" > env-frozen.yaml

Preparar dados e modelos pré-treinados

veja notas/data.md para pré-processamento de dados

consulte cli/prep/convert_ckpt.py para converter pontos de verificação originais pré-treinados METER e ViLT

abaixo está o layout do arquivo após a preparação:

# tree -h data
├── [4.0K]  ckpt
│   └── [4.0K]  converted
│       ├── [4.0K]  meter_pretrain_384
│       │   ├── [ 674]  config.json
│       │   └── [1.3G]  pytorch_model.bin
│       ├── [4.0K]  meter_pretrain_irtr_384
│       │   ├── [ 729]  config.json
│       │   └── [1.2G]  pytorch_model.bin
│       ├── [4.0K]  meter_pretrain_nlvr2_288
│       │   ├── [ 674]  config.json
│       │   └── [1.3G]  pytorch_model.bin
│       ├── [4.0K]  vilt_pretrain
│       │   ├── [ 619]  config.json
│       │   └── [518M]  pytorch_model.bin
│       ├── [4.0K]  vilt_pretrain_irtr
│       │   ├── [ 718]  config.json
│       │   └── [426M]  pytorch_model.bin
│       └── [4.0K]  vilt_pretrain_nlvr2
│           ├── [ 619]  config.json
│           └── [518M]  pytorch_model.bin
├── [4.0K]  datasets
│   ├── [4.0K]  irtr
│   │   ├── [390K]  flickr30k-test.jsonl
│   │   ├── [ 11M]  flickr30k-train.jsonl
│   │   ├── [397K]  flickr30k-val.jsonl
│   │   ├── [ 10M]  mscoco-restval.jsonl
│   │   ├── [1.7M]  mscoco-test.jsonl
│   │   ├── [ 28M]  mscoco-train.jsonl
│   │   └── [1.7M]  mscoco-val.jsonl
│   ├── [4.0K]  nlvr2
│   │   ├── [3.6M]  dev.json
│   │   ├── [3.6M]  test1.json
│   │   └── [ 39M]  train.json
│   ├── [4.0K]  snli-ve
│   │   ├── [ 16M]  snli_ve_dev.jsonl
│   │   ├── [ 16M]  snli_ve_test.jsonl
│   │   └── [464M]  snli_ve_train.jsonl
│   └── [4.0K]  vqa2
│       ├── [ 57K]  vqa2_ans2label.json
│       ├── [ 39K]  vqa2_label2ans.json
│       ├── [161K]  vqa2-small.jsonl
│       ├── [ 45M]  vqa2-test2015.jsonl
│       ├── [ 71M]  vqa2-train2014.jsonl
│       └── [ 34M]  vqa2-val2014.jsonl
└── [4.0K]  lmdb
    ├── [ 13G]  coco-test2015.lmdb
    ├── [ 19G]  coco-trainval2014.lmdb
    ├── [4.2G]  flickr30k_images.lmdb
    ├── [837M]  nlvr2-dev.lmdb
    ├── [837M]  nlvr2-test1.lmdb
    └── [ 11G]  nlvr2-train.lmdb

Treinamento e Avaliação

veja notas/cmd.md para exemplo de uso;

verifique https://huggingface.co/csarron para pontos de verificação finos: ( -ft é o modelo original ajustado, p0.x-r0.x-t0.x-xxx é nosso modelo PuMer)

vilt-vqa2-ft
vilt-vqa2-p0.1-r0.3-t0.2-258
vilt-ve-ft 
vilt-ve-p0.1r0.3t0.2-2468 
vilt-nlvr2-ft 
vilt-nlvr2-p0.1r0.3t0.2-258
meter-vqa2-ft
meter-vqa2-p0.2r0.2t0.2-0246
meter-ve-ft 
meter-ve-p0.3r0.5t0.2-0246 
meter-nlvr2-ft 
meter-nlvr2-p0.3r0.5t0.2-246

Perfil de FLOPs

veja notas/perfil.md

Perguntas frequentes

defina TRANSFORMERS_OFFLINE=1 após o primeiro uso, caso contrário, em algum momento ele reportará erro 504 devido à pesquisa sempre online.

Diversos

ignore o código em src/pumer/model/pruner.py (obsoleto e não utilizado), precisa de limpeza
a base de código atual contém muita confusão e código experimental que não está relacionado à implementação do PuMer, ignore isso.

Citação

 @inproceedings{cao-etal-2023-pumer, title = "{P}u{M}er: Podando e mesclando tokens para modelos de linguagem de visão eficientes", autor = "Cao, Qingqing e Paranjape, Bhargavi e Hajishirzi, Hannaneh", booktitle = "Anais da 61ª Reunião Anual da Association for Computational Linguistics (Volume 1: Long Artigos)", mês = julho, ano = "2023", endereço = "Toronto, Canadá", editor = "Association for Computational Linguistics", url = "https://aclanthology.org/2023.acl-long.721" , páginas = "12890--12903",
}

Expandir

Informações adicionais

Versão 1.0.0
Tipo Outro código-fonte
Data da Última Atualização 2024-12-21
tamanho 89.9KB
Vindo de Github

Aplicativos Relacionados

waymo open dataset

2024-11-18
SmartTube

2024-12-14
Sunamu

2024-12-14
MySchedule.py

2024-12-15
viptools for eslam

2024-12-15
chat.petals.dev

2024-11-30

Recomendado para você

chat.petals.dev

Outro código-fonte

1.0.0
GPT Prompt Templates

Outro código-fonte

1.0.0
GPTyped

Outro código-fonte

GPTyped 1.0.5
waymo open dataset

Outro código-fonte

December 2023 Update
SmartTube

Outro código-fonte

24.71 Stable
Sunamu

Outro código-fonte

Release 2.2.0
waymo open dataset

Outro código-fonte

December 2023 Update
wp functions

Outras categorias

1.0.0
termwind

Outras categorias

v2.3.0

Informações Relacionadas Todos