Unduh marqo ecommerce embeddings - pengunduhan kode sumber marqo ecommerce embeddings

marqo ecommerce embeddings

Kode sumber lainnya

1.0.0

Unduh

Model Penyematan E-niaga Marqo

Dalam karya ini, kami memperkenalkan dua model penyematan canggih untuk produk e-niaga: Marqo-Ecommerce-B dan Marqo-Ecommerce-L.

Hasil benchmarking menunjukkan bahwa model Marqo-Ecommerce secara konsisten mengungguli semua model lainnya dalam berbagai metrik. Secara khusus, marqo-ecommerce-L mencapai peningkatan rata-rata sebesar 17,6% pada MRR dan 20,5% pada nDCG@10 jika dibandingkan dengan model sumber terbuka terbaik saat ini, ViT-SO400M-14-SigLIP pada ketiga tugas di marqo-ecommerce-hard kumpulan data marqo-ecommerce-hard . Jika dibandingkan dengan model privat terbaik, Amazon-Titan-Multimodal , kami melihat peningkatan rata-rata sebesar 38,9% pada MRR dan 45,1% pada nDCG@10 pada ketiga tugas, dan 35,9% pada Recall pada seluruh tugas Text-to-Image di kumpulan data marqo-ecommerce-hard .

Hasil benchmarking selengkapnya dapat ditemukan di bawah.

Konten yang Dirilis :

Model penyematan Marqo-Ecommerce-B dan Marqo-Ecommerce-L
GoogleShopping-1m dan AmazonProducts-3m untuk evaluasi
Kode Evaluasi

visual multi-terpisah

Model

Model Penyematan	#Param (m)	Dimensi	Memeluk Wajah	Unduh .pt	Inferensi Teks Batch Tunggal (A10g)	Inferensi Gambar Batch Tunggal (A10g)
Marqo-Ecommerce-B	203	768	Marqo/marqo-ecommerce-embeddings-B	link	5,1 ms	5,7 ms
Marqo-Ecommerce-L	652	1024	Marqo/marqo-ecommerce-embeddings-L	link	10,3 ms	11,0 mdtk

Muat dari HuggingFace dengan OpenCLIP

Untuk memuat model di OpenCLIP, lihat di bawah. Model dihosting di Hugging Face dan dimuat menggunakan OpenCLIP. Anda juga dapat menemukan kode ini di dalam run_models.py .

 pip install open_clip_torch

 from PIL import Image
import open_clip
import requests
import torch

# Specify model from Hugging Face Hub
model_name = 'hf-hub:Marqo/marqo-ecommerce-embeddings-L'
model , preprocess_train , preprocess_val = open_clip . create_model_and_transforms ( model_name )
tokenizer = open_clip . get_tokenizer ( model_name )

# Preprocess the image and tokenize text inputs
# Load an example image from a URL
img = Image . open ( requests . get ( 'https://raw.githubusercontent.com/marqo-ai/marqo-ecommerce-embeddings/refs/heads/main/images/dining-chairs.png' , stream = True ). raw )
image = preprocess_val ( img ). unsqueeze ( 0 )
text = tokenizer ([ "dining chairs" , "a laptop" , "toothbrushes" ])

# Perform inference
with torch . no_grad (), torch . cuda . amp . autocast ():
    image_features = model . encode_image ( image , normalize = True )
    text_features = model . encode_text ( text , normalize = True )

    # Calculate similarity probabilities
    text_probs = ( 100.0 * image_features @ text_features . T ). softmax ( dim = - 1 )

# Display the label probabilities
print ( "Label probs:" , text_probs )
# [1.0000e+00, 8.3131e-12, 5.2173e-12]

Muat dari HuggingFace dengan trafo

Untuk memuat model di Transformers, lihat di bawah. Model dihosting di Hugging Face dan dimuat menggunakan Transformers.

 from transformers import AutoModel , AutoProcessor
import torch
from PIL import Image
import requests

model_name = 'Marqo/marqo-ecommerce-embeddings-L'
# model_name = 'Marqo/marqo-ecommerce-embeddings-B'

model = AutoModel . from_pretrained ( model_name , trust_remote_code = True )
processor = AutoProcessor . from_pretrained ( model_name , trust_remote_code = True )

img = Image . open ( requests . get ( 'https://raw.githubusercontent.com/marqo-ai/marqo-ecommerce-embeddings/refs/heads/main/images/dining-chairs.png' , stream = True ). raw ). convert ( "RGB" )
image = [ img ]
text = [ "dining chairs" , "a laptop" , "toothbrushes" ]
processed = processor ( text = text , images = image , padding = 'max_length' , return_tensors = "pt" )
processor . image_processor . do_rescale = False
with torch . no_grad ():
    image_features = model . get_image_features ( processed [ 'pixel_values' ], normalize = True )
    text_features = model . get_text_features ( processed [ 'input_ids' ], normalize = True )

    text_probs = ( 100 * image_features @ text_features . T ). softmax ( dim = - 1 )
    
print ( text_probs )
# [1.0000e+00, 8.3131e-12, 5.2173e-12]

Evaluasi

Generalized Contrastive Learning (GCL) digunakan untuk evaluasi. Kode berikut juga dapat ditemukan di scripts .

 git clone https://github.com/marqo-ai/GCL

Instal paket yang dibutuhkan oleh GCL.

1. Pengambilan Gambar GoogleShopping-Text2.

 cd ./GCL
MODEL=hf-hub:Marqo/marqo-ecommerce-B
outdir=MarqoModels/GE/marqo-ecommerce-B/gs-title2image
mkdir -p $outdir
hfdataset=Marqo/google-shopping-general-eval
python  evals/eval_hf_datasets_v1.py 
      --model_name $MODEL 
      --hf-dataset $hfdataset 
      --output-dir $outdir 
      --batch-size 1024 
      --num_workers 8 
      --left-key "['title']" 
      --right-key "['image']" 
      --img-or-txt "[['txt'], ['img']]" 
      --left-weight "[1]" 
      --right-weight "[1]" 
      --run-queries-cpu 
      --top-q 4000 
      --doc-id-key item_ID 
      --context-length "[[64], [0]]"

2. Pengambilan Gambar Kategori Belanja Google2.

 cd ./GCL
MODEL=hf-hub:Marqo/marqo-ecommerce-B
outdir=MarqoModels/GE/marqo-ecommerce-B/gs-cat2image
mkdir -p $outdir
hfdataset=Marqo/google-shopping-general-eval
python  evals/eval_hf_datasets_v1.py 
      --model_name $MODEL 
      --hf-dataset $hfdataset 
      --output-dir $outdir 
      --batch-size 1024 
      --num_workers 8 
      --left-key "['query']" 
      --right-key "['image']" 
      --img-or-txt "[['txt'], ['img']]" 
      --left-weight "[1]" 
      --right-weight "[1]" 
      --run-queries-cpu 
      --top-q 4000 
      --doc-id-key item_ID 
      --context-length "[[64], [0]]"

3. Pengambilan Gambar AmazonProducts-Category2.

 cd ./GCL
MODEL=hf-hub:Marqo/marqo-ecommerce-B
outdir=MarqoModels/GE/marqo-ecommerce-B/ap-title2image
mkdir -p $outdir
hfdataset=Marqo/amazon-products-eval
python  evals/eval_hf_datasets_v1.py 
      --model_name $MODEL 
      --hf-dataset $hfdataset 
      --output-dir $outdir 
      --batch-size 1024 
      --num_workers 8 
      --left-key "['title']" 
      --right-key "['image']" 
      --img-or-txt "[['txt'], ['img']]" 
      --left-weight "[1]" 
      --right-weight "[1]" 
      --run-queries-cpu 
      --top-q 4000 
      --doc-id-key item_ID 
      --context-length "[[64], [0]]"

Performa Terperinci

Proses pembandingan kami dibagi menjadi dua sistem berbeda, masing-masing menggunakan kumpulan data daftar produk e-niaga yang berbeda: marqo-ecommerce-hard dan marqo-ecommerce-easy. Kedua kumpulan data tersebut berisi gambar dan teks produk dan hanya berbeda ukurannya. Kumpulan data "mudah" berukuran sekitar 10-30 kali lebih kecil (200 ribu vs 4 juta produk), dan dirancang untuk mengakomodasi model dengan tarif terbatas, khususnya Cohere-Embeddings-v3 dan GCP-Vertex (dengan batas masing-masing 0,66 rps dan 2 rps). Kumpulan data "keras" mewakili tantangan sebenarnya, karena berisi empat juta daftar produk e-niaga dan lebih mewakili skenario penelusuran e-niaga di dunia nyata.

Dalam kedua skenario ini, model dibandingkan dengan tiga tugas berbeda:

Teks-ke-Gambar Google Belanja
Kategori-ke-Gambar Google Belanja
Produk Amazon Teks-ke-Gambar

Marqo-Ecommerce-Keras

Marqo-Ecommerce-Hard mengkaji evaluasi komprehensif yang dilakukan menggunakan 4 juta kumpulan data, yang menyoroti kinerja kuat model kami dalam konteks dunia nyata.

Pengambilan Gambar GoogleShopping-Text2.

Model Penyematan	peta	R@10	MRR	nDCG@10
Marqo-Ecommerce-L	0,682	0,878	0,683	0,726
Marqo-Ecommerce-B	0,623	0,832	0,624	0,668
ViT-SO400M-14-SigLip	0,573	0,763	0,574	0,613
ViT-L-16-SigLip	0,540	0,722	0,540	0,577
ViT-B-16-SigLip	0,476	0,660	0,477	0,513
Amazon-Titan-MultiModal	0,475	0,648	0,475	0,509
Jina-V1-KLIP	0,285	0,402	0,285	0,306

Pengambilan Gambar Kategori-Belanja Google2.

Model Penyematan	peta	P@10	MRR	nDCG@10
Marqo-Ecommerce-L	0,463	0,652	0,822	0,666
Marqo-Ecommerce-B	0,423	0,629	0,810	0,644
ViT-SO400M-14-SigLip	0,352	0,516	0,707	0,529
ViT-L-16-SigLip	0,324	0,497	0,687	0,509
ViT-B-16-SigLip	0,277	0,458	0,660	0,473
Amazon-Titan-MultiModal	0,246	0,429	0,642	0,446
Jina-V1-KLIP	0,123	0,275	0,504	0,294

Pengambilan Gambar AmazonProducts-Text2.

Model Penyematan	peta	R@10	MRR	nDCG@10
Marqo-Ecommerce-L	0,658	0,854	0,663	0,703
Marqo-Ecommerce-B	0,592	0,795	0,597	0,637
ViT-SO400M-14-SigLip	0,560	0,742	0,564	0,599
ViT-L-16-SigLip	0,544	0,715	0,548	0,580
ViT-B-16-SigLip	0,480	0,650	0,484	0,515
Amazon-Titan-MultiModal	0,456	0,627	0,457	0,491
Jina-V1-KLIP	0,265	0,378	0,266	0,285

Marqo-Ecommerce-Mudah

Seperti disebutkan, proses benchmarking kami dibagi menjadi dua skenario berbeda: marqo-ecommerce-hard dan marqo-ecommerce-easy. Bagian ini mencakup model terakhir yang memiliki fitur korpus 10-30 kali lebih kecil dan dirancang untuk mengakomodasi model dengan tarif terbatas. Kami akan melihat evaluasi komprehensif yang dilakukan menggunakan 200 ribu produk penuh di kedua kumpulan data. Selain model yang sudah diukur di atas, tolok ukur ini juga mencakup Cohere-embedding-v3 dan GCP-Vertex.

Pengambilan Gambar GoogleShopping-Text2.

Model Penyematan	peta	R@10	MRR	nDCG@10
Marqo-Ecommerce-L	0,879	0,971	0,879	0,901
Marqo-Ecommerce-B	0,842	0,961	0,842	0,871
ViT-SO400M-14-SigLip	0,792	0,935	0,792	0,825
GCP-Vertex	0,740	0,910	0,740	0,779
ViT-L-16-SigLip	0,754	0,907	0,754	0,789
ViT-B-16-SigLip	0,701	0,870	0,701	0,739
Amazon-Titan-MultiModal	0,694	0,868	0,693	0,733
Jina-V1-KLIP	0,480	0,638	0,480	0,511
Kohere-penyematan-v3	0,358	0,515	0,358	0,389

Pengambilan Gambar Kategori-Belanja Google2.

Model Penyematan	peta	P@10	MRR	nDCG@10
Marqo-Ecommerce-L	0,515	0,358	0,764	0,590
Marqo-Ecommerce-B	0,479	0,336	0,744	0,558
ViT-SO400M-14-SigLip	0,423	0,302	0,644	0,487
GCP-Vertex	0,417	0,298	0,636	0,481
ViT-L-16-SigLip	0,392	0,281	0,627	0,458
ViT-B-16-SigLip	0,347	0,252	0,594	0,414
Amazon-Titan-MultiModal	0,308	0,231	0,558	0,377
Jina-V1-KLIP	0,175	0,122	0,369	0,229
Kohere-penyematan-v3	0,136	0,110	0,315	0,178

Pengambilan Gambar AmazonProducts-Text2.

Model Penyematan	peta	R@10	MRR	nDCG@10
Marqo-Ecommerce-L	0,92	0,978	0,928	0,940
Marqo-Ecommerce-B	0,897	0,967	0,897	0,914
ViT-SO400M-14-SigLip	0,860	0,954	0,860	0,882
ViT-L-16-SigLip	0,842	0,940	0,842	0,865
GCP-Vertex	0,808	0,933	0,808	0,837
ViT-B-16-SigLip	0,797	0,917	0,797	0,825
Amazon-Titan-MultiModal	0,762	0,889	0,763	0,791
Jina-V1-KLIP	0,530	0,699	0,530	0,565
Kohere-penyematan-v3	0,433	0,597	0,433	0,465

Kutipan

 @software{zhu2024marqoecommembed_2024,
        author = {Tianyu Zhu and and Jesse Clark},
        month = oct,
        title = {{Marqo Ecommerce Embeddings - Foundation Model for Product Embeddings}},
        url = {https://github.com/marqo-ai/marqo-ecommerce-embeddings/},
        version = {1.0.0},
        year = {2024}
        }

Memperluas

Informasi Tambahan

Versi 1.0.0
Tipe Kode sumber lainnya
Waktu Pembaruan 2024-12-04
ukuran 229.23KB
Berasal dari Github

Aplikasi Terkait

GitHub sgrebnov/cordova plugin background download

2024-11-05
Wa ch ull navra maza navsacha 2 2024 ull ovie Fr e Online On Strea ings

2024-11-03
Wa ch navra maza navsacha 2 2024 ull ovie Online For Fr e Strea ings At Home

2024-11-03
Wa ch the greatest of all time 2024 ull ovie Online For Fr e Strea ings At Home

2024-11-02
wolfs 2024 f llmo ie f lmyz lla dow load ree 7 0p 4 0p a d 10 0p

2024-11-01
GitHub actions/download artifact

2024-11-01

Direkomendasikan untuk Anda

chat.petals.dev

Kode sumber lainnya

1.0.0
GPT Prompt Templates

Kode sumber lainnya

1.0.0
GPTyped

Kode sumber lainnya

GPTyped 1.0.5
waymo open dataset

Kode sumber lainnya

December 2023 Update
SmartTube

Kode sumber lainnya

24.71 Stable
Sunamu

Kode sumber lainnya

Release 2.2.0
waymo open dataset

Kode sumber lainnya

December 2023 Update
wp functions

Kategori lainnya

1.0.0
termwind

Kategori lainnya

v2.3.0

Informasi Terkait Semua