deformable attention 다운로드 - deformable attention 소스 코드 다운로드

deformable attention

AI 소스 코드

0.0.19

다운로드

변형 가능한 주의

Pytorch에서 이 문서의 Deformable Attention을 구현했는데, 이는 DETR에서 제안된 것보다 개선된 것으로 보입니다. 상대 위치 임베딩도 SwinV2에서 제안된 연속 위치 임베딩을 사용하여 더 나은 외삽을 위해 수정되었습니다.

설치하다

$ pip install deformable-attention

용법

 import torch
from deformable_attention import DeformableAttention

attn = DeformableAttention (
    dim = 512 ,                   # feature dimensions
    dim_head = 64 ,               # dimension per head
    heads = 8 ,                   # attention heads
    dropout = 0. ,                # dropout
    downsample_factor = 4 ,       # downsample factor (r in paper)
    offset_scale = 4 ,            # scale of offset, maximum offset
    offset_groups = None ,        # number of offset groups, should be multiple of heads
    offset_kernel_size = 6 ,      # offset kernel size
)

x = torch . randn ( 1 , 512 , 64 , 64 )
attn ( x ) # (1, 512, 64, 64)

3D 변형 주의

 import torch
from deformable_attention import DeformableAttention3D

attn = DeformableAttention3D (
    dim = 512 ,                          # feature dimensions
    dim_head = 64 ,                      # dimension per head
    heads = 8 ,                          # attention heads
    dropout = 0. ,                       # dropout
    downsample_factor = ( 2 , 8 , 8 ),      # downsample factor (r in paper)
    offset_scale = ( 2 , 8 , 8 ),           # scale of offset, maximum offset
    offset_kernel_size = ( 4 , 10 , 10 ),   # offset kernel size
)

x = torch . randn ( 1 , 512 , 10 , 32 , 32 ) # (batch, dimension, frames, height, width)
attn ( x ) # (1, 512, 10, 32, 32)

좋은 측정을 위한 1d 변형 가능 주의

 import torch
from deformable_attention import DeformableAttention1D

attn = DeformableAttention1D (
    dim = 128 ,
    downsample_factor = 4 ,
    offset_scale = 2 ,
    offset_kernel_size = 6
)

x = torch . randn ( 1 , 128 , 512 )
attn ( x ) # (1, 128, 512)

소환

 @misc { xia2022vision ,
    title   = { Vision Transformer with Deformable Attention } , 
    author  = { Zhuofan Xia and Xuran Pan and Shiji Song and Li Erran Li and Gao Huang } ,
    year    = { 2022 } ,
    eprint  = { 2201.00520 } ,
    archivePrefix = { arXiv } ,
    primaryClass = { cs.CV }
}

 @misc { liu2021swin ,
    title   = { Swin Transformer V2: Scaling Up Capacity and Resolution } ,
    author  = { Ze Liu and Han Hu and Yutong Lin and Zhuliang Yao and Zhenda Xie and Yixuan Wei and Jia Ning and Yue Cao and Zheng Zhang and Li Dong and Furu Wei and Baining Guo } ,
    year    = { 2021 } ,
    eprint  = { 2111.09883 } ,
    archivePrefix = { arXiv } ,
    primaryClass = { cs.CV }
}