(Relative) Position Encoding
Stand-Alone Self-Attention in Vision Models-NeurIPS 2019

[Main network structure](https://blog.csdn.net/baigalxy/article/details/93662282)
[Stand-Alone Self-Attention in Vision Models](https://zhuanlan.zhihu.com/p/361259616)
[paper](https://papers.nips.cc/paper/2019/file/3416a75f4cea9109507cacd8e2f2aefc-Paper.pdf)
[Is there any existing transformer work that uses 2D position encodings?](https://www.zhihu.com/question/476052139)
[Excellent experimental analysis | How should position embeddings in Transformers be understood?](http://www.360doc.com/content/22/0209/13/99071_1016566082.shtml)
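Stand-Alone Self-Attention injects position information through a factorized 2D relative embedding: the logit for a query pixel attending to a key pixel at offset (Δrow, Δcol) gains a term q·r, where r concatenates a learned row-offset embedding and a learned column-offset embedding. A minimal PyTorch sketch of that term (module and names are my own, not the authors' code):

import torch
import torch.nn as nn

class RelativeLogits2D(nn.Module):
    def __init__(self, dim, window):  # window = local attention extent k
        super().__init__()
        assert dim % 2 == 0
        # one embedding per row/column offset in [-(window-1), window-1]
        self.rel_rows = nn.Embedding(2 * window - 1, dim // 2)
        self.rel_cols = nn.Embedding(2 * window - 1, dim // 2)
        self.window = window

    def forward(self, q):
        # q: (batch, heads, H, W, dim) query vectors for each pixel
        k = self.window
        offs = torch.arange(2 * k - 1, device=q.device)
        rows = self.rel_rows(offs)                      # (2k-1, dim/2)
        cols = self.rel_cols(offs)                      # (2k-1, dim/2)
        # r[dy, dx] = concat(row_emb[dy], col_emb[dx]) -> (2k-1, 2k-1, dim)
        r = torch.cat([
            rows[:, None, :].expand(-1, 2 * k - 1, -1),
            cols[None, :, :].expand(2 * k - 1, -1, -1),
        ], dim=-1)
        # relative logits[..., dy, dx] = q . r_{dy,dx}, added to content logits
        return torch.einsum('bhijd,yxd->bhijyx', q, r)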
Self-Attention with Relative Position Representations-NAACL 2018

[Self-Attention with Relative Position Representations](https://www.jianshu.com/p/cb5b2d967e90)
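Shaw et al. add a learned embedding of the clipped relative distance j − i to each attention logit, i.e. e_ij = q_i·(k_j + a^K_{j−i}) / √d. A minimal sketch under my own naming (the paper also has a value-side term a^V, omitted here for brevity):

import torch
import torch.nn as nn
import torch.nn.functional as F

def attn_with_rel_pos(q, k, v, rel_k, max_dist):
    # q, k, v: (batch, seq, dim); rel_k: nn.Embedding(2*max_dist+1, dim)
    b, n, d = q.shape
    pos = torch.arange(n, device=q.device)
    # clip relative distance to [-max_dist, max_dist], shift to [0, 2*max_dist]
    rel = (pos[None, :] - pos[:, None]).clamp(-max_dist, max_dist) + max_dist
    a_k = rel_k(rel)                                        # (n, n, dim)
    logits = q @ k.transpose(-2, -1)                        # content-content term
    logits = logits + torch.einsum('bid,ijd->bij', q, a_k)  # content-position term
    attn = F.softmax(logits / d ** 0.5, dim=-1)
    return attn @ v

rel_k = nn.Embedding(2 * 16 + 1, 64)   # distances clipped to ±16
out = attn_with_rel_pos(torch.rand(2, 50, 64), torch.rand(2, 50, 64),
                        torch.rand(2, 50, 64), rel_k, max_dist=16)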
Rethinking and Improving Relative Position Encoding for Vision Transformer-ICCV 2021
   
[paper](https://houwenpeng.com/publications/iRPE.pdf)
[Paper notes: proposes four mappings for 2D relative position encoding and analyzes the key factors affecting its performance in Vision Transformers](https://baijiahao.baidu.com/s?id=1706147979229455598&wfr=spider&for=pc)
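As a rough illustration of the paper's "product" flavor of 2D relative encoding (my own simplification: plain clamping stands in for iRPE's piecewise bucketing), each head learns a bias table indexed by the (Δy, Δx) offset pair between query and key patches:

import torch
import torch.nn as nn

class Product2DBias(nn.Module):
    def __init__(self, heads, max_off):
        super().__init__()
        n = 2 * max_off + 1
        # one learnable bias per head per (Δy, Δx) bucket
        self.bias = nn.Parameter(torch.zeros(heads, n, n))
        self.max_off = max_off

    def forward(self, H, W):
        # returns (heads, H*W, H*W) bias to add to the attention logits
        ys, xs = torch.meshgrid(torch.arange(H), torch.arange(W), indexing='ij')
        pos = torch.stack([ys.flatten(), xs.flatten()], dim=-1)   # (HW, 2)
        d = (pos[None, :] - pos[:, None]).clamp(-self.max_off, self.max_off)
        d = d + self.max_off                                      # shift to indices
        return self.bias[:, d[..., 0], d[..., 1]]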
Rotary position encoding for 2D positions

[Kexue.FM (科学空间)](https://spaces.ac.cn/archives/8265)
[Transformer upgrade path: rotary position encoding for 2D positions](https://www.sohu.com/a/482611838_121119001)
[GitHub](https://github.com/ZhuiyiTechnology/roformer)
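The post's 2D construction can be realized by splitting the feature dimension in half and applying ordinary 1D RoPE to one half with the x coordinate and to the other half with y. A sketch following that recipe (function names are my own; for an H×W patch grid, xs and ys are the flattened column and row indices):

import torch

def rope_1d(t, pos, base=10000.0):
    # t: (..., n, d) with even d; pos: (n,) integer positions
    d = t.shape[-1]
    freqs = base ** (-torch.arange(0, d, 2, device=t.device, dtype=torch.float32) / d)
    ang = pos[:, None].float() * freqs[None, :]   # (n, d/2) rotation angles
    cos, sin = ang.cos(), ang.sin()
    t1, t2 = t[..., 0::2], t[..., 1::2]
    out = torch.empty_like(t)
    out[..., 0::2] = t1 * cos - t2 * sin          # rotate each 2D feature pair
    out[..., 1::2] = t1 * sin + t2 * cos
    return out

def rope_2d(t, xs, ys):
    # t: (..., n, d) with d divisible by 4; xs, ys: (n,) patch coordinates
    d = t.shape[-1]
    return torch.cat([rope_1d(t[..., : d // 2], xs),
                      rope_1d(t[..., d // 2:], ys)], dim=-1)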
[A PyTorch implementation of the 1d and 2d Sinusoidal positional encoding/embedding.](https://github.com/wzlxjtu/PositionalEncoding2D)
From https://pypi.org/project/positional-encodings/:
An implementation of 1D, 2D, and 3D sinusoidal positional encoding in PyTorch and TensorFlow.
pip install positional-encodings
import torch
from positional_encodings import PositionalEncoding1D, PositionalEncoding2D, PositionalEncoding3D, Summer
p_enc_1d_model = PositionalEncoding1D(10)               # returns only the encoding
p_enc_1d_model_sum = Summer(PositionalEncoding1D(10))   # returns input + encoding
x = torch.rand(1, 6, 10)                                # (batch, seq_len, channels)
penc_no_sum = p_enc_1d_model(x)
penc_sum = p_enc_1d_model_sum(x)
print(penc_no_sum + x == penc_sum)                      # element-wise True
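The same package ships 2D and 3D variants; per its README, PositionalEncoding2D expects a (batch, x, y, channels) tensor and returns an encoding of the same shape:

p_enc_2d_model = PositionalEncoding2D(8)
y = torch.rand(1, 4, 6, 8)
print(p_enc_2d_model(y).shape)   # torch.Size([1, 4, 6, 8])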


