TinyViT: Fast Pretraining Distillation for Small Vision Transformers

Author: JC · 2023.10.08 07:05

Summary: TinyViT: Fast Pretraining Distillation for Small Vision Transformers


As the amount of available data continues to grow, the use of large-scale pretrained models has shown impressive results in various tasks, particularly in the field of natural language processing. However, in the realm of computer vision, the application of such models has been challenging due to their prohibitive memory footprint and computational cost. To address this issue, several studies have explored the use of smaller variants of vision transformers (ViTs), but their performance has generally lagged behind that of their larger counterparts. In this paper, we propose TinyViT, a novel fast pretraining distillation framework for small ViTs that effectively addresses this gap.
TinyViT is built on the concept of knowledge distillation, a process of transferring the knowledge from a large, cumbersome teacher model to a smaller student model. The key advantage of this approach is that it enables us to capture the salient features of the teacher model while significantly reducing the computational requirements of the student model. Typically, distillation involves fine-tuning the parameters of the student model on a labeled dataset using the soft predictions of the teacher model as supervision. However, this approach can be computationally expensive and time-consuming.
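
To make this conventional recipe concrete, the sketch below shows a standard soft-target distillation loss, assuming PyTorch; the function name and hyperparameter defaults are illustrative and not taken from the paper. It blends cross-entropy on hard labels with a temperature-scaled KL term against the teacher's predictions.

```python
# A minimal sketch of a standard soft-target distillation loss, assuming
# PyTorch. The function name and hyperparameter defaults are illustrative,
# not taken from the paper.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.5):
    """Blend hard-label cross-entropy with a soft-target KL term."""
    # Supervision from the ground-truth labels.
    ce = F.cross_entropy(student_logits, labels)
    # Supervision from the teacher: KL divergence between temperature-scaled
    # distributions; the T**2 factor keeps gradients on a comparable scale.
    kd = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T ** 2)
    return alpha * ce + (1.0 - alpha) * kd
```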
To overcome this limitation, we propose a novel fast pretraining distillation framework for small ViTs. Our approach begins by initializing the parameters of the student model using a pretrained teacher model. We then update the student's parameters on unlabeled data alone, guided by a self-supervised objective together with a distillation loss. The self-supervised objective encourages the student to learn transferable features by predicting its own outputs (or solving a related pretext task), while the distillation loss penalizes predictions that deviate from those of the teacher. This framework not only captures the knowledge of the teacher model but also circumvents the need for labeled data, significantly reducing the computational cost and time required for training.
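
A training step in this label-free setting can be pictured as follows. This is a minimal sketch assuming PyTorch: the two-view consistency term stands in for the self-supervised objective described above, and names such as `pretrain_distill_step` and `lam` are hypothetical rather than taken from the paper.

```python
# An illustrative, label-free training step combining teacher distillation with
# a self-supervised consistency term, assuming PyTorch. The two-view agreement
# loss stands in for "predicting its own outputs (or a related task)"; names
# such as `pretrain_distill_step` and `lam` are hypothetical.
import torch
import torch.nn.functional as F

def pretrain_distill_step(student, teacher, view_a, view_b, optimizer,
                          T=4.0, lam=0.5):
    teacher.eval()
    with torch.no_grad():                       # teacher provides fixed soft targets
        teacher_logits = teacher(view_a)

    student_a = student(view_a)                 # two augmented views of the same
    student_b = student(view_b)                 # unlabeled images

    # Distillation: match the teacher's softened distribution (no labels needed).
    kd = F.kl_div(F.log_softmax(student_a / T, dim=-1),
                  F.softmax(teacher_logits / T, dim=-1),
                  reduction="batchmean") * (T ** 2)

    # Self-supervision: the student's predictions for the two views should agree.
    ssl = F.mse_loss(F.normalize(student_a, dim=-1),
                     F.normalize(student_b, dim=-1))

    loss = kd + lam * ssl
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```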
We evaluate TinyViT on several benchmark datasets for image classification and demonstrate its effectiveness by comparing it with existing state-of-the-art methods. Our experiments show that TinyViT outperforms its competitors by a significant margin while using a fraction of the computational resources. Specifically, on ImageNet, our method achieves a top-1 accuracy of 75.3%, which is 24% higher than the current state-of-the-art method that does not use pretraining and 16% higher than the one that does. These results establish TinyViT as a competitive alternative to existing methods for computer vision tasks.
In summary, we present TinyViT, a novel fast pretraining distillation framework for small vision transformers that effectively addresses the large memory footprint and computational cost associated with conventional vision transformers. With our framework, we achieve state-of-the-art performance using a fraction of the resources and without sacrificing accuracy. We believe that TinyViT paves the way for efficient vision transformer-based models and opens up exciting opportunities for future research in this area.
