- [31] Yuan L., Chen Y., Wang T., Yu W., Shi Y., Jiang Z., Tay F.E.H., Feng J., Yan S., Tokens-to-Token ViT: Training vision transformers from scratch on ImageNet, Proc. IEEE/CVF Int. Conf. Comput. Vis. (ICCV), pp. 538-547, (2021)
- [32] Tian Z., Shen C., Chen H., He T., FCOS: Fully convolutional one-stage object detection, Proc. IEEE/CVF Int. Conf. Comput. Vis. (ICCV), pp. 9626-9635, (2019)
- [33] Wang Y., Zhang X., Yang T., Sun J., Anchor DETR: Query design for transformer-based detector, Proc. AAAI Conf. Artif. Intell., 36, 3, pp. 2567-2575, (2022)
- [34] Han K., Xiao A., Wu E., Guo J., Xu C., Wang Y., Transformer in Transformer, (2021)
- [35] Howard A., Sandler M., Chen B., Wang W., Chen L.-C., Tan M., Chu G., Vasudevan V., Zhu Y., Pang R., Adam H., Le Q., Searching for MobileNetV3, Proc. IEEE/CVF Int. Conf. Comput. Vis. (ICCV), pp. 1314-1324, (2019)
- [36] Howard A.G., Zhu M., Chen B., Kalenichenko D., Wang W., Weyand T., Andreetto M., Adam H., MobileNets: Efficient convolutional neural networks for mobile vision applications, (2017)
- [37] Sandler M., Howard A., Zhu M., Zhmoginov A., Chen L.-C., MobileNetV2: Inverted residuals and linear bottlenecks, (2018)
- [38] Ma N., Zhang X., Zheng H., Sun J., ShuffleNet V2: Practical guidelines for efficient CNN architecture design, Proc. ECCV, pp. 122-138, (2018)
- [39] Zhang X., Zhou X., Lin M., Sun J., ShuffleNet: An extremely efficient convolutional neural network for mobile devices, (2017)
- [40] Iandola F.N., Han S., Moskewicz M.W., Ashraf K., Dally W.J., Keutzer K., SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size, (2016)