(1. College of Computer Science and Technology, National University of Defense Technology, Changsha 410073, China;2. National Key Laboratory of Paralle and Distributed Computing, National University of Defense Technology, Changsha 410073, China)
TP18
HE Yuanhong, JIANG Jingfei, XU Jinwei. Quantization and pruning optimization method for attention mechanism[J]. Journal of National University of Defense Technology,2024,46(1):113-120.
Copy