Rope Posted on 2025-02-17 | In 深度学习 | Rope Su, Jianlin, et al. “Roformer: Enhanced transformer with rotary position embedding.” Neurocompu ... Read more »
seed-tts论文 Posted on 2025-02-08 | In 语音合成 | seed-tts论文 Anastassiou, Philip, et al. “Seed-TTS: A Family of High-Quality Versatile Speech Generati ... Read more »
End-to-end object detection with transformers Posted on 2025-01-18 | In 深度学习 | End-to-end object detection with transformers Carion, Nicolas, et al. “End-to-end object detection w ... Read more »
Megatron Posted on 2025-01-08 | In 分布式系统 | Megatron Shoeybi, Mohammad, et al. “Megatron-lm: Training multi-billion parameter language models us ... Read more »
GPipe Posted on 2024-12-20 | In 分布式系统 | GPipe Huang, Yanping, et al. “Gpipe: Efficient training of giant neural networks using pipeline para ... Read more »
参数服务器 Posted on 2024-12-18 | In 分布式系统 | 参数服务器 Li, Mu, et al. “Scaling distributed machine learning with the parameter server.” 11th USENIX S ... Read more »
CLIP Posted on 2024-12-06 | In 多模态 | CLIP https://arxiv.org/abs/2103.00020 Radford, Alec, et al. “Learning transferable visual models fro ... Read more »
SpotTune Transfer Learning through Adaptive Fine-tuning Posted on 2023-07-02 | In Finetune | SpotTune: Transfer Learning through Adaptive Fine-tuning Guo, Yunhui, et al. “Spottune: transfer lea ... Read more »
Finetune网上博客 Posted on 2023-06-06 | In Finetune | finetune网上博客 极市 花式 Finetune 方法大汇总 1. 招式1:使用Pretrain模型做约束在Finetune阶段,如果我们可用于Finetune的目标任务数据量较少时,很有可能 ... Read more »
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference Posted on 2023-05-21 | In 模型压缩 | Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference量化感知论文 ... Read more »