Speech Recognition Toolkits

Vosk Speech Recognition Toolkit

GitHub: https://github.com/alphacep/vosk-api

WeNet

https://github.com/wenet-e2e/wenet

emoASR

https://github.com/emonosuke/emoASR

Focuses on the inference process:

  • Encoder
    • RNN
    • Transformer (Trf.) [Vaswani 2017]
    • Conformer (Cf.) [Gulati 2020]
  • Decoder
    • CTC [Graves 2006]
    • RNN-Transducer (RNN-T) [Graves 2012]
    • LAS [Chan 2015]
    • Transformer (Trf.)
  • LM
    • RNNLM
    • Transformer LM
    • BERT [Devlin 2018]
    • ELECTRA [Clark 2020]
    • Phone-attentive ELECTRA (P-ELECTRA) [Futami 2021]
  • Method
    • Rescoring
    • Shallow Fusion
    • Knowledge Distillation
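
To make the "Rescoring" entry concrete, here is a minimal sketch of N-best rescoring with a language model. It is not emoASR's implementation: the toy unigram LM (`TOY_LM`), the helpers `lm_log_prob` and `rescore`, the `lm_weight` parameter, and all scores are hypothetical and chosen only for illustration. Each hypothesis is re-ranked by its ASR log-probability plus a weighted LM log-probability.

```python
import math
from typing import Dict, List, Tuple

# Hypothetical toy unigram LM (illustrative probabilities, not from any real model).
TOY_LM: Dict[str, float] = {
    "recognize": math.log(0.4),
    "speech": math.log(0.4),
    "wreck": math.log(0.05),
    "a": math.log(0.1),
    "nice": math.log(0.04),
    "beach": math.log(0.01),
}
UNKNOWN_LOG_PROB = math.log(1e-4)  # floor for tokens the toy LM has not seen


def lm_log_prob(tokens: List[str]) -> float:
    """Sum of per-token log-probabilities under the toy unigram LM."""
    return sum(TOY_LM.get(token, UNKNOWN_LOG_PROB) for token in tokens)


def rescore(
    nbest: List[Tuple[List[str], float]], lm_weight: float = 0.5
) -> List[Tuple[List[str], float]]:
    """Re-rank (tokens, asr_log_prob) pairs by asr_log_prob + lm_weight * lm_log_prob."""
    scored = [
        (tokens, asr_score + lm_weight * lm_log_prob(tokens))
        for tokens, asr_score in nbest
    ]
    # Highest combined score first.
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Assumed N-best list from a first-pass decoder: the acoustically preferred
# hypothesis is listed first, but the LM finds it much less plausible.
nbest = [
    (["wreck", "a", "nice", "beach"], -1.0),
    (["recognize", "speech"], -1.2),
]

best_tokens, best_score = rescore(nbest)[0]
print(" ".join(best_tokens))
```

With `lm_weight=0.0` the ranking falls back to the ASR scores alone. Shallow fusion uses the same kind of log-linear interpolation, but applies it to partial hypotheses at each step of beam search rather than to a finished N-best list.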