Huawei-HIT Joint Laboratory - Multimodal Voice Assistant Rejection Project
Developed models that let a voice assistant reject interference, such as speech not directed at the assistant or background noise, while it listens continuously during multi-turn dialogue. Responsible for building a dual-tower rejection model that combines a speech encoder (Wav2vec2) with a text encoder (BERT), and for applying techniques such as data augmentation and mixture of experts (MoE) to improve out-of-distribution (OOD) generalization across domains.