HiFi-Stream：使用生成對抗網絡的流式語音增強

2503.17141v2

中文标题#

HiFi-Stream：使用生成對抗網絡的流式語音增強

英文标题#

HiFi-Stream: Streaming Speech Enhancement with Generative Adversarial Networks

中文摘要#

語音增強技術已成為移動設備和語音軟件中的核心技術。儘管如此，現代深度學習解決方案通常需要大量的計算資源，這使得它們在低資源設備上的使用具有挑戰性。我們提出了 HiFi-Stream，這是最近發布的 HiFi++ 模型的優化版本。我們的實驗表明，儘管其大小和計算複雜度相比原始 HiFi++ 有所改進，HiFi-Stream 保留了原始模型的大部分質量，使其成為目前最小和最快的模型之一。該模型在流式設置中進行了評估，與現代基線方法相比，表現出優越的性能。

英文摘要#

Speech Enhancement techniques have become core technologies in mobile devices and voice software. Still, modern deep learning solutions often require high amount of computational resources what makes their usage on low-resource devices challenging. We present HiFi-Stream, an optimized version of recently published HiFi++ model. Our experiments demonstrate that HiFi-Stream saves most of the qualities of the original model despite its size and computational complexity improved in comparison to the original HiFi++ making it one of the smallest and fastest models available. The model is evaluated in streaming setting where it demonstrates its superior performance in comparison to modern baselines.

PDF 获取#

查看中文 PDF - 2503.17141v2

智能達人抖店二維碼

抖音掃碼查看更多精彩內容