Update lynxnet backbone #228

yxlllc · 2025-01-16T17:10:20Z

No description provided.

* Change the injection method of conditions on lynxnet (#225) * update configurations for new-lynxnet * update configurations for new-lynxnet * update configurations for new-lynxnet --------- Co-authored-by: KakaruHayate <[email protected]>

* Add multi-dictionary preprocessing and training * Fix lang_map.json copy * Add language embed (inject to txt_embed) for acoustic models * Save language sequence in variance preprocessing * Display merged phoneme groups properly in distribution plots * Add multi-dictionary inference * Save original phoneme texts for duration plots * Fix duration plots displaying bug * Explicit `languages` argument passing * Add language embed (inject to txt_embed) for variance models * Fix argument passing * Add log for lang_map.json copy * Add language embedding scale * Add language embedding type * Preprocessing: only apply lang embed on cross-lingual phonemes * Inference: only apply lang embed on cross-lingual phonemes * Revert "Add language embedding type" This reverts commit 655e9ba. * Revert lang_embed_scale * Adapt ONNX exporters for multi-language models * Refactor configuration schemas for datasets * Add check of existence for merged phonemes * Fix spk_id assignment * Fix languages.json filename * Fix `languages` key in dsconfig.yaml * Set `use_lang_id` to false if there are no cross-lingual phonemes * Support defining extra phonemes * Refactor configs * Prefer file copies in work_dir when loading dictionaries * Fix cannot locate dictionary * Fix unexpected loading error when dictionary changes * Update toplevel.py (#219) * Fix unexpected config passing * Update lynxnet backbone (#228) * Change the injection method of conditions on lynxnet (#225) * update configurations for new-lynxnet * update configurations for new-lynxnet * update configurations for new-lynxnet --------- Co-authored-by: KakaruHayate <[email protected]> * Improve fastspeech2 encoder using Rotary Position Embedding (RoPE) in multi-head self-attention (#234) * update multi-head self attention with RoPE * RoPE onnx (#230) * fix requirements.txt (#233) * fix rope for melody encoder * support swiglu activation for ffn * update dependencies --------- Co-authored-by: KakaruHayate <[email protected]> * support mini-nsf-hifigan vocoder * discard negative pad * fix MHA inference using low torch version * Fix missing phoneme list sorting * Fix single-language dictionary parsing language tag * Add `pitch_controllable` flag to vocoder exporter (cherry picked from commit a6deb6b) * support noise injection * Allow merging global phonemes and language-specific phonemes * Check for conflicts between short names and global tags * Finish documentation for multi-dictionary --------- Co-authored-by: Anjo <[email protected]> Co-authored-by: yxlllc <[email protected]> Co-authored-by: KakaruHayate <[email protected]> Co-authored-by: yxlllc <[email protected]>

KakaruHayate and others added 4 commits December 13, 2024 16:51

Change the injection method of conditions on lynxnet (#225)

a19a2eb

update configurations for new-lynxnet

50bab77

update configurations for new-lynxnet

0c844ec

update configurations for new-lynxnet

74ab9e4

yxlllc merged commit 26ce743 into main Jan 16, 2025

yxlllc deleted the new-lynxnet branch February 15, 2025 05:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update lynxnet backbone #228

Update lynxnet backbone #228

yxlllc commented Jan 16, 2025

Update lynxnet backbone #228

Update lynxnet backbone #228

Conversation

yxlllc commented Jan 16, 2025