Skip to content
View liusongxiang's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro
Block or Report

Block or report liusongxiang

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
liusongxiang/README.md

Hi there 👋

My research interests encompass the extensive domain of speech and language intelligence, which includes speech foundation models, large language models (LLMs), text-to-speech synthesis (TTS), voice conversion (VC), singing synthesis, cross-modal representation learning, audio adversarial attacks & defense, among other related areas.

My homepage

Google scholar profile

Pinned Loading

  1. StarGAN-Voice-Conversion StarGAN-Voice-Conversion Public

    This is a pytorch implementation of the paper: StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks

    Python 507 92

  2. ppg-vc ppg-vc Public

    PPG-Based Voice Conversion

    Python 321 73

  3. efficient_tts efficient_tts Public

    Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"

    Python 114 22

  4. BNE-Seq2SeqMoL-VC BNE-Seq2SeqMoL-VC Public

    Demo for "Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling"

    6 3

  5. diffsvc diffsvc Public

    DiffSVC demo page

    77 66

  6. Large-Audio-Models Large-Audio-Models Public

    Keep track of big models in audio domain, including speech, singing, music etc.

    415 24