Synthical
Your space
Profile
Activity
Favorites
Folders
Feeds
All articles
Simple
Original
Articles about
Sound
Perceived Femininity in Singing Voice: Analysis and Prediction
4 days ago by
Yuexuan Kong
and
others
Sound
Audio-Thinker: Guiding Audio Language Model When and How to Think via Reinforcement Learning
4 days ago by
Shu Wu
and
others
Sound
,
Computation and Language
Improving DF-Conformer Using Hydra For High-Fidelity Generative Speech Enhancement on Discrete Codec Token
5 days ago by
Shogo Seki
and
others
Sound
MultiSoundGen: Video-to-Audio Generation for Multi-Event Scenarios via SlowFast Contrastive Audio-Visual Pretraining and Direct Preference Optimization
5 days ago by
Jianxuan Yang
and
others
Multimedia
,
Computer Vision and Pattern Recognition
H-Infinity Filter Enhanced CNN-LSTM for Arrhythmia Detection from Heart Sound Recordings
5 days ago by
Rohith Shinoj Kumar
and
others
Machine Learning
,
Artificial Intelligence
Phoenix-VAD: Streaming Semantic Endpoint Detection for Full-Duplex Speech Interaction
5 days ago by
Weijie Wu
and
others
Audio and Speech Processing
,
Sound
DARAS: Dynamic Audio-Room Acoustic Synthesis for Blind Room Impulse Response Estimation
5 days ago by
Chunxi Wang
and
others
Audio and Speech Processing
,
Sound
Prevailing Research Areas for Music AI in the Era of Foundation Models
5 days ago by
Megan Wei
and
others
Sound
,
Artificial Intelligence
From the perspective of perceptual speech quality: The robustness of frequency bands to noise
5 days ago by
Junyi Fan
and
Donald Williamson
Audio and Speech Processing
,
Sound
An Evaluation of Interleaved Instruction Tuning on Semantic Reasoning Performance in an Audio MLLM
5 days ago by
Jiawei Liu
and
others
Multimedia
,
Computation and Language
AuthGlass: Enhancing Voice Authentication on Smart Glasses via Air-Bone Acoustic Features
5 days ago by
Weiye Xu
and
others
Human-Computer Interaction
,
Sound
Sound Clouds: Exploring ambient intelligence in public spaces to elicit deep human experience of awe, wonder, and beauty
5 days ago by
Chengzhi Zhang
and
others
Human-Computer Interaction
,
Multimedia
ADNAC: Audio Denoiser using Neural Audio Codec
5 days ago by
Daniel Jimon
and
others
Sound
,
Machine Learning
AnyEnhance: A Unified Generative Model with Prompt-Guidance and Self-Critic for Voice Enhancement
5 days ago by
Junan Zhang
and
others
Sound
,
Artificial Intelligence
The Ghost in the Keys: A Disklavier Demo for Human-AI Musical Co-Creativity
5 days ago by
Louis Bradshaw
and
others
Sound
,
Artificial Intelligence
Leveraging Language Information for Target Language Extraction
5 days ago by
Mehmet Sinan Yıldırım
and
others
Audio and Speech Processing
,
Sound
Continuous Boostlet Transform and Associated Uncertainty Principles
5 days ago by
Owais Ahmad
and
Jasifa Fayaz
Signal Processing
,
Sound
Temporal Feature Learning in Weakly Labelled Bioacoustic Cetacean Datasets via a Variational Autoencoder and Temporal Convolutional Network: An Interdisciplinary Approach
6 days ago by
Laia Garrobé Fonollosa
and
others
Sound
,
Audio and Speech Processing
Instance-Specific Test-Time Training for Speech Editing in the Wild
6 days ago by
Taewoo Kim
and
others
Audio and Speech Processing
,
Sound
Speech-DRAME: A Framework for Human-Aligned Benchmarks in Speech Role-Play
6 days ago by
Jiatong Shi
and
others
Sound
,
Artificial Intelligence
MultiMed-ST: Large-scale Many-to-many Multilingual Medical Speech Translation
1
6 days ago by
Khai Le-Duc
and
others
Computation and Language
,
Artificial Intelligence
Feedback-driven Retrieval-augmented Audio Generation with Large Audio Language Models
6 days ago by
Junqi Zhao
and
others
Sound
As Good as It KAN Get: High-Fidelity Audio Representation
6 days ago by
Patryk Marszałek
and
others
Sound
,
Computer Vision and Pattern Recognition
Mitigating Attention Sinks and Massive Activations in Audio-Visual Speech Recognition with LLMs
7 days ago by
Anand
and
others
Audio and Speech Processing
,
Computer Vision and Pattern Recognition
MULTI-Bench: A Multi-Turn Interactive Benchmark for Assessing Emotional Intelligence ability of Spoken Dialogue Models
7 days ago by
Yayue Deng
and
others
Audio and Speech Processing
,
Artificial Intelligence
Rhythm in the Air: Vision-based Real-Time Music Generation through Gestures
7 days ago by
Barathi Subramanian
and
others
Multimedia
,
Sound
Music Arena: Live Evaluation for Text-to-Music
7 days ago by
Yonghyun Kim
and
others
at
Carnegie Mellon University
Sound
,
Artificial Intelligence
More Than A Shortcut: A Hyperbolic Approach To Early-Exit Networks
7 days ago by
Swapnil Bhosale
and
others
Sound
,
Artificial Intelligence
Audio Driven Real-Time Facial Animation for Social Telepresence
7 days ago by
Jiye Lee
and
others
Graphics
,
Computer Vision and Pattern Recognition
Recent Trends in Distant Conversational Speech Recognition: A Review of CHiME-7 and 8 DASR Challenges
7 days ago by
Samuele Cornell
and
others
Audio and Speech Processing
,
Computation and Language
Load more