Humans can listen to a target speaker even in challenging acoustic condi...
We present a general framework to compute the word error rate (WER) of A...
Recent speaker diarization studies showed that integration of end-to-end...
Target speech extraction is a technique to extract the target speaker's ...
Target speech extraction (TSE) extracts the speech of a target speaker i...
In many situations, we would like to hear desired sound events (SEs) whi...
It is essential to perform speech intelligibility (SI) experiments with ...
Speaker diarization has been investigated extensively as an important ce...
The combination of a deep neural network (DNN) -based speech enhancement...
This paper develops a framework that can perform denoising, dereverberat...
Many state-of-the-art neural network-based source separation systems use...
This paper proposes an approach for optimizing a Convolutional BeamForme...
Automatic transcription of meetings requires handling of overlapped spee...
Permutation invariant training (PIT) is a widely used training criterion...
Target sound extraction consists of extracting the sound of a target aco...
Sound event localization aims at estimating the positions of sound sourc...
Although recent advances in deep learning technology improved automatic
...
Recently, we proposed a novel speaker diarization method called
End-to-E...
Many subjective experiments have been performed to develop objective spe...
Sound event localization frameworks based on deep neural networks have s...
The continuous speech separation (CSS) is a task to separate the speech
...
Estimating the positions of multiple speakers can be helpful for tasks l...
Recently, the end-to-end approach has been successfully applied to
multi...
Target speaker extraction, which aims at extracting a target speaker's v...
Target speech extraction, which extracts the speech of a target speaker ...
Developing microphone array technologies for a small number of microphon...
Leveraging additional speaker information to facilitate speech separatio...
Time-domain training criteria have proven to be very effective for the
s...
Recent diarization technologies can be categorized into two approaches, ...
Recently, the source separation performance was greatly improved by
time...
Being able to control the acoustic events (AEs) to which we want to list...
Most approaches to multi-talker overlapped speech separation and recogni...
This paper proposes methods that can optimize a Convolutional BeamFormer...
The performance of speech enhancement algorithms in a multi-speaker scen...
With the advent of deep learning, research on noise-robust automatic spe...
Automatic meeting analysis is an essential fundamental technology requir...
Target speech extraction, which extracts a single target source in a mix...
The rising interest in single-channel multi-speaker speech separation sp...
The rising interest in single-channel multi-speaker speech separation sp...
We previously proposed an optimal (in the maximum likelihood sense)
conv...
This article describes a probabilistic formulation of a Weighted Power
m...
In this study, we proposed a new concept, gammachirp envelope distortion...
Automatic meeting analysis comprises the tasks of speaker counting, spea...
This paper proposes a method for estimating a convolutional beamformer t...