The Internet of Things (IoT) connects people, devices, and information
r...
Childhood and adolescent obesity rates are a global concern because obes...
Clinical decision support systems (CDSSs) have been widely utilized to
s...
Expressive text-to-speech systems have undergone significant advancement...
Despite rapid progress in the voice style transfer (VST) field, recent
z...
Decoding EEG signals for imagined speech is a challenging task due to th...
Although text-to-speech (TTS) systems have significantly improved, most ...
Recently, denoising diffusion models have demonstrated remarkable perfor...
Diffusion-based generative models have exhibited powerful generative
per...
Translating imagined speech from human brain activity into voice is a
ch...
Decoding imagined speech from human brain signals is a challenging and
i...
Previous video-based human pose estimation methods have shown promising
...
Although modern object detectors rely heavily on a significant amount of...
Although many approaches for multi-human pose estimation in videos have ...
Temporal action localization (TAL) is a task of identifying a set of act...
Video Anomaly Detection(VAD) has been traditionally tackled in two main
...
Most object detection frameworks use backbone architectures originally
d...
Recently, advanced technologies have unlimited potential in solving vari...
Hedging is a strategy for reducing the potential risks in various types ...
Few-shot object detection has gained significant attention in recent yea...
Speech impairments due to cerebral lesions and degenerative disorders ca...
Brain-computer interface (BCI) is challenging to use in practice due to ...
Domain adaptation (DA) or domain generalization (DG) for face presentati...
Metaverse provides an alternative platform for human interaction in the
...
Brain-computer interface (BCI) is a practical pathway to interpret users...
Deep learning frameworks have become increasingly popular in brain compu...
We present a mobile dataset obtained from electroencephalography (EEG) o...
Few-shot speaker adaptation is a specific Text-to-Speech (TTS) system th...
Deep learning has played a major role in the interpretation of dermoscop...
Aerial image registration or matching is a geometric process of aligning...
As interpretability has been pointed out as the obstacle to the adoption...
Recently, semiconductors' demand has exploded in virtual reality,
smartp...
Brain-computer interface (BCI) is one of the tools which enables the
com...
Recently, various deep neural networks have been applied to classify
ele...
Face anti-spoofing (FAS) plays an important role in protecting face
reco...
Due to the recent outbreak of COVID-19, many classes, exams, and meeting...
Human-robot collaboration has the potential to maximize the efficiency o...
Brain-computer interface (BCI) is used for communication between humans ...
Although recent works on neural vocoder have improved the quality of
syn...
Computer-aided diagnosis has recently received attention for its advanta...
The 3D Morphable Model (3DMM), which is a Principal Component Analysis (...
Every people has their own voice, likewise, brain signals dis-play disti...
Lack of adequate training samples and noisy high-dimensional features ar...
Brain-computer interfaces (BCIs) use brain signals such as
electroenceph...
Recently, practical brain-computer interface is actively carried out,
es...
To enable a deep learning-based system to be used in the medical domain ...
Visual question answering requires a deep understanding of both images a...
Human activity recognition in videos has been widely studied and has rec...
Noninvasive brain-computer interface (BCI) is widely used to recognize u...
Recent advances in brain-computer interface technology have shown the
po...