We study a synthetic corpus-based approach for language models (LMs) to
...
Writing a readme is a crucial aspect of software development as it plays...
This paper investigates the effect of tokenizers on the downstream
perfo...
Masked language modeling (MLM) is a widely used self-supervised pretrain...
One of the challenges in text generation is to control generation as int...
This paper explains the participation of team Hitachi to SemEval-2023 Ta...
Sparsity learning with known grouping structures has received considerab...
Structural equation models and Bayesian networks have been widely used t...