Language Model Adaptation for Downstream Tasks using Text Selection

Abstract:

Previous research shows that the domain of the training data has a large impact on downstream task performance, and that selecting data from an appropriate domain improves it. Text classification can help distinguish data belonging to different domains. In this paper, we use a text classification method to select data from a particular domain (the task-specific target domain). We experiment with target-domain corpora of different sizes to explore the effect of the method. A pretrained RoBERTa model is adapted to the target domain using the selected data before being trained on the downstream tasks. Our experiments show that using a simple domain classifier to select a small dataset for adapting the model can help stabilize performance on downstream tasks.
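
The selection step described in the abstract can be illustrated with a minimal sketch. The abstract does not say which classifier the authors use, so the unigram log-odds scorer below is a stand-in chosen for brevity, not the paper's method. All function names (`train_domain_classifier`, `domain_score`, `select_in_domain`) and the toy sentences are hypothetical. The later adaptation step (continued pretraining of RoBERTa on the selected text) is not shown.

```python
import math
from collections import Counter


def tokenize(text):
    # Whitespace tokenization on lowercased text; an assumption made for brevity.
    return text.lower().split()


def train_domain_classifier(target_docs, general_docs):
    """Return per-token log-odds (target vs. general) with add-one smoothing."""
    target_counts = Counter(t for d in target_docs for t in tokenize(d))
    general_counts = Counter(t for d in general_docs for t in tokenize(d))
    vocab = set(target_counts) | set(general_counts)
    target_total = sum(target_counts.values()) + len(vocab)
    general_total = sum(general_counts.values()) + len(vocab)
    return {
        tok: math.log((target_counts[tok] + 1) / target_total)
        - math.log((general_counts[tok] + 1) / general_total)
        for tok in vocab
    }


def domain_score(text, log_odds):
    """Average log-odds over tokens; unseen tokens contribute 0."""
    tokens = tokenize(text)
    if not tokens:
        return 0.0
    return sum(log_odds.get(t, 0.0) for t in tokens) / len(tokens)


def select_in_domain(candidates, log_odds, k):
    """Keep the k candidates that score as most target-like."""
    ranked = sorted(candidates, key=lambda c: domain_score(c, log_odds), reverse=True)
    return ranked[:k]


if __name__ == "__main__":
    # Toy data, invented for illustration only.
    target = ["the patient received a dose of the drug",
              "clinical trial of the drug in patients"]
    general = ["the team won the football match",
               "the stock market fell today"]
    pool = ["the drug reduced symptoms in the patient",
            "the football team lost the match",
            "markets fell after the match"]
    model = train_domain_classifier(target, general)
    print(select_in_domain(pool, model, k=1))
```

In this sketch, `k` stands in for the size of the selected adaptation set, which the abstract says is varied in the experiments.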

Published in: 2022 4th International Conference on Natural Language Processing (ICNLP)

Date of Conference: 25-27 March 2022

Date Added to IEEE Xplore: 19 September 2022

DOI: 10.1109/ICNLP55136.2022.00058

Conference Location: Xi'an, China
