Fine-tuned language models for intelligent aggregation systems in the banking sector
Author affiliations
DOI:
https://doi.org/10.15625/1813-9663/22428Keywords:
News aggregation system, sentiment analysis, financial text, large language models, fine-tuned models.Abstract
This paper presents the design and implementation of a practical news aggregation system for decision making and risk management in the banking sector. The system gathers financial news from multiple sources and mines texts to provide insight into banks’ professional activities such as policies, products, and financial performance. Multiple natural language processing (NLP) modules are integrated to address tasks such as topic classification and sentiment analysis. To identify the most suitable technologies for each component, we systematically evaluated a wide range of NLP techniques, including large language models (LLMs) and domain-specific pre-trained models. A Vietnamese financial corpus of 12,000 annotated samples was constructed to fine-tune models such as PhoBERT, ViT5, and BARTPho. Experimental results show that fine-tuned models significantly outperform general-purpose LLMs (e.g., LLaMA-3.1-8B, Vistral-7B) in both accuracy and computational efficiency. The fine-tuned models achieve a 7.15% accuracy improvement and reduce resource requirements. The study demonstrates a scalable and adaptable framework for building multi-source, text-based intelligent systems in the financial domain.
Downloads
Published
How to Cite
Issue
Section
License
1. We hereby assign copyright of our article (the Work) in all forms of media, whether now known or hereafter developed, to the Journal of Computer Science and Cybernetics. We understand that the Journal of Computer Science and Cybernetics will act on my/our behalf to publish, reproduce, distribute and transmit the Work.2. This assignment of copyright to the Journal of Computer Science and Cybernetics is done so on the understanding that permission from the Journal of Computer Science and Cybernetics is not required for me/us to reproduce, republish or distribute copies of the Work in whole or in part. We will ensure that all such copies carry a notice of copyright ownership and reference to the original journal publication.
3. We warrant that the Work is our results and has not been published before in its current or a substantially similar form and is not under consideration for another publication, does not contain any unlawful statements and does not infringe any existing copyright.
4. We also warrant that We have obtained the necessary permission from the copyright holder/s to reproduce in the article any materials including tables, diagrams or photographs not owned by me/us.

