Vocabulary-Enhanced Named Entity Recognition and its Application on Distribution Network Maintenance
Abstract
Named entity recognition is a crucial task in natural language processing, serving downstream tasks such as information extraction and question-answering systems. With the success of Transformer in the field of natural language processing, named entity recognition has made significant strides. However, despite these advancements, the recognition capability in specialized domains remains challenging. The distribution network system contains a vast amount of equipment maintenance text, characterized by inherent complexity and specialization, lacking effective recognition and extraction methods. To address this issue, we propose a vocabulary-enhanced named entity recognition method. Unlike the original named entity recognition methods, we employ a vocabulary adapter to enhance the understanding of text semantic information by integrating matching information between the dictionary and the text. This method strengthens the ability to extract specialized nouns. In conjunction with the TaCL-BERT pre-trained model, we conducted training on our self-constructed distribution network maintenance dataset, GNP-M, to better align the model with the context and specific entities of the electrical grid maintenance domain. Simultaneously, we compared our method with four other named entity recognition models, revealing satisfactory performance results.
This paper was recommended by Regional Editor Tongquan Wei.