Journal of Civil Aviation University of China ›› 2022, Vol. 40 ›› Issue (3): 47-53.

• Civil Aviation • Previous Articles     Next Articles

Automatic classification of civil aviation safety information based on text augment

CUI Zhenxin, ZHANG Zhuoyan   

  1. (College of Flight Technology, CAUC, Tianjin 300300, China)
  • Received:2020-10-16 Revised:2021-01-05 Online:2022-06-15 Published:2023-10-29

Abstract: Aiming at the problem of insufficient samples in the application of automatic classification of civil aviation safety information, an automatic classifier is developed, based on the bidirectional encoder representations from transformers (BERT) pre-training model, easy data augment(EDA) and support vector machine(SVM) algorithms.This paper categorizes the incident information into data subsets of different orders of magnitude according to the orders of magnitude of information of single incident type and analyzes the model performance of data-sets in different scale, especially for the small data-sets. The results show that the F 1w value is increased by 31.21 % , 9.66 % and 3.35 % when the orders of magnitude of single incident type are ten -scale, hundred -scale and thousand -scale respectively, indicating a significant improvement of model performance. Therefore, by text enhancement algorithm, the automatic classifier trained in relatively small data-sets has a good effect, and is applicable to automatic classification and processing for civil aviation safety information.

Key words: civil aviation safety, safety information, text augment, natural language processing

CLC Number: