中国民航大学学报 ›› 2022, Vol. 40 ›› Issue (2): 14-18.

• 民用航空 • 上一篇    下一篇

基于 NLP 的不规范航行通告识别方法 

项 恒,张 驰,李 猛    

  1. (中国民航大学空中交通管理学院,天津 300300) 
  • 收稿日期:2021-05-17 修回日期:2021-05-17 接受日期:2021-03-02 出版日期:2022-06-05 发布日期:2022-06-05
  • 作者简介:项恒(1975—),男,新疆巴音郭楞蒙古自治州人, 副教授,硕士,研究方向为空管人为因素、空域规划、航行情报服务.
  • 基金资助:
    国家自然科学基金项目(U1633109)

Identification method for non-normative NOTAM based on NLP 

XIANG Heng , ZHANG Chi , LI Meng    

  1. (College of Air Traffic Management, CAUC, Tianjin 300300, China) 
  • Received:2021-05-17 Revised:2021-05-17 Accepted:2021-03-02 Online:2022-06-05 Published:2022-06-05

摘要: 针对航行通告中出现的 Q 代码和 E 项正文部分不规范的问题,通过自然语言处理中的文本相似度计算方 法可识别出不规范航行通告。 首先,基于统计机器翻译方法将航行通告 E 项正文部分翻译成中文并建立数 据库,将 Q 代码翻译成中文;然后,利用 Word2vec 模型计算两者之间的相似度,并制定不规范航行通告识 别标准。 通过对收集的 500 条航行通告中的 Q 代码和 E 项正文进行相似度计算,设定 0.7 作为不规范航行 通告的识别标准,经数据测试可得不规范航行通告识别准确率为 96.2%,验证了基于自然语言处理的不规 范航行通告识别方法的可行性。

关键词: 自然语言处理, 航行通告, 机器翻译, Word2vec, 文本相似度计算

Abstract: Aiming at the non-normative problems of item Q and item E in the notice to airmen(NOTAM), a method of text similarity calculation in natural language processing is proposed to identify the non-normative NOTAMs. Firstly, translating the item E into Chinese by statistical machine translation method, and translating the item Q into Chinese by establishing a database. Then, Word2vec model is used to calculate the similarity between the two sentences, and the recognition standard of non -normative NOTAMs is established. Based on the similarity calculation of item Q and item E in 500 collected NOTAMs, 0.7 is set as the benchmark of non -normative NOTAMs. The data test shows that the recognition accuracy of NOTAMs is 96.2%, which verifies the feasibility of non-normative NOTAMs recognition method based on NLP.

Key words: natural language processing, notice to airmen, machine translation, Word2vec, text similarity calculation

中图分类号: