Visible to the public Internet of things device recognition method based on natural language processing and text similarity

TitleInternet of things device recognition method based on natural language processing and text similarity
Publication TypeConference Paper
Year of Publication2021
AuthorsGe, Xin
Conference Name2021 4th International Conference on Advanced Electronic Materials, Computers and Software Engineering (AEMCSE)
KeywordsComputers, Cyberspace, Fingers, Human Behavior, IoT, IoT equipment detection, Jaccard, natural language processing, Object recognition, performance evaluation, pubcrawl, resilience, Resiliency, Scalability, Text recognition, Text similarity
AbstractEffective identification of Internet of things devices in cyberspace is of great significance to the protection of Cyberspace Security. However, there are a large number of such devices in cyberspace, which can not be identified by the existing methods of identifying IoT devices because of the lack of key information such as manufacturer name and device name in the response message. Their existence brings hidden danger to Cyberspace Security. In order to identify the IoT devices with missing key information in these response messages, this paper proposes an IoT device identification method, IoTCatcher. IoTCatcher uses HTTP response message and the structure and style characteristics of HTML document, and based on natural language processing technology and text similarity technology, classifies and compares the IoT devices whose response message lacks key information, so as to generate their device finger information. This paper proves that the recognition precision of IoTCatcher is 95.29%, and the recall rate is 91.01%. Compared with the existing methods, the overall performance is improved by 38.83%.
DOI10.1109/AEMCSE51986.2021.00036
Citation Keyge_internet_2021