Abstract:
With the development of geographic information system, digital earth and digital city play more and more important roles in life. The data generated by sensors or other e...Show MoreMetadata
Abstract:
With the development of geographic information system, digital earth and digital city play more and more important roles in life. The data generated by sensors or other edge nodes need to be collected by crawlers in the distributed systems in IoT, such as the GIS data in CyberGIS. In some edge networks, network operators have adopted methods to limit crawlers, such as blocking the request IP addresses, requiring logging in verification codes and other measures to avoid disturbance to servers. To collect data from web servers in these types of edge networks, a dynamic IP address based strategy DP-crawler is proposed to solve the anti-crawler strategies in the edge networks. DP-crawler can dynamic get proper IP addresses from a security-aware list and select the best available proxies. The security-aware list is designed to use the block-chain. Security and dynamic storage can be achieved by this method. DP-crawler is used to crawler webs, and the detailed information of Douban movies are obtained in the experiments. The experiment results show that the DP-Crawler can get more information by using the DP-Crawler.
Published in: 2018 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC)
Date of Conference: 18-20 October 2018
Date Added to IEEE Xplore: 21 February 2019
ISBN Information: