Open Access Open Access  Restricted Access Subscription Access

Analysis of JD Commodity Evaluation Word Cloud Based on Web Crawler


Affiliations
1 School of Computer and Communication, Hunan Institute of Engineering Xiangtan 411104, China
2 Xiangya Nursing School, Central South University, Changsha 410013, China
 

This project is the design of word cloud analysis program based on web crawler. Taking Jingdong Mall as the platform, it crawls all comment information of designated products, conducts data cleaning and data analysis on the information obtained from the review and crawler, and generates word cloud map.At the same time, the visual analysis of review data can clearly show the advantages and disadvantages of customer-centered evaluations and commodities, and provide an important reference for consumers to choose commodities and businesses to improve decision-making and optimize services. This project is developed by using Python3 language, using PyChart as the IDE, using the requests library, JSON library, World Cloud library and PyMongo library, using Navicat to connect MongoDB, using PyQT5 library to achieve visual interface, and JavaScript+HTML5+ CSS3 +MySQL+ word cloud + Boozing and Bagging algorithm for data analysis and algorithm optimization. In addition to providing consumers with cost-effective, highly evaluated and highly rated goods, it also provides the sellers with more specific data to improve their own defects.

Keywords

: Jingdong crawler; Natural Language Processing; Data mining; Visualization.
User
Notifications
Font Size

  • Xu Lei, Zhang Kewei.Analysis of Jingdong commodity reviews based on text mining [J]. Inner Mongolia Science, Technology and Economy, 2020,(3):41,43.
  • Yan Ming, Zheng Changxing. Text segmentation and word cloud production in Python environment [J].Modern Computers, 2018,(34):86-89.
  • Feng Feng Xingjie and Zeng Yunze. In-depth recommendation model based on scoring Matrix and review text. Journal of Computer Science, 2020, 43(5) : 884-900.
  • Li Jun, Zhou Yuying, Tang Zhihang.Clothing Information Collection Based on Topic Web Crawler. Information Technology and Information Technology, 2018, (8):97-99.
  • Zeng Xiaoqin, Yu Hong.Sentiment analysis of commodity review text based on Python [J]. Computer Knowledge and Technology, 2020, 16(8):181-183.
  • Zhang Yan, Wu Yuquan.Design of Network Data Crawler Program Based on Python [J]. Computer Programming Skills and Maintenance, 2020,(4):26-27.
  • Mona Nasr, Andrew karam, Mina Atef, Kirollos Boles, Kirollos Samir, Mario Raouf,Natural Language Processing: Text Categorization And Classifications[J].Int. J. Advanced Networking and Applications, 2020,12(02), 4542-4548.
  • ZUO Wei, ZHANG Xi, DONG Hongjuan, et al.Review of Topic Web Crawler Research [J]. Software Guide, 2020, 19(2):278-281.
  • Li Junhua. Research on Web Crawler Based on Python [J]. Modern Information Science and Technology, 2019, 3(20):26-27,30.
  • Li Lin. Design and Implementation of Web Crawler System Based on Python [J].Information and Communications Technology, 2017,(9):26-27.
  • Li Huiyun, He Zhenwei, Li Li, et al. Research on HTML5 Technology and Application Mode [J].Communications Science and Technology,2012,28(5):24-29.
  • Sun Jianyan, Ma Yuxin, Wu Wenjie.Web Crawler System Based on Python [J]. Computer Knowledge and Technology, 2019,15(26):61-63.
  • Bi Sen, Yang Yubing.Research on Web Crawler Technology Based on Python [J]. Digital Communications World, 2019, (12):107-108.
  • Huo Bingliang. Analysis of Web Crawler Technology Based on Python [J]. Digital World, 2020,(4):73-74.
  • Rahul Desai, Dr. B P Patil, Maximizing throughput using adaptive routing based on reinforcement learning[J].Int. J. Advanced Networking and Applications,2017, 09(02), 3391-3395
  • Zhao G, Shu X. Analysis and application of SQL language in Navicat for MySQL platform[J]. Wireless Internet Technology, 2017, (19):74-75.
  • Li Pei. Research on web crawler and anti-crawler technology based on Python [J].Computer and Digital Engineering, 2019, 47(6):1415-1420.

Abstract Views: 153

PDF Views: 1




  • Analysis of JD Commodity Evaluation Word Cloud Based on Web Crawler

Abstract Views: 153  |  PDF Views: 1

Authors

Wu Shiqi
School of Computer and Communication, Hunan Institute of Engineering Xiangtan 411104, China
Zhao Xing-yu
Xiangya Nursing School, Central South University, Changsha 410013, China
Qiu Fenglin
School of Computer and Communication, Hunan Institute of Engineering Xiangtan 411104, China
Tang Zhi-hang
School of Computer and Communication, Hunan Institute of Engineering Xiangtan 411104, China

Abstract


This project is the design of word cloud analysis program based on web crawler. Taking Jingdong Mall as the platform, it crawls all comment information of designated products, conducts data cleaning and data analysis on the information obtained from the review and crawler, and generates word cloud map.At the same time, the visual analysis of review data can clearly show the advantages and disadvantages of customer-centered evaluations and commodities, and provide an important reference for consumers to choose commodities and businesses to improve decision-making and optimize services. This project is developed by using Python3 language, using PyChart as the IDE, using the requests library, JSON library, World Cloud library and PyMongo library, using Navicat to connect MongoDB, using PyQT5 library to achieve visual interface, and JavaScript+HTML5+ CSS3 +MySQL+ word cloud + Boozing and Bagging algorithm for data analysis and algorithm optimization. In addition to providing consumers with cost-effective, highly evaluated and highly rated goods, it also provides the sellers with more specific data to improve their own defects.

Keywords


: Jingdong crawler; Natural Language Processing; Data mining; Visualization.

References