The usage of cybersecurity is growing steadily because it is beneficial to us. When people use cybersecurity, they can easily protect their valuable data. Today, everyone is connected through the internet. It’s much easier for a thief to connect important data through cyber-attacks. Everyone needs cybersecurity to protect their precious personal data and sustainable infrastructure development in data science. However, systems protecting our data using the existing cybersecurity systems is difficult. There are different types of cybersecurity threats. It can be phishing, malware, ransomware, and so on. To prevent these attacks, people need advanced cybersecurity systems. Many software helps to prevent cyber-attacks. However, these are not able to early detect suspicious internet threat exchanges. This research used machine learning models in cybersecurity to enhance threat detection. Reducing cyberattacks internet and enhancing data protection; this system makes it possible to browse anywhere through the internet securely. The Kaggle dataset was collected to build technology to detect untrustworthy online threat exchanges early. To obtain better results and accuracy, a few pre-processing approaches were applied. Feature engineering is applied to the dataset to improve the quality of data. Ultimately, the random forest, gradient boosting, XGBoost, and Light GBM were used to achieve our goal. Random forest obtained 96% accuracy, which is the best and helpful to get a good outcome for the social development in the cybersecurity system.
The purpose of this paper is to explore the performance of ridge regression and the random forest model improved by genetic algorithm in predicting the Boston house price data set and conduct a comparative analysis. To achieve it, the data is divided into training set and test set according to the ratio of 70-30. The RidgeCV library is used to select the best regularization parameter for the Ridge regression model, and for the random forest model, the genetic algorithm is used to optimize the model's hyperparameters. The result shows that compared with ridge regression, the random forest model improved by genetic algorithm can perform better in the regression problem of Boston house prices.
Based on the population change data of 2005–2009, 2010–2014, 2015–2019 and 2005–2019, the shrinking cities in Northeast China are determined to analyze their spatial distribution pattern. And the influencing factors and effects of shrinking cities in Northeast China are explored by using multiple linear regression method and random forest regression method. The results show that: 1) In space, the shrinking cities in Northeast China are mainly distributed in the “land edge” areas represented by Changbai Mountain, Sanjiang Plain, Xiaoxing’an Mountain and Daxing’an Mountain. In terms of time, the contraction center shows an obvious trend of moving northward, while the opposite expansion center shows a trend of moving southward, and the shrinking cities gather further; 2) in the study of influencing factors, the results of multiple linear regression and random forest regression show that socio-economic factors play a major role in the formation of shrinking cities; 3) the precision of random forest regression is higher than that of multiple linear regression. The results show that per capita GDP has the greatest impact on the contraction intensity, followed by the unemployment rate, science and education expenses and the average wage of on-the-job workers. Among the four influencing factors, only the unemployment rate promotes the contraction, and the other three influencing factors inhibit the formation of shrinking cities to various degrees.
Copyright © by EnPress Publisher. All rights reserved.