“Random Forest”Search All-EnPress Publisher LLC.

Apr 1, 2025

Wheat crop detection by combining NDVI time series, phenology with satellite imagery using machine learning

Creating a crop type map is a dominant yet complicated model to produce. This study aims to determine the best model to identify the wheat crop in the Haridwar district, Uttarakhand, India, by presenting a novel approach using machine learning techniques for time series data derived from the Sentinel-2 satellite spanned from mid-November to April. The proposed methodology combines the Normalized Difference Vegetation Index (NDVI), satellite bands like red, green, blue, and NIR, feature extraction, and classification algorithms to capture crop growth's temporal dynamics effectively. Three models, Random Forest, Convolutional Neural Networks, and Support Vector Machine, were compared to obtain the start of season (SOS). It is validated and evaluated using the performance metrics. Further, Random Forest stood out as the best model statistically and spatially for phenology parameter extraction with the least RMSE value at 19 days. CNN and Random Forest models were used to classify wheat crops by combining SOS, blue, green, red, NIR bands, and NDVI. Random Forest produces a more accurate wheat map with an accuracy of 69% and 0.5 MeanIoU. It was observed that CNN is not able to distinguish between wheat and other crops. The result revealed that incorporating the Sentinel-2 satellite data bearing a high spatial and temporal resolution with supervised machine-learning models and crop phenology metrics can empower the crop type classification process.

Abstract

Download PDF(961.82KB)

XML

0

9

Jan 16, 2025

Spatial analysis and classification of land use patterns in Lucknow district, UP, India using GIS and random forest approach

Mapping land use and land cover (LULC) is essential for comprehending changes in the environment and promoting sustainable planning. To achieve accurate and effective LULC mapping, this work investigates the integration of Geographic Information Systems (GIS) with Machine Learning (ML) methodology. Different types of land covers in the Lucknow district were classified using the Random Forest (RF) algorithm and Landsat satellite images. Since the research area consists of a variety of landforms, there are issues with classification accuracy. These challenges are met by combining supplementary data into the GIS framework and adjusting algorithm parameters like selection of cloud free images and homogeneous training samples. The result demonstrates a net increase of 484.59 km² in built-up areas. A net decrement of 75.44 km² was observed in forest areas. A drastic net decrease of 674.52 km² was observed for wetlands. Most of the wastelands have been converted into urban areas and agricultural land based on their suitability with settlements or crops. The classifications achieved an overall accuracy near 90%. This strategy provides a reliable way to track changes in land cover, supporting resource management, urban planning, and environmental preservation. The results highlight how sophisticated computational methods can enhance the accuracy of LULC evaluations.

Abstract

Download PDF(604.71KB)

XML

0

7

Dec 13, 2024

Identifying suspicious internet threat exchanges using machine learning algorithms to ensure privacy and cybersecurity in the USA

The usage of cybersecurity is growing steadily because it is beneficial to us. When people use cybersecurity, they can easily protect their valuable data. Today, everyone is connected through the internet. It’s much easier for a thief to connect important data through cyber-attacks. Everyone needs cybersecurity to protect their precious personal data and sustainable infrastructure development in data science. However, systems protecting our data using the existing cybersecurity systems is difficult. There are different types of cybersecurity threats. It can be phishing, malware, ransomware, and so on. To prevent these attacks, people need advanced cybersecurity systems. Many software helps to prevent cyber-attacks. However, these are not able to early detect suspicious internet threat exchanges. This research used machine learning models in cybersecurity to enhance threat detection. Reducing cyberattacks internet and enhancing data protection; this system makes it possible to browse anywhere through the internet securely. The Kaggle dataset was collected to build technology to detect untrustworthy online threat exchanges early. To obtain better results and accuracy, a few pre-processing approaches were applied. Feature engineering is applied to the dataset to improve the quality of data. Ultimately, the random forest, gradient boosting, XGBoost, and Light GBM were used to achieve our goal. Random forest obtained 96% accuracy, which is the best and helpful to get a good outcome for the social development in the cybersecurity system.

Abstract

Download PDF(575.44KB)

XML

0

6

Nov 11, 2024

Advancing user classification models: A comparative analysis of machine learning approaches to enhance faculty password policies at the University of Buraimi

In this paper, we assess the results of experiment with different machine learning algorithms for the data classification on the basis of accuracy, precision, recall and F1-Score metrics. We collected metrics like Accuracy, F1-Score, Precision, and Recall: From the Neural Network model, it produced the highest Accuracy of 0.129526 also highest F1-Score of 0.118785, showing that it has the correct balance of precision and recall ratio that can pick up important patterns from the dataset. Random Forest was not much behind with an accuracy of 0.128119 and highest precision score of 0.118553 knit a great ability for handling relations in large dataset but with slightly lower recall in comparison with Neural Network. This ranked the Decision Tree model at number three with a 0.111792, Accuracy Score while its Recall score showed it can predict true positives better than Support Vector Machine (SVM), although it predicts more of the positives than it actually is a majority of the times. SVM ranked fourth, with accuracy of 0.095465 and F1-Score of 0.067861, the figure showing difficulty in classification of associated classes. Finally, the K-Neighbors model took the 6th place, with the predetermined accuracy of 0.065531 and the unsatisfactory results with the precision and recall indicating the problems of this algorithm in classification. We found out that Neural Networks and Random Forests are the best algorithms for this classification task, while K-Neighbors is far much inferior than the other classifiers.

Abstract

Download PDF(556.88KB)

XML

0

23

Sept 10, 2024

Harmonizing sentiments: Analyzing user reviews of Spotify through sentiment analysis

This research investigates the sentiment of user reviews on Spotify, with a particular focus on the Indonesian market, leveraging advanced sentiment analysis techniques. We employed three prominent classification models—Naïve Bayes, Support Vector Machine (SVM), and Random Forest—to analyze a dataset of 14,296 user reviews extracted from the Google Play Store and App Store. These findings reveal that the SVM model achieved the highest performance, with an F1-score of 0.875 and an accuracy of 0.874, outperforming Naïve Bayes and Random Forest, which scored accuracies of 0.857 and 0.856 respectively. These results highlight not only the significance of this research which offers valuable contributions to the broader academic discourse on digital marketing, sentiment analysis, and consumer behavior. Additionally, it also showcases the robustness and superior performance of SVM and Random Forest in various sentiment analysis contexts. This study not only provides valuable insights for Spotify’s future development strategies but also contributes to the broader academic discourse on sentiment analysis and machine learning model performance in digital marketing. By highlighting the efficacy of specific models, this research underscores the importance of model selection in sentiment analysis, paving the way for more accurate and effective sentiment analysis applications in the music streaming industry.

Abstract

Download PDF(3.13M)

XML

0