Browse Articles

Transfer Learning for Small and Different Datasets: Fine-Tuning A Pre-Trained Model Affects Performance

Gupta et al. | Oct 18, 2020

Transfer Learning for Small and Different Datasets: Fine-Tuning A Pre-Trained Model Affects Performance

In this study, the authors seek to improve a machine learning algorithm used for image classification: identifying male and female images. In addition to fine-tuning the classification model, they investigate how accuracy is affected by their changes (an important task when developing and updating algorithms). To determine accuracy, a set of images is used to train the model and then a separate set of images is used for validation. They found that the validation accuracy was close to the training accuracy. This study contributes to the expanding areas of machine learning and its applications to image identification.

Read More...

Using data science along with machine learning to determine the ARIMA model’s ability to adjust to irregularities in the dataset

Choudhary et al. | Jul 26, 2021

Using data science along with machine learning to determine the ARIMA model’s ability to adjust to irregularities in the dataset

Auto-Regressive Integrated Moving Average (ARIMA) models are known for their influence and application on time series data. This statistical analysis model uses time series data to depict future trends or values: a key contributor to crime mapping algorithms. However, the models may not function to their true potential when analyzing data with many different patterns. In order to determine the potential of ARIMA models, our research will test the model on irregularities in the data. Our team hypothesizes that the ARIMA model will be able to adapt to the different irregularities in the data that do not correspond to a certain trend or pattern. Using crime theft data and an ARIMA model, we determined the results of the ARIMA model’s forecast and how the accuracy differed on different days with irregularities in crime.

Read More...

An Analysis of Soil Microhabitats in Revolutionary War, Civil War, and Modern Graveyards on Long Island, NY

Caputo et al. | May 05, 2019

An Analysis of Soil Microhabitats in Revolutionary War, Civil War, and Modern Graveyards on Long Island, NY

Previously established data indicate that cemeteries have contributed to groundwater and soil pollution, as embalming fluids can impact the microbiomes that exist in decomposing remains. In this study, Caputo et al hypothesized that microbial variation would be high between cemeteries from different eras due to dissimilarities between embalming techniques employed, and furthermore, that specific microbes would act as an indication for certain contaminants. Overall, they found that there is a variation in the microbiomes of the different eras’ cemeteries according to the concentrations of the phyla and their more specific taxa.

Read More...

The velocity of white dwarf stars relates to their magnitude

Glazer et al. | Jun 30, 2023

The velocity of white dwarf stars relates to their magnitude
Image credit: Jacub Gomez

Using the European Space Agency’s Gaia dataset, the authors analyzed the relationship between white dwarfs’ magnitudes and proper motions. They hypothesized that older white dwarf stars may have different velocities than younger ones, possibly that stars slow down as they age. They found that the white dwarfs in the dataset were substantially redder and higher magnitude (traits traditionally associated with older stars) as compared to their non-fast counterparts.

Read More...

Similarity Graph-Based Semi-supervised Methods for Multiclass Data Classification

Balaji et al. | Sep 11, 2021

Similarity Graph-Based Semi-supervised Methods for Multiclass Data Classification

The purpose of the study was to determine whether graph-based machine learning techniques, which have increased prevalence in the last few years, can accurately classify data into one of many clusters, while requiring less labeled training data and parameter tuning as opposed to traditional machine learning algorithms. The results determined that the accuracy of graph-based and traditional classification algorithms depends directly upon the number of features of each dataset, the number of classes in each dataset, and the amount of labeled training data used.

Read More...

Assessing and Improving Machine Learning Model Predictions of Polymer Glass Transition Temperatures

Ramprasad et al. | Mar 18, 2020

Assessing and Improving Machine Learning Model Predictions of Polymer Glass Transition Temperatures

In this study, the authors test whether providing a larger dataset of glass transition temperatures (Tg) to train the machine-learning platform Polymer Genome would improve its accuracy. Polymer Genome is a machine learning based data-driven informatics platform for polymer property prediction and Tg is one property needed to design new polymers in silico. They found that training the model with their larger, curated dataset improved the algorithm's Tg, providing valuable improvements to this useful platform.

Read More...

Transfer learning and data augmentation in osteosarcoma cancer detection

Chu et al. | Jun 03, 2023

Transfer learning and data augmentation in osteosarcoma cancer detection
Image credit: Chu and Khan 2023

Osteosarcoma is a type of bone cancer that affects young adults and children. Early diagnosis of osteosarcoma is crucial to successful treatment. The current methods of diagnosis, which include imaging tests and biopsy, are time consuming and prone to human error. Hence, we used deep learning to extract patterns and detect osteosarcoma from histological images. We hypothesized that the combination of two different technologies (transfer learning and data augmentation) would improve the efficacy of osteosarcoma detection in histological images. The dataset used for the study consisted of histological images for osteosarcoma and was quite imbalanced as it contained very few images with tumors. Since transfer learning uses existing knowledge for the purpose of classification and detection, we hypothesized it would be proficient on such an imbalanced dataset. To further improve our learning, we used data augmentation to include variations in the dataset. We further evaluated the efficacy of different convolutional neural network models on this task. We obtained an accuracy of 91.18% using the transfer learning model MobileNetV2 as the base model with various geometric transformations, outperforming the state-of-the-art convolutional neural network based approach.

Read More...

Utilizing meteorological data and machine learning to predict and reduce the spread of California wildfires

Bilwar et al. | Jan 15, 2024

Utilizing meteorological data and machine learning to predict and reduce the spread of California wildfires
Image credit: Pixabay

This study hypothesized that a machine learning model could accurately predict the severity of California wildfires and determine the most influential meteorological factors. It utilized a custom dataset with information from the World Weather Online API and a Kaggle dataset of wildfires in California from 2013-2020. The developed algorithms classified fires into seven categories with promising accuracy (around 55 percent). They found that higher temperatures, lower humidity, lower dew point, higher wind gusts, and higher wind speeds are the most significant contributors to the spread of a wildfire. This tool could vastly improve the efficiency and preparedness of firefighters as they deal with wildfires.

Read More...

Search Articles

Search articles by title, author name, or tags

Clear all filters

Popular Tags

Browse by school level