This study utilizes machine learning models to predict missing and unclear signs from the Indus script, a writing system from an ancient civilization in the Indian subcontinent.
Read More...Statistical models for identifying missing and unclear signs of the Indus script
This study utilizes machine learning models to predict missing and unclear signs from the Indus script, a writing system from an ancient civilization in the Indian subcontinent.
Read More...Modeling Hartree-Fock approximations of the Schrödinger Equation for multielectron atoms from Helium to Xenon using STO-nG basis sets
The energy of an atom is extremely useful in nuclear physics and reaction mechanism pathway determination but is challenging to compute. This work aimed to synthesize regression models for Pople Gaussian expansions of Slater-type Orbitals (STO-nG) atomic energy vs. atomic number scatter plots to allow for easy approximation of atomic energies without using computational chemistry methods. The data indicated that of the regressions, sinusoidal regressions most aptly modeled the scatter plots.
Read More...Discovery of the Heart in Mathematics: Modeling the Chaotic Behaviors of Quantized Periods in the Mandelbrot Set
This study aimed to predict and explain chaotic behavior in the Mandelbrot Set, one of the world’s most popular models of fractals and exhibitors of Chaos Theory. The authors hypothesized that repeatedly iterating the Mandelbrot Set’s characteristic function would give rise to a more intricate layout of the fractal and elliptical models that predict and highlight “hotspots” of chaos through their overlaps. The positive and negative results from this study may provide a new perspective on fractals and their chaotic nature, helping to solve problems involving chaotic phenomena.
Read More...Transfer Learning for Small and Different Datasets: Fine-Tuning A Pre-Trained Model Affects Performance
In this study, the authors seek to improve a machine learning algorithm used for image classification: identifying male and female images. In addition to fine-tuning the classification model, they investigate how accuracy is affected by their changes (an important task when developing and updating algorithms). To determine accuracy, a set of images is used to train the model and then a separate set of images is used for validation. They found that the validation accuracy was close to the training accuracy. This study contributes to the expanding areas of machine learning and its applications to image identification.
Read More...Comparison of three large language models as middle school math tutoring assistants
Middle school math forms the basis for advanced mathematical courses leading up to the university level. Large language models (LLMs) have the potential to power next-generation educational technologies, acting as digital tutors to students. The main objective of this study was to determine whether LLMs like ChatGPT, Bard, and Llama 2 can serve as reliable middle school math tutoring assistants on three tutoring tasks: hint generation, comprehensive solution, and exercise creation.
Read More...An explainable model for content moderation
The authors looked at the ability of machine learning algorithms to interpret language given their increasing use in moderating content on social media. Using an explainable model they were able to achieve 81% accuracy in detecting fake vs. real news based on language of posts alone.
Read More...A land use regression model to predict emissions from oil and gas production using machine learning
Emissions from oil and natural gas (O&G) wells such as nitrogen dioxide (NO2), volatile organic compounds (VOCs), and ozone (O3) can severely impact the health of communities located near wells. In this study, we used O&G activity and wind-carried emissions to quantify the extent to which O&G wells affect the air quality of nearby communities, revealing that NO2, NOx, and NO are correlated to O&G activity. We then developed a novel land use regression (LUR) model using machine learning based on O&G prevalence to predict emissions.
Read More...Identification of a core set of model agnostic mRNA associated with nonalcoholic steatohepatitis (NASH)
In this study, the authors analyze gene expression datasets to determine if there is a core set of genes dysregulated during nonalcoholic steatohepatitis.
Read More...Rhythmic lyrics translation: Customizing a pre-trained language model using stacked fine-tuning
Neural machine translation (NMT) is a software that uses neural network techniques to translate text from one language to another. However, one of the most famous NMT models—Google Translate—failed to give an accurate English translation of a famous Korean nursery rhyme, "Airplane" (비행기). The authors fine-tuned a pre-trained model first with a dataset from the lyrics domain, and then with a smaller dataset containing the rhythmical properties, to teach the model to translate rhythmically accurate lyrics. This stacked fine-tuning method resulted in an NMT model that could maintain the rhythmical characteristics of lyrics during translation while single fine-tuned models failed to do so.
Read More...A novel encoding technique to improve non-weather-based models for solar photovoltaic forecasting
Several studies have applied different machine learning (ML) techniques to the area of forecasting solar photovoltaic power production. Most of these studies use weather data as inputs to predict power production; however, there are numerous practical issues with the procurement of this data. This study proposes models that do not use weather data as inputs, but rather use past power production data as a more practical substitute to weather-based models. Our proposed models demonstrate a better, cheaper, and more reliable alternatives to current weather models.
Read More...Search articles by title, author name, or tags