Sequence accessibility is an important factor affecting gene expression. Sequence accessibility or openness impacts the likelihood that a gene is transcribed and translated into a protein and performs functions and manifests traits. There are many potential factors that affect the accessibility of a gene. In this study, our hypothesis was that the content of nucleotides in a genetic sequence predicts its accessibility. Using a machine learning linear regression model, we studied the relationship between nucleotide content and accessibility.
Although the 5-year survival rate for colorectal cancer is below 10%, it increases to greater than 90% if it is diagnosed early. We hypothesized from our research that analyzing non-synonymous single nucleotide variants (SNVs) in a patient's exome sequence would be an indicator for high genetic risk of developing colorectal cancer.
The sequence of nitrogenous bases that make up the DNA of organisms can contain hidden mathematical sequences. Here the authors used BioPython, a programming tool, to find an organism that displays Gijswijt’s Sequence in its genome. In this manner they found that the common carp best displays Gijswijt’s Sequence in its genome.
A central challenge of cancer therapy is identifying treatments that will effectively target cancer cells while minimizing effects on healthy cells. To identify potential targets for treating a multiple myeloma, a frequently incurable cancer, Kochenderfer and Kochenderfer analyze RNA sequencing data from the Cancer Cell Line Encyclopedia to find genes with high expression in multiple myeloma cells and low expression in normal tissues
In this study, the authors developed a model named DNA Sequence Embedding Network (DNA-SEnet) to classify DNA-asthma associations using their genomic patterns.
Wound-healing involves a sequence of events, such as inflammation, proliferation, and migration of different cell types like fibroblasts. Zinc Finger CCCH-type with G-Patch Domain Containing Protein (ZGPAT), encodes a protein that has its main role as a transcription repressor by binding to a specific DNA sequence. The aim of the study was to find out whether inhibiting ZGPAT will expedite the wound healing process by accelerating cell migration. This treatment strategy can provide a key to the development of wound healing strategies in medicine and cellular biology.
Although the United States maintains millions of square kilometers of nature reserves to protect the biodiversity of the specimens living there, little is known about how confining these species within designated protected lands influences the genetic variation required for a healthy population. In this study, the authors sequenced genetic barcodes of insects from a recently established nature reserve, the Southwestern Riverside County Multi-Species Reserve (SWRCMSR), and a non-protected area, the Mt. San Jacinto College (MSJC) Menifee campus, to compare the genetic variation between the two populations. Their results demonstrated that the midge fly population from the SWRCMSR had fewer unique DNA barcode sequence changes than the MSJC population, indicating that the comparatively younger nature reserve's population had likely not yet established its own unique genetic drift changes.
Advancement in DNA sequencing technology has greatly increased our understanding about the role of bacteria in soil. The authors of this study examine the microbial content of soil samples taken from three locations in southern New Hampshire with varying pH and plant composition.
The application of machine learning techniques has facilitated the automatic annotation of behavior in video sequences, offering a promising approach for ethological studies by reducing the manual effort required for annotating each video frame. Nevertheless, before solely relying on machine-generated annotations, it is essential to evaluate the accuracy of these annotations to ensure their reliability and applicability. While it is conventionally accepted that there cannot be a perfect annotation, the degree of error associated with machine-generated annotations should be commensurate with the error between different human annotators. We hypothesized that machine learning supervised with adequate human annotations would be able to accurately predict body parts from video sequences. Here, we conducted a comparative analysis of the quality of annotations generated by humans and machines for the body parts of sheep during treadmill walking. For human annotation, two annotators manually labeled six body parts of sheep in 300 frames. To generate machine annotations, we employed the state-of-the-art pose-estimating library, DeepLabCut, which was trained using the frames annotated by human annotators. As expected, the human annotations demonstrated high consistency between annotators. Notably, the machine learning algorithm also generated accurate predictions, with errors comparable to those between humans. We also observed that abnormal annotations with a high error could be revised by introducing Kalman Filtering, which interpolates the trajectory of body parts over the time series, enhancing robustness. Our results suggest that conventional transfer learning methods can generate behavior annotations as accurate as those made by humans, presenting great potential for further research.