The rapid growth of high-throughput data, including data from -omics technologies, has given rise to significant demand for data science skills and experience with bioinformatics methods of analysis. To help introduce biologists, clinicians, and students to cutting-edge bioinformatics methods and commonly used data science concepts, our team designed an online bioinformatics training program called OmicsLogic. This online summer program is designed for data science beginners and students interested in data-driven research questions.
Session Topics: Overview of commonly used “omics” data; NGS, Mass-Spec, phenotypic data (genomics, transcriptomics, metagenomics); Phenotypes: clinical, imaging, metadata (research, clinical, biotech, pharma); The...
Session Topics: Availability and variability of data; unprecedented detail and volume; data heterogeneity, complexity, and noise; need for structure and reproducibility.
Session Topics: Analysis logic: from raw reads to a table of expression (RNA-seq example); common sources of unwanted technical variation; pre-processing steps, filtering and...
Summary statistics (histogram, boxplot, scatterplot of two samples compared with each other, Excel “summary statistics” operation); visualization of practice data – compare the...
Session Topics: Learn how to make statistical representations of the data and how to address missing data or data errors. How do you compare the...
Session Topics: Hypothesis testing 101: compare conditions and find the p-value; data-driven discovery: discover groups or conditions; process of inference for a machine versus a human.
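To give a flavor of "compare conditions and find the p-value", here is a minimal sketch of one way to do it: an exact two-sided permutation test on the difference in group means. The function name `permutation_p_value` and the expression values are hypothetical and chosen only for illustration. The session itself may use a different test (for example, a t-test).

```python
from itertools import combinations
from statistics import mean


def permutation_p_value(group_a, group_b):
    """Exact two-sided permutation test on the difference in group means.

    Enumerates every way to split the pooled values into groups of the
    original sizes and counts how often the absolute mean difference is
    at least as large as the observed one.
    """
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    n_b = len(pooled) - n_a
    total = sum(pooled)
    observed = abs(mean(group_a) - mean(group_b))

    extreme = 0
    splits = 0
    for idx in combinations(range(len(pooled)), n_a):
        sum_a = sum(pooled[i] for i in idx)
        diff = abs(sum_a / n_a - (total - sum_a) / n_b)
        # Small tolerance guards against floating-point rounding.
        if diff >= observed - 1e-12:
            extreme += 1
        splits += 1
    return extreme / splits


# Hypothetical expression values for one gene in two conditions.
control = [5.1, 4.9, 5.3, 5.0]
treated = [6.2, 6.0, 6.5, 6.1]

p = permutation_p_value(control, treated)
print(f"p-value: {p:.4f}")
```

Enumerating all splits is only practical for small groups. With larger sample sizes, a random subset of permutations or a parametric test is the usual choice.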
Session Topics: Finding patterns in the data and methods of data mining. PCA, k-means, hierarchical clustering (run example on T-Bio and then open the script...
Session Topics: Conceptual introduction: known sample data is used to train the computer to recognize patterns and apply them to unknown data. Binary decision...
Session Topics: Technical accuracy (ROC curve); logical or biological relevance (compare feature selection with PCA by subtype or clinical phenotype); trained model validation: Learning...
Session Topics: The interaction between artificial intelligence and humans; differences between ML and AI; in what ways can AI support human research and decision...