seandavi(s12)
Recent & Upcoming Talks
2021
Orchestra: A cloud platform for hosting hands-on computational workshop environments
Orchestra is a cloud platform for hosting hands-on computational workshop environments. In this talk, I review the detailed use case of Bioconductor Workshops and then proceed to a shallow dive into the Kubernetes infrastructure that powers Orchestra.
Dec 2, 2021 8:00 AM — 8:30 AM
Online
Slides
Bioinformatics, HPC and AI
These are just introductory slides for a panel discussion at the Supercomputer21 conference.
Nov 17, 2021 2:30 PM — 4:00 PM
St. Louis, Missouri, USA and Remote
Slides
GenomicSuperSignature: Interpretation of RNA-Seq Experiments through Robust, Efficient Comparison to Public Databases
Millions of transcriptomic profiles have been deposited in public archives, yet remain underused for the interpretation of new experiments. Existing methods for leveraging these public resources have focused on the reanalysis of existing data or analysis of new datasets independently. We present a novel approach to interpreting new transcriptomic datasets by near-instantaneous comparison to public archives without high-performance computing requirements. All necessary data and functions to apply our approach to existing or new data are included in our software available as part of the Bioconductor project.
Nov 14, 2021 4:30 PM — 6:00 PM
Supercomputer Conference
Code
Slides
Bioconductor: Increasing the Value of Public Data with Software and Data Engineering
In this talk, I provide a high-level overview of the Bioconductor project and then give some examples of tooling that connects Bioconductor to …
Oct 26, 2021 12:00 PM
Aurora, CO, USA
Sean Davis
Slides
Some quick thoughts on training, education, workforce development, and community
Oct 11, 2021 3:00 PM — 3:30 PM
University of Colorado Anschutz Medical Campus
Slides
2020
Big Data Approaches
Introduction: What Big Data problem are we trying to solve? Class 1: Need only part of the data, so extract from the whole and then operate on the extracted data; e.g., extracting a few dozen genes of interest, or finding all intervals that overlap with a chromosomal region. Class 2: Need to operate on all the data, but can operate independently on parts; e.g., computing over 1 Mb bins on the genome, or modeling over independent studies. Class 3: Need to operate on all the data, and need to work on all of it at once (possibly also for performance reasons); e.g.
Jun 4, 2020 12:00 AM
Code
2019
Howard County Community College STEM Career Panel
This is a career panel for students interested in STEM fields.
Oct 8, 2019 4:30 PM — 6:00 PM
Duncan Hall, Room 100
Bioconductor: Tools for interpreting high-throughput biological data
In this talk, I give a high-level overview of the Bioconductor project.
Aug 12, 2019 12:00 AM
Bethesda, MD, USA
Sean Davis
Slides
ATAC-Seq workshop
This workshop introduces ATAC-Seq, quality control approaches, isolating nucleosome compartments, and profile plots and heatmaps.
Jul 8, 2019 12:00 AM
Cold Spring Harbor Laboratory, Cold Spring Harbor, NY
Sean Davis, Thomas Carroll
Code
Statistical Methods in Functional Genomics
In a series of talks and exercises, I cover introduction to R, Bioconductor, genomic ranges, container classes, annotation of genes and …
Jul 1, 2019 12:00 AM
Cold Spring Harbor Laboratory, Cold Spring Harbor, NY
Sean Davis
Code
Slides
Public Data Resources and Bioconductor
The goal of this workshop is to introduce Bioconductor packages for finding, accessing, and using large-scale public data resources …
Jun 26, 2019 12:00 AM
Rockefeller University, New York, NY
Code
Slides
Lightweight data engineering, tools, and software to facilitate data reuse and data science
Lightweight tools, software, and publication processes that tie together data resources, analysis tools, documentation can powerful …
May 14, 2019 12:00 AM
Carnegie Mellon University, Pittsburgh, PA
Slides
2018
Practical Data Science and Informatics Training
This talk compares and contrasts four formats for data science and informatics education. The discussion will highlight some approaches …
Dec 3, 2018 12:00 AM
Wake Forest School of Medicine, Winston-Salem, NC
Slides
Bioconductor: software for interpreting high-throughput biological data
This talk presents a very quick overview of the Bioconductor project, focusing on its values of reproducibility, reuse, and openness.
Dec 3, 2018 12:00 AM
Wake Forest School of Medicine, Winston-Salem, NC
Slides
The cancer data ecosystem: data and cloud resources for cancer genomic data science
In this talk, I motivate the need for cloud-based cancer data resources. I provide an overview of the NCI Genomic Data Commons and how …
Oct 16, 2018 12:00 AM
Carnegie Mellon University, Pittsburgh, PA
Slides
Cloud Computing Approaches to Genomic Data Science
Jul 31, 2018 12:00 AM
Vancouver Convention Center, Vancouver, BC, Canada
Slides
Accessing and Using Public Data Resources with Bioconductor
Workshop Description: The goal of this workshop is to introduce Bioconductor packages for finding, accessing, and using large-scale public data resources including the Gene Expression Omnibus (GEO), Sequence Read Archive (SRA), the Genomic Data Commons (GDC), and Bioconductor-hosted curated data resources for metagenomics, pharmacogenomics (PharmacoDB), and The Cancer Genome Atlas.
Instructor names and contact information: Levi Waldron (City University of New York, New York, NY, USA); Benjamin Haibe-Kains (Princess Margaret Cancer Center, Toronto, Canada); Sean Davis (Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA).
Pre-requisites: Basic knowledge of R syntax; familiarity with the ExpressionSet and SummarizedExperiment classes; basic familiarity with 'omics technologies such as microarray and NGS sequencing.
Jul 28, 2018 12:00 AM
Victoria College, University of Toronto, Toronto, ON, Canada
Code
Slides
The NCI SevenBridges Cancer Genomics Cloud Resource
I demonstrate the use of the SevenBridges Cancer Genomics Cloud Resource to run RNA-seq and DNase-seq workflows on cloud resources.
Jul 7, 2018 12:00 AM
Cold Spring Harbor Lab, Laurel Hollow, NY
Slides
Bioconductor: software for reproducible genomic data science
Jul 2, 2018 12:00 AM
Cold Spring Harbor Lab, Laurel Hollow, NY
Slides
Tutorial: Using the NCI SevenBridges Cancer Genomics Cloud Resources
In this guided workshop, we will use the SevenBridges Cancer Genomics Cloud Resource to run a simple RNA-seq workflow on cloud resources.
Jun 18, 2018 12:00 AM
Purdue University, West Lafayette, IN
Slides
Brief Introduction to Machine Learning in Biomedical Informatics
Jun 18, 2018 12:00 AM
Purdue University, West Lafayette, IN
Code
Cancer Bioinformatics Primer -- Papillary Renal Cell Carcinoma Hackathon
More than 150 researchers, engineers and data enthusiasts gathered at Salesforce in San Francisco for an intense weekend of exploration …
May 19, 2018 12:00 AM
Salesforce, San Francisco, CA
Slides
Bioconductor: A Hub in the Cancer Data Ecosystem
Apr 26, 2018 12:00 AM
University of Alabama School of Medicine, Birmingham, AL
The GenomicDataCommons Package: The bridge between the NCI Genomic Data Commons and Bioconductor
The National Cancer Institute has established the Genomic Data Commons (GDC) to provide access to large-scale, publicly available …
Apr 25, 2018 12:00 AM
University of Alabama School of Medicine, Birmingham, AL
Bioconductor: A Potential Hub in the Cancer Biomarker Data Ecosystem
Feb 8, 2018 12:00 AM
NCI Campus, Shady Grove, MD
Code
Slides
A quick tour of the cancer genomics landscape (in 45 minutes)
Jan 18, 2018 12:00 AM
Wake Forest Comprehensive Cancer Center, Winston-Salem, NC
2017
Bioconductor: orchestrating high-throughput biological data analysis
Progress in biotechnology is continually leading to new types of data, resulting in data sets that are rapidly increasing in volume, …
Dec 4, 2017 12:00 AM
Shady Grove, MD, USA
Slides
Thoughts on an Agricultural Data Ecosystem
The USDA and the Agricultural Research Service (ARS) hosted a workshop to develop their ten year plan for genomics and phenotype. These …
Nov 15, 2017 12:00 AM
Beltsville, MD, USA
Slides
Data Visualization Per Karl Broman and Rafa Irizarry
This is a very basic introduction to good vs bad graphics, built around data and material from Karl Broman and Rafa Irizarry. Concepts …
Sep 12, 2017 12:00 PM
National Institutes of Health, Bethesda, MD
Code
Slides
Machine Learning in Biomedicine
I present a high-level introduction to machine learning in biomedicine with a few examples from the literature. The talk is accompanied …
Jul 9, 2017 12:00 AM
Cold Spring Harbor Laboratory, Cold Spring Harbor, NY
Code
Slides