Institutional Scholarship

Limitations of Genomic Analysis on Novel Species

Show simple item record

dc.contributor.advisor Mathieson, Sara
dc.contributor.author Contreras-Orendain, Luis
dc.date.accessioned 2021-07-12T11:57:40Z
dc.date.available 2021-07-12T11:57:40Z
dc.date.issued 2021
dc.identifier.uri http://hdl.handle.net/10066/23531
dc.description.abstract For widely studied species such as humans, fruit flies, and mice, there are many sequenced genomes, but for novel species, only afew or a singular processed genome is available. Being able to study novel species is important to understand their environmental impact, in the case of invasive species, or their genetic relation to other species but they pose the greatest difficulty to study. Genetic sequences are created using assemblers and assembling the genome for a diploid species is a computationally complex task which is why diploid assemblers create a phased collapsed genome that contains similar genetic information. Applications of Pairwise Sequential Markovian Coalescent (PSMC) modeling for population size inference and Phylogenetic tree generation for building a species family tree become more difficult with novel species and it is not clear how to proceed. Other tools exist, such as Read Mapping and NCBI's Blast, that provide the initial steps to the first two tools mentioned but are not well integrated with them. As a proof of concept, we applied Read Mapping with PSMC analysis and Blast with Phylogenetic tree construction on the novel species, the Spotted Lanternfly, to investigate the feasibility of these tools on a phased genome. At least for this genome, our analysis shows there to be significant limitations due to computational run time and with the processed output of our pipeline. More work is needed to better integrate various tools to analyze novel species.
dc.description.sponsorship Haverford College. Department of Computer Science
dc.language.iso eng
dc.rights.uri http://creativecommons.org/licenses/by-nc/4.0/
dc.title Limitations of Genomic Analysis on Novel Species
dc.type Thesis
dc.rights.access Open Access


Files in this item

This item appears in the following Collection(s)

Show simple item record

http://creativecommons.org/licenses/by-nc/4.0/ Except where otherwise noted, this item's license is described as http://creativecommons.org/licenses/by-nc/4.0/

Search


Browse

My Account