Limitations of Genomic Analysis on Novel Species

dc.contributor.advisorMathieson, Sara
dc.contributor.authorContreras-Orendain, Luis
dc.date.accessioned2021-07-12T11:57:40Z
dc.date.available2021-07-12T11:57:40Z
dc.date.issued2021
dc.description.abstractFor widely studied species such as humans, fruit flies, and mice, there are many sequenced genomes, but for novel species, only afew or a singular processed genome is available. Being able to study novel species is important to understand their environmental impact, in the case of invasive species, or their genetic relation to other species but they pose the greatest difficulty to study. Genetic sequences are created using assemblers and assembling the genome for a diploid species is a computationally complex task which is why diploid assemblers create a phased collapsed genome that contains similar genetic information. Applications of Pairwise Sequential Markovian Coalescent (PSMC) modeling for population size inference and Phylogenetic tree generation for building a species family tree become more difficult with novel species and it is not clear how to proceed. Other tools exist, such as Read Mapping and NCBI's Blast, that provide the initial steps to the first two tools mentioned but are not well integrated with them. As a proof of concept, we applied Read Mapping with PSMC analysis and Blast with Phylogenetic tree construction on the novel species, the Spotted Lanternfly, to investigate the feasibility of these tools on a phased genome. At least for this genome, our analysis shows there to be significant limitations due to computational run time and with the processed output of our pipeline. More work is needed to better integrate various tools to analyze novel species.
dc.description.sponsorshipHaverford College. Department of Computer Science
dc.identifier.urihttp://hdl.handle.net/10066/23531
dc.language.isoeng
dc.rights.accessOpen Access
dc.rights.urihttp://creativecommons.org/licenses/by-nc/4.0/
dc.titleLimitations of Genomic Analysis on Novel Species
dc.typeThesis
Files
Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
2021ContrerasOrendainL.pdf
Size:
1.02 MB
Format:
Adobe Portable Document Format
Description:
Thesis
Loading...
Thumbnail Image
Name:
2021ContrerasOrendainL_release.pdf
Size:
183.1 KB
Format:
Adobe Portable Document Format
Description:
** Archive Staff Only **
Collections