Matching Items (36)

Description
Background
Multicellular organisms consist of cells of many different types that are established during development. Each type of cell is characterized by the unique combination of expressed gene products as a result of spatiotemporal gene regulation. Currently, a fundamental challenge in regulatory biology is to elucidate the gene expression controls that generate the complex body plans during development. Recent advances in high-throughput biotechnologies have generated spatiotemporal expression patterns for thousands of genes in the model organism fruit fly Drosophila melanogaster. Enhancing existing qualitative methods with quantitative analysis based on the computational tools we present in this paper would provide promising ways to address key scientific questions.
Results
We develop a set of computational methods and open source tools for identifying co-expressed embryonic domains and the associated genes simultaneously. To map the expression patterns of many genes into the same coordinate space and account for the embryonic shape variations, we develop a mesh generation method to deform a meshed generic ellipse to each individual embryo. We then develop a co-clustering formulation to cluster the genes and the mesh elements, thereby identifying co-expressed embryonic domains and the associated genes simultaneously. Experimental results indicate that the gene and mesh co-clusters can be correlated to key developmental events during the stages of embryogenesis we study. The open source software tool has been made available at http://compbio.cs.odu.edu/fly/.
Conclusions
Our mesh generation and machine learning methods and tools improve upon the flexibility, ease-of-use and accuracy of existing methods.
Contributors: Zhang, Wenlu (Author) / Feng, Daming (Author) / Li, Rongjian (Author) / Chernikov, Andrey (Author) / Chrisochoides, Nikos (Author) / Osgood, Christopher (Author) / Konikoff, Charlotte (Author) / Newfeld, Stuart (Author) / Kumar, Sudhir (Author) / Ji, Shuiwang (Author) / Biodesign Institute (Contributor) / Center for Evolution and Medicine (Contributor) / College of Liberal Arts and Sciences (Contributor) / School of Life Sciences (Contributor)
Created: 2013-12-28

Learning Sparse Representations for Fruit-Fly Gene Expression Pattern Image Annotation and Retrieval
Description
Background
Fruit fly embryogenesis is one of the best understood animal development systems, and the spatiotemporal gene expression dynamics in this process are captured by digital images. Analysis of these high-throughput images will provide novel insights into the functions, interactions, and networks of animal genes governing development. To facilitate comparative analysis, web-based interfaces have been developed to conduct image retrieval based on body part keywords and images. Currently, the keyword annotation of spatiotemporal gene expression patterns is conducted manually. However, this manual practice does not scale with the continuously expanding collection of images. In addition, existing image retrieval systems based on the expression patterns may be made more accurate using keywords.
Results
In this article, we adapt advanced data mining and computer vision techniques to address the key challenges in annotating and retrieving fruit fly gene expression pattern images. To boost the performance of image annotation and retrieval, we propose representations integrating spatial information and sparse features, overcoming the limitations of prior schemes.
Conclusions
We perform systematic experimental studies to evaluate the proposed schemes in comparison with current methods. Experimental results indicate that the integration of spatial information and sparse features leads to consistent performance improvement in image annotation, while for the task of retrieval, sparse features alone yield better results.
Contributors: Yuan, Lei (Author) / Woodard, Alexander (Author) / Ji, Shuiwang (Author) / Jiang, Yuan (Author) / Zhou, Zhi-Hua (Author) / Kumar, Sudhir (Author) / Ye, Jieping (Author) / Biodesign Institute (Contributor) / Center for Evolution and Medicine (Contributor) / Ira A. Fulton School of Engineering (Contributor) / College of Liberal Arts and Sciences (Contributor) / School of Life Sciences (Contributor)
Created: 2012-05-23

Description
Background
Drosophila melanogaster has been established as a model organism for investigating the developmental gene interactions. The spatio-temporal gene expression patterns of Drosophila melanogaster can be visualized by in situ hybridization and documented as digital images. Automated and efficient tools for analyzing these expression images will provide biological insights into the gene functions, interactions, and networks. To facilitate pattern recognition and comparison, many web-based resources have been created to conduct comparative analysis based on the body part keywords and the associated images. With the fast accumulation of images from high-throughput techniques, manual inspection of images will impose a serious impediment on the pace of biological discovery. It is thus imperative to design an automated system for efficient image annotation and comparison.
Results
We present a computational framework to perform anatomical keywords annotation for Drosophila gene expression images. The spatial sparse coding approach is used to represent local patches of images in comparison with the well-known bag-of-words (BoW) method. Three pooling functions including max pooling, average pooling and Sqrt (square root of mean squared statistics) pooling are employed to transform the sparse codes to image features. Based on the constructed features, we develop both an image-level scheme and a group-level scheme to tackle the key challenges in annotating Drosophila gene expression pattern images automatically. To deal with the imbalanced data distribution inherent in image annotation tasks, the undersampling method is applied together with majority vote. Results on Drosophila embryonic expression pattern images verify the efficacy of our approach.
Conclusion
In our experiment, the three pooling functions perform comparably well in feature dimension reduction. Undersampling with majority vote is shown to be effective in tackling the problem of imbalanced data. Moreover, combining sparse coding with the image-level scheme leads to consistent performance improvement in keyword annotation.
Contributors: Sun, Qian (Author) / Muckatira, Sherin (Author) / Yuan, Lei (Author) / Ji, Shuiwang (Author) / Newfeld, Stuart (Author) / Kumar, Sudhir (Author) / Ye, Jieping (Author) / Biodesign Institute (Contributor) / Center for Evolution and Medicine (Contributor) / College of Liberal Arts and Sciences (Contributor) / School of Life Sciences (Contributor) / Ira A. Fulton School of Engineering (Contributor)
Created: 2013-12-03
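
The three pooling functions named in the record above (max, average, and Sqrt pooling) can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `pool`, the patches-by-atoms input layout, and the choice to take the maximum of raw rather than absolute coefficients are all assumptions made for illustration.

```python
import math
from typing import List, Sequence


def pool(codes: Sequence[Sequence[float]], method: str) -> List[float]:
    """Collapse per-patch sparse codes into one image-level feature vector.

    `codes` is a hypothetical layout: one row per image patch, one column
    per dictionary atom. The output has one value per atom.
    """
    if not codes:
        raise ValueError("need at least one patch")
    # Regroup so that each tuple holds one atom's coefficients across patches.
    columns = list(zip(*codes))
    if method == "max":
        # Assumption: maximum of the raw coefficients (not absolute values).
        return [max(col) for col in columns]
    if method == "average":
        return [sum(col) / len(col) for col in columns]
    if method == "sqrt":
        # Square root of the mean of squared coefficients.
        return [math.sqrt(sum(v * v for v in col) / len(col)) for col in columns]
    raise ValueError(f"unknown pooling method: {method}")


# Two patches, two dictionary atoms (made-up numbers).
example_codes = [[0.0, 3.0], [4.0, 0.0]]
max_features = pool(example_codes, "max")          # [4.0, 3.0]
average_features = pool(example_codes, "average")  # [2.0, 1.5]
sqrt_features = pool(example_codes, "sqrt")
```

Whatever the number of patches, each function returns a vector with one entry per dictionary atom, which is why all three can serve the same feature-construction step.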

Description
Diacylglycerol kinase catalyses the ATP-dependent conversion of diacylglycerol to phosphatidic acid in the plasma membrane of Escherichia coli. The small size of this integral membrane trimer, which has 121 residues per subunit, means that available protein must be used economically to craft three catalytic and substrate-binding sites centred about the membrane/cytosol interface. How nature has accomplished this extraordinary feat is revealed here in a crystal structure of the kinase captured as a ternary complex with bound lipid substrate and an ATP analogue. Residues, identified as essential for activity by mutagenesis, decorate the active site and are rationalized by the ternary structure. The γ-phosphate of the ATP analogue is positioned for direct transfer to the primary hydroxyl of the lipid whose acyl chain is in the membrane. A catalytic mechanism for this unique enzyme is proposed. The active site architecture shows clear evidence of having arisen by convergent evolution.
Contributors: Li, Dianfan (Author) / Stansfeld, Phillip J. (Author) / Sansom, Mark S. P. (Author) / Keogh, Aaron (Author) / Vogeley, Lutz (Author) / Howe, Nicole (Author) / Lyons, Joseph A. (Author) / Aragao, David (Author) / Fromme, Petra (Author) / Fromme, Raimund (Author) / Basu, Shibom (Author) / Grotjohann, Ingo (Author) / Kupitz, Christopher (Author) / Rendek, Kimberley (Author) / Weierstall, Uwe (Author) / Zatsepin, Nadia (Author) / Cherezov, Vadim (Author) / Liu, Wei (Author) / Bandaru, Sateesh (Author) / English, Niall J. (Author) / Gati, Cornelius (Author) / Barty, Anton (Author) / Yefanov, Oleksandr (Author) / Chapman, Henry N. (Author) / Diederichs, Kay (Author) / Messerschmidt, Marc (Author) / Boutet, Sebastien (Author) / Williams, Garth J. (Author) / Seibert, M. Marvin (Author) / Caffrey, Martin (Author) / College of Liberal Arts and Sciences (Contributor) / School of Molecular Sciences (Contributor) / Biodesign Institute (Contributor) / Applied Structural Discovery (Contributor) / Department of Physics (Contributor)
Created: 2015-12-17

Description
Phytochromes are a family of photoreceptors that control light responses of plants, fungi and bacteria. A sequence of structural changes, which is not yet fully understood, leads to activation of an output domain. Time-resolved serial femtosecond crystallography (SFX) can potentially shed light on these conformational changes. Here we report the room temperature crystal structure of the chromophore-binding domains of the Deinococcus radiodurans phytochrome at 2.1 Å resolution. The structure was obtained by serial femtosecond X-ray crystallography from microcrystals at an X-ray free electron laser. We find overall good agreement with a crystal structure at 1.35 Å resolution derived from conventional crystallography at cryogenic temperatures, which we also report here. The thioether linkage between chromophore and protein is subject to positional ambiguity at the synchrotron, but is fully resolved with SFX. The study paves the way for time-resolved structural investigations of the phytochrome photocycle with time-resolved SFX.
Contributors: Edlund, Petra (Author) / Takala, Heikki (Author) / Claesson, Elin (Author) / Henry, Leocadie (Author) / Dods, Robert (Author) / Lehtivuori, Heli (Author) / Panman, Matthijs (Author) / Pande, Kanupriya (Author) / White, Thomas (Author) / Nakane, Takanori (Author) / Berntsson, Oskar (Author) / Gustavsson, Emil (Author) / Bath, Petra (Author) / Modi, Vaibhav (Author) / Roy Chowdhury, Shatabdi (Author) / Zook, James (Author) / Berntsen, Peter (Author) / Pandey, Suraj (Author) / Poudyal, Ishwor (Author) / Tenboer, Jason (Author) / Kupitz, Christopher (Author) / Barty, Anton (Author) / Fromme, Petra (Author) / Koralek, Jake D. (Author) / Tanaka, Tomoyuki (Author) / Spence, John (Author) / Liang, Mengning (Author) / Hunter, Mark S. (Author) / Boutet, Sebastien (Author) / Nango, Eriko (Author) / Moffat, Keith (Author) / Groenhof, Gerrit (Author) / Ihalainen, Janne (Author) / Stojkovic, Emina A. (Author) / Schmidt, Marius (Author) / Westenhoff, Sebastian (Author) / College of Liberal Arts and Sciences (Contributor) / School of Molecular Sciences (Contributor) / Biodesign Institute (Contributor) / Applied Structural Discovery (Contributor) / Department of Physics (Contributor)
Created: 2016-10-19

Description
Antibodies are essential for structural determinations and functional studies of membrane proteins, but antibody generation is limited by the availability of properly-folded and purified antigen. We describe the first application of genetic immunization to a structurally diverse set of membrane proteins to show that immunization of mice with DNA alone produced antibodies against 71% (n = 17) of the bacterial and viral targets. Antibody production correlated with prior reports of target immunogenicity in host organisms, underscoring the efficiency of this DNA-gold micronanoplex approach. To generate each antigen for antibody characterization, we also developed a simple in vitro membrane protein expression and capture method. Antibody specificity was demonstrated upon identifying, for the first time, membrane-directed heterologous expression of the native sequences of the FopA and FTT1525 virulence determinants from the select agent Francisella tularensis SCHU S4. These approaches will accelerate future structural and functional investigations of therapeutically-relevant membrane proteins.
Contributors: Hansen, Debra (Author) / Robida, Mark (Author) / Craciunescu, Felicia (Author) / Loskutov, Andrey (Author) / Dorner, Katerina (Author) / Rodenberry, John-Charles (Author) / Wang, Xiao (Author) / Olson, Tien (Author) / Patel, Hetal (Author) / Fromme, Petra (Author) / Sykes, Kathryn (Author) / Biodesign Institute (Contributor) / Innovations in Medicine (Contributor) / Applied Structural Discovery (Contributor) / College of Liberal Arts and Sciences (Contributor) / School of Molecular Sciences (Contributor)
Created: 2016-02-24

Description
Serial femtosecond crystallography (SFX) using X-ray free-electron laser sources is an emerging method with considerable potential for time-resolved pump-probe experiments. Here we present a lipidic cubic phase SFX structure of the light-driven proton pump bacteriorhodopsin (bR) to 2.3 Å resolution and a method to investigate protein dynamics with modest sample requirement. Time-resolved SFX (TR-SFX) with a pump-probe delay of 1 ms yields difference Fourier maps compatible with the dark to M state transition of bR. Importantly, the method is very sample efficient and reduces sample consumption to about 1 mg per collected time point. Accumulation of M intermediate within the crystal lattice is confirmed by time-resolved visible absorption spectroscopy. This study provides an important step towards characterizing the complete photocycle dynamics of retinal proteins and demonstrates the feasibility of a sample efficient viscous medium jet for TR-SFX.
Contributors: Nogly, Przemyslaw (Author) / Panneels, Valerie (Author) / Nelson, Garrett (Author) / Gati, Cornelius (Author) / Kimura, Tetsunari (Author) / Milne, Christopher (Author) / Milathianaki, Despina (Author) / Kubo, Minoru (Author) / Wu, Wenting (Author) / Conrad, Chelsie (Author) / Coe, Jesse (Author) / Bean, Richard (Author) / Zhao, Yun (Author) / Bath, Petra (Author) / Dods, Robert (Author) / Harimoorthy, Rajiv (Author) / Beyerlein, Kenneth R. (Author) / Rheinberger, Jan (Author) / James, Daniel (Author) / Deponte, Daniel (Author) / Li, Chufeng (Author) / Sala, Leonardo (Author) / Williams, Garth J. (Author) / Hunter, Mark S. (Author) / Koglin, Jason E. (Author) / Berntsen, Peter (Author) / Nango, Eriko (Author) / Iwata, So (Author) / Chapman, Henry N. (Author) / Fromme, Petra (Author) / Frank, Matthias (Author) / Abela, Rafael (Author) / Boutet, Sebastien (Author) / Barty, Anton (Author) / White, Thomas A. (Author) / Weierstall, Uwe (Author) / Spence, John (Author) / Neutze, Richard (Author) / Schertler, Gebhard (Author) / Standfuss, Jorg (Author) / College of Liberal Arts and Sciences (Contributor) / Department of Physics (Contributor) / Department of Chemistry and Biochemistry (Contributor) / Biodesign Institute (Contributor) / Applied Structural Discovery (Contributor) / School of Molecular Sciences (Contributor)
Created: 2016-08-22

Description
The entire history of HIV-1 is hidden in its ten thousand bases, where information regarding its evolutionary traversal through the human population can only be unlocked with fine-scale sequence analysis. Measurable footprints of mutation and recombination have imparted upon us a wealth of knowledge, from multiple chimpanzee-to-human transmissions to patterns of neutralizing antibody and drug resistance. Extracting maximum understanding from such diverse data can only be accomplished by analyzing the viral population from many angles. This body of work explores two primary aspects of HIV sequence evolution, point mutation and recombination, through cross-sectional (inter-individual) and longitudinal (intra-individual) investigations, respectively. Cross-sectional Analysis: The role of Haiti in the subtype B pandemic has been hotly debated for years; while there have been many studies, up to this point, no one has incorporated the well-known mechanism of retroviral recombination into their biological model. Prior to the use of recombination detection, multiple analyses produced trees where subtype B appears to have first entered Haiti, followed by a jump into the rest of the world. The results presented here contest the Haiti-first theory of the pandemic and instead suggest simultaneous entries of subtype B into Haiti and the rest of the world. Longitudinal Analysis: Potential N-linked glycosylation sites (PNGS) are the most evolutionarily dynamic component of one of the most evolutionarily dynamic proteins known to date. While the number of mutations associated with the increase or decrease of PNGS frequency over time is high, there are a set of relatively stable sites that persist within and between longitudinally sampled individuals. Here, I identify the most conserved stable PNGSs and suggest their potential roles in host-virus interplay. 
In addition, I have identified, for the first time, what may be a gp-120-based environmental preference for N-linked glycosylation sites.
Contributors: Hepp, Crystal Marie, 1981- (Author) / Rosenberg, Michael S. (Thesis advisor) / Hedrick, Philip (Committee member) / Escalante, Ananias (Committee member) / Kumar, Sudhir (Committee member) / Arizona State University (Publisher)
Created: 2013

Description
We propose a novel solution to prevent cancer by developing a prophylactic cancer vaccine. Several sources of antigens for cancer vaccines have been published. Among these, antigens that contain a frame-shift (FS) peptide or viral peptide are quite attractive for a variety of reasons. FS sequences, arising from mistakes either in RNA processing or in genomic DNA, may lead to the generation of neo-peptides that are foreign to the immune system. Viral peptides presumably would originate from exogenous but integrated viral nucleic acid sequences. Both are non-self and therefore lessen concerns about the development of autoimmunity. I have developed a bioinformatic approach to identify these aberrant transcripts in the cancer transcriptome. Their suitability for use in a vaccine is evaluated by establishing their frequencies and predicting possible epitopes along with their population coverage according to the prevalence of major histocompatibility complex (MHC) types. Viral transcripts and transcripts with FS mutations from gene fusion, insertion/deletion at coding microsatellite DNA, and alternative splicing were identified in the NCBI Expressed Sequence Tag (EST) database. 48 FS chimeric transcripts were validated in 50 breast cell lines and 68 primary breast tumor samples, with frequencies from 4% to 98%, by RT-PCR and sequencing confirmation. These 48 FS peptides, if translated and presented, could be used to protect more than 90% of the population in Northern America based on the prediction of epitopes derived from them. Furthermore, we synthesized 150 peptides that correspond to FS and viral peptides that we predicted would exist in tumor patients and we tested over 200 different cancer patient sera. We found a number of serologically reactive peptide sequences in cancer patients that had little to no reactivity in healthy controls, which strongly supports our bioinformatic approach.
This study describes a process used to identify aberrant transcripts that lead to a new source of antigens that can be tested and used in a prophylactic cancer vaccine. The vast amount of transcriptome data of various cancers from the Cancer Genome Atlas (TCGA) project will enhance our ability to further select better cancer antigen candidates.
Contributors: Lee, HoJoon (Author) / Johnston, Stephen A. (Thesis advisor) / Kumar, Sudhir (Committee member) / Miller, Laurence (Committee member) / Stafford, Phillip (Committee member) / Sykes, Kathryn (Committee member) / Arizona State University (Publisher)
Created: 2012

Description
Studies of ancient pathogens are moving beyond simple confirmatory analysis of diseased bone; bioarchaeologists and ancient geneticists are posing nuanced questions and utilizing novel methods capable of confronting the debates surrounding pathogen origins and evolution, and the relationships between humans and disease in the past. This dissertation examines two ancient human diseases through molecular and bioarchaeological lines of evidence, relying on techniques in paleogenetics and phylogenetics to detect, isolate, sequence and analyze ancient and modern pathogen DNA within an evolutionary framework. Specifically, this research addresses outstanding issues regarding a) the evolution, origin and phylogenetic placement of the pathogen causing skeletal tuberculosis in the New World prior to European contact, and b) the phylogeny and origins of the parasite causing the human leishmaniasis disease complex. An additional chapter presents a review of the major technological and theoretical advances in ancient pathogen genomics to frame the contributions of this work within a rapidly developing field. This overview emphasizes that understanding the evolution of human disease is critical to contextualizing relationships between humans and pathogens, and the epidemiological shifts observed both in the past and in the present era of (re)emerging infectious diseases. These questions continue to be at the forefront of not only pathogen research, but also bioarchaeological and paleopathological scholarship.
Contributors: Harkins, Kelly M (Author) / Buikstra, Jane E. (Thesis advisor) / Stone, Anne C (Thesis advisor) / Knudson, Kelly (Committee member) / Kumar, Sudhir (Committee member) / Krause, Johannes (Committee member) / Arizona State University (Publisher)
Created: 2014