
The transmission dynamics of Tuberculosis (TB) involve complex epidemiological and socio-economical interactions between individuals living in highly distinct regional conditions. The level of exogenous reinfection and first time infection rates within high-incidence settings may influence the impact of control programs on TB prevalence. The impact that effective population size and the distribution of individuals’ residence times in different patches have on TB transmission and control are studied using selected scenarios where risk is defined by the estimated or perceive first time infection and/or exogenous re-infection rates.
Methods
This study aims at enhancing the understanding of TB dynamics, within simplified, two patch, risk-defined environments, in the presence of short term mobility and variations in reinfection and infection rates via a mathematical model. The modeling framework captures the role of individuals’ ‘daily’ dynamics within and between places of residency, work or business via the average proportion of time spent in residence and as visitors to TB-risk environments (patches). As a result, the effective population size of Patch i (home of i-residents) at time t must account for visitors and residents of Patch i, at time t.
Results
The study identifies critical social behaviors mechanisms that can facilitate or eliminate TB infection in vulnerable populations. The results suggest that short-term mobility between heterogeneous patches contributes to significant overall increases in TB prevalence when risk is considered only in terms of direct new infection transmission, compared to the effect of exogenous reinfection. Although, the role of exogenous reinfection increases the risk that come from large movement of individuals, due to catastrophes or conflict, to TB-free areas.
Conclusions
The study highlights that allowing infected individuals to move from high to low TB prevalence areas (for example via the sharing of treatment and isolation facilities) may lead to a reduction in the total TB prevalence in the overall population. The higher the population size heterogeneity between distinct risk patches, the larger the benefit (low overall prevalence) under the same “traveling” patterns. Policies need to account for population specific factors (such as risks that are inherent with high levels of migration, local and regional mobility patterns, and first time infection rates) in order to be long lasting, effective and results in low number of drug resistant cases.

The maintenance of chromosomal integrity is an essential task of every living organism and cellular repair mechanisms exist to guard against insults to DNA. Given the importance of this process, it is expected that DNA repair proteins would be evolutionarily conserved, exhibiting very minimal sequence change over time. However, BRCA1, an essential gene involved in DNA repair, has been reported to be evolving rapidly despite the fact that many protein-altering mutations within this gene convey a significantly elevated risk for breast and ovarian cancers.
Results
To obtain a deeper understanding of the evolutionary trajectory of BRCA1, we analyzed complete BRCA1 gene sequences from 23 primate species. We show that specific amino acid sites have experienced repeated selection for amino acid replacement over primate evolution. This selection has been focused specifically on humans and our closest living relatives, chimpanzees (Pan troglodytes) and bonobos (Pan paniscus). After examining BRCA1 polymorphisms in 7 bonobo, 44 chimpanzee, and 44 rhesus macaque (Macaca mulatta) individuals, we find considerable variation within each of these species and evidence for recent selection in chimpanzee populations. Finally, we also sequenced and analyzed BRCA2 from 24 primate species and find that this gene has also evolved under positive selection.
Conclusions
While mutations leading to truncated forms of BRCA1 are clearly linked to cancer phenotypes in humans, there is also an underlying selective pressure in favor of amino acid-altering substitutions in this gene. A hypothesis where viruses are the drivers of this natural selection is discussed.

Learning Sparse Representations for Fruit-Fly Gene Expression Pattern Image Annotation and Retrieval
Fruit fly embryogenesis is one of the best understood animal development systems, and the spatiotemporal gene expression dynamics in this process are captured by digital images. Analysis of these high-throughput images will provide novel insights into the functions, interactions, and networks of animal genes governing development. To facilitate comparative analysis, web-based interfaces have been developed to conduct image retrieval based on body part keywords and images. Currently, the keyword annotation of spatiotemporal gene expression patterns is conducted manually. However, this manual practice does not scale with the continuously expanding collection of images. In addition, existing image retrieval systems based on the expression patterns may be made more accurate using keywords.
Results
In this article, we adapt advanced data mining and computer vision techniques to address the key challenges in annotating and retrieving fruit fly gene expression pattern images. To boost the performance of image annotation and retrieval, we propose representations integrating spatial information and sparse features, overcoming the limitations of prior schemes.
Conclusions
We perform systematic experimental studies to evaluate the proposed schemes in comparison with current methods. Experimental results indicate that the integration of spatial information and sparse features lead to consistent performance improvement in image annotation, while for the task of retrieval, sparse features alone yields better results.

Drosophila melanogaster has been established as a model organism for investigating the developmental gene interactions. The spatio-temporal gene expression patterns of Drosophila melanogaster can be visualized by in situ hybridization and documented as digital images. Automated and efficient tools for analyzing these expression images will provide biological insights into the gene functions, interactions, and networks. To facilitate pattern recognition and comparison, many web-based resources have been created to conduct comparative analysis based on the body part keywords and the associated images. With the fast accumulation of images from high-throughput techniques, manual inspection of images will impose a serious impediment on the pace of biological discovery. It is thus imperative to design an automated system for efficient image annotation and comparison.
Results
We present a computational framework to perform anatomical keywords annotation for Drosophila gene expression images. The spatial sparse coding approach is used to represent local patches of images in comparison with the well-known bag-of-words (BoW) method. Three pooling functions including max pooling, average pooling and Sqrt (square root of mean squared statistics) pooling are employed to transform the sparse codes to image features. Based on the constructed features, we develop both an image-level scheme and a group-level scheme to tackle the key challenges in annotating Drosophila gene expression pattern images automatically. To deal with the imbalanced data distribution inherent in image annotation tasks, the undersampling method is applied together with majority vote. Results on Drosophila embryonic expression pattern images verify the efficacy of our approach.
Conclusion
In our experiment, the three pooling functions perform comparably well in feature dimension reduction. The undersampling with majority vote is shown to be effective in tackling the problem of imbalanced data. Moreover, combining sparse coding and image-level scheme leads to consistent performance improvement in keywords annotation.
In completing this thesis project, I attempted to hypothesize the trigger in my own personal diagnosis of type 1 diabetes through literature research as well as further research on viruses and their contribution to autoimmune disorders. I had previously hypothesized that, based on my own family life, type 1 diabetes could possibly be a non-heritable disease despite its consistent inheritance pattern discovered by researchers; however, the research presented in this thesis project rejects this idea and supports the theory that I may have been previously susceptible to this disorder and would have developed type 1 diabetes naturally. There were multiple viruses discovered during the literature research conducted that could possibly have been triggers in the acceleration of my disease. The major link between enteroviruses and autoimmune disorders was discovered, as well as influenza A and SARS-COV-2 and this is explained further in this project.
Differences between basic and applied research were explored through a wet-lab case study. Vaccinia virus (VACV) infections are a prime model of the competition between a virus and its host. VACV contains a gene that is highly evasive of the host immune system, gene E3L. The protein encoded by E3L is E3, which contains two highly conserved regions, a C-terminus, and a N-terminus. While the C-terminus is well-understood, the mechanism by which the N-terminus grants IFN resistance was previously unknown. This project demonstrated that the N-terminus prevents the initiation of programmed necrosis through host-encoded cellular proteins RIP3 and DAI. These findings provide insight into the function of the N-terminus of E3, as well as the unique functions of induced programmed necrosis.
This project was an example of “basic” research. However, it highlights the interconnectivity of basic and applied research and the danger in isolating both projects and perspectives. It points to the difficult decisions that must be made in science, and the need for a better research classification system that considers what makes science “good” outside of antiquated social class ideologies that have shaped science since ancient Greece. While there are no easy answers to determine what makes research “good,” thinking critically about the types of research projects that will be pursued, and the effects that research has on both science and society, will raise awareness, initiate new conversations, and encourage more dialogue about science in the 21st century.