FlyBase (flybase.org) is an essential online database for researchers using Drosophila melanogaster as a model organism, facilitating access to a diverse array of information that includes genetic, molecular, genomic and reagent resources. Here, we describe the introduction of several new features at FlyBase, including Pathway Reports, paralog information, disease models based on orthology, customizable tables within reports and overview displays ('ribbons') of expression and disease data. We also describe a variety of recent important updates, including incorporation of a developmental proteome, upgrades to the GAL4 search tab, additional Experimental Tool Reports, migration to JBrowse for genome browsing and improvements to batch queries/downloads and the Fast-Track Your Paper tool.
Precise genome editing is a valuable tool to study gene function in model organisms. Prime editing, a precise editing system developed in mammalian cells, does not require double-strand breaks or donor DNA and has low off-target effects. Here, we applied prime editing for the model organism and developed conditions for optimal editing. By expressing prime editing components in cultured cells or somatic cells of transgenic flies, we precisely introduce premature stop codons in three classical visible marker genes, , , and Furthermore, by restricting editing to germ cells, we demonstrate efficient germ-line transmission of a precise edit in to 36% of progeny. Our results suggest that prime editing is a useful system in to study gene function, such as engineering precise point mutations, deletions, or epitope tags.
The FlyRNAi database at the Drosophila RNAi Screening Center and Transgenic RNAi Project (DRSC/TRiP) provides a suite of online resources that facilitate functional genomics studies with a special emphasis on Drosophila melanogaster. Currently, the database provides: gene-centric resources that facilitate ortholog mapping and mining of information about orthologs in common genetic model species; reagent-centric resources that help researchers identify RNAi and CRISPR sgRNA reagents or designs; and data-centric resources that facilitate visualization and mining of transcriptomics data, protein modification data, protein interactions, and more. Here, we discuss updated and new features that help biological and biomedical researchers efficiently identify, visualize, analyze, and integrate information and data for Drosophila and other species. Together, these resources facilitate multiple steps in functional genomics workflows, from building gene and reagent lists to management, analysis, and integration of data.
The accumulation of biological and biomedical literature outpaces the ability of most researchers and clinicians to stay abreast of their own immediate fields, let alone a broader range of topics. Although available search tools support identification of relevant literature, finding relevant and key publications is not always straightforward. For example, important publications might be missed in searches with an official gene name due to gene synonyms. Moreover, ambiguity of gene names can result in retrieval of a large number of irrelevant publications. To address these issues and help researchers and physicians quickly identify relevant publications, we developed BioLitMine, an advanced literature mining tool that takes advantage of the medical subject heading (MeSH) index and gene-to-publication annotations already available for PubMed literature. Using BioLitMine, a user can identify what MeSH terms are represented in the set of publications associated with a given gene of the interest, or start with a term and identify relevant publications. Users can also use the tool to find co-cited genes and a build a literature co-citation network. In addition, BioLitMine can help users build a gene list relevant to a MeSH term, such as a list of genes relevant to "stem cells" or "breast neoplasms." Users can also start with a gene or pathway of interest and identify authors associated with that gene or pathway, a feature that makes it easier to identify experts who might serve as collaborators or reviewers. Altogether, BioLitMine extends the value of PubMed-indexed literature and its existing expert curation by providing a robust and gene-centric approach to retrieval of relevant information.
Aging is characterized by extensive metabolic reprogramming. To identify metabolic pathways associated with aging, we analyzed age-dependent changes in the metabolomes of long-lived . Among the metabolites that changed, levels of tyrosine were increased with age in long-lived flies. We demonstrate that the levels of enzymes in the tyrosine degradation pathway increase with age in wild-type flies. Whole-body and neuronal-specific downregulation of enzymes in the tyrosine degradation pathway significantly extends lifespan, causes alterations of metabolites associated with increased lifespan, and upregulates the levels of tyrosine-derived neuromediators. Moreover, feeding wild-type flies with tyrosine increased their lifespan. Mechanistically, we show that suppression of ETC complex I drives the upregulation of enzymes in the tyrosine degradation pathway, an effect that can be rescued by tigecycline, an FDA-approved drug that specifically suppresses mitochondrial translation. In addition, tyrosine supplementation partially rescued lifespan of flies with ETC complex I suppression. Altogether, our study highlights the tyrosine degradation pathway as a regulator of longevity.
The tumor suppressor p53 regulates multiple metabolic pathways at the cellular level. However, its role in the context of a whole animal response to metabolic stress is poorly understood. Using Drosophila, we show that AMP-activated protein kinase (AMPK)-dependent Dmp53 activation is critical for sensing nutrient stress, maintaining metabolic homeostasis, and extending organismal survival. Under both nutrient deprivation and high-sugar diet, Dmp53 activation in the fat body represses expression of the Drosophila Leptin analog, Unpaired-2 (Upd2), which remotely controls Dilp2 secretion in insulin-producing cells. In starved Dmp53-depleted animals, elevated Upd2 expression in adipose cells and activation of Upd2 receptor Domeless in the brain result in sustained Dilp2 circulating levels and impaired autophagy induction at a systemic level, thereby reducing nutrient stress survival. These findings demonstrate an essential role for the AMPK-Dmp53 axis in nutrient stress responses and expand the concept that adipose tissue acts as a sensing organ that orchestrates systemic adaptation to nutrient status.
PDGF/VEGF ligands regulate a plethora of biological processes in multicellular organisms via autocrine, paracrine, and endocrine mechanisms. We investigated organ-specific metabolic roles of PDGF/VEGF-like factors (Pvfs). We combine genetic approaches and single-nuclei sequencing to demonstrate that muscle-derived Pvf1 signals to the hepatocyte-like cells/oenocytes to suppress lipid synthesis by activating the Pi3K/Akt1/TOR signaling cascade in the oenocytes. Functionally, this signaling axis regulates expansion of adipose tissue lipid stores in newly eclosed flies. Flies emerge after pupation with limited adipose tissue lipid stores and lipid level is progressively accumulated via lipid synthesis. We find that adult muscle-specific expression of increases rapidly during this stage and that muscle-to-oenocyte Pvf1 signaling inhibits expansion of adipose tissue lipid stores as the process reaches completion. Our findings provide the first evidence in a metazoan of a PDGF/VEGF ligand acting as a myokine that regulates systemic lipid homeostasis by activating TOR in hepatocyte-like cells.
Precise and efficient genome modifications provide powerful tools for biological studies. Previous CRISPR gene knockout methods in cell lines have relied on frameshifts caused by stochastic insertion/deletion in all alleles. However, this method is inefficient for genes with high copy number due to polyploidy or gene amplification because frameshifts in all alleles can be difficult to generate and detect. Here we describe a homology-directed insertion method to knockout genes in the polyploid Drosophila S2R+ cell line. This protocol allows generation of homozygous mutant cell lines using an insertion cassette which autocatalytically generates insertion mutations in all alleles. Knockout cells generated using this method can be directly identified by PCR without a need for DNA sequencing. This protocol takes 2-3 months and can be applied to other polyploid cell lines or high-copy-number genes.
The frizzled (fz) and dishevelled (dsh) genes are highly conserved members of both the planar cell polarity (PCP) pathway and the Wnt signaling pathway. Given these dual functions, several studies have examined whether Wnt ligands provide a tissue-scale orientation cue for PCP establishment during development, and these studies have reached differing conclusions. Here, we re-examine this issue in the Drosophila melanogaster wing and notum using split-Gal4 co-expression analysis, multiplex somatic CRISPR, and double RNAi experiments. Pairwise loss-of-function experiments targeting wg together with other Wnt genes, via somatic CRISPR or RNAi, do not produce PCP defects in the wing or notum. In addition, somatic CRISPR against evi (aka wntless), which is required for the secretion of Wnt ligands, did not produce detectable PCP phenotypes. Altogether, our results do not support the hypothesis that Wnt ligands contribute to PCP signaling in the Drosophila wing or notum.
The Drosophila lymph gland, the larval hematopoietic organ comprised of prohemocytes and mature hemocytes, has been a valuable model for understanding mechanisms underlying hematopoiesis and immunity. Three types of mature hemocytes have been characterized in the lymph gland: plasmatocytes, lamellocytes, and crystal cells, which are analogous to vertebrate myeloid cells, yet molecular underpinnings of the lymph gland hemocytes have been less investigated. Here, we use single-cell RNA sequencing to comprehensively analyze heterogeneity of developing hemocytes in the lymph gland, and discover previously undescribed hemocyte types including adipohemocytes, stem-like prohemocytes, and intermediate prohemocytes. Additionally, we identify the developmental trajectory of hemocytes during normal development as well as the emergence of the lamellocyte lineage following active cellular immunity caused by wasp infestation. Finally, we establish similarities and differences between embryonically derived- and larval lymph gland hemocytes. Altogether, our study provides detailed insights into the hemocyte development and cellular immune responses at single-cell resolution.
Peptide therapeutics, unlike small molecule drugs, display crucial advantages of target-specificity and the ability to block large interacting interfaces such as those of transcription factors. The transcription co-factor of the Hippo pathway, YAP/Yki, has been implicated in many cancers, and is dependent on its interaction with the DNA-binding TEAD/Sd proteins via a large Ω-loop. In addition, the mammalian Vestigial Like (VGLL) protein, specifically its TONDU domain, competitively inhibits YAP-TEAD interaction, resulting in arrest of tumor growth. Here, we show that either overexpression of the TONDU peptide or its oral uptake leads to suppression of Yorkie (Yki)-driven intestinal stem cell (ISC) tumors in the adult midgut. In addition, comparative proteomic analyses of peptide-treated and untreated tumors, together with ChIP analysis, reveal that integrin pathway members are part of the Yki-oncogenic network. Collectively, our findings establish as a reliable platform to screen for cancer oral therapeutic peptides and reveal a tumor suppressive role for integrins in Yki-driven tumors.
blood cells, called hemocytes, are classified into plasmatocytes, crystal cells, and lamellocytes based on the expression of a few marker genes and cell morphologies, which are inadequate to classify the complete hemocyte repertoire. Here, we used single-cell RNA sequencing (scRNA-seq) to map hemocytes across different inflammatory conditions in larvae. We resolved plasmatocytes into different states based on the expression of genes involved in cell cycle, antimicrobial response, and metabolism together with the identification of intermediate states. Further, we discovered rare subsets within crystal cells and lamellocytes that express fibroblast growth factor (FGF) ligand and receptor , respectively. We demonstrate that these FGF components are required for mediating effective immune responses against parasitoid wasp eggs, highlighting a novel role for FGF signaling in inter-hemocyte crosstalk. Our scRNA-seq analysis reveals the diversity of hemocytes and provides a rich resource of gene expression profiles for a systems-level understanding of their functions.
Cystic fibrosis (CF) is a recessive disease caused by mutations in the () gene. The most common symptoms include progressive lung disease and chronic digestive conditions. CF is the first human genetic disease to benefit from having five different species of animal models. Despite the phenotypic differences among the animal models and human CF, these models have provided invaluable insight into understanding disease mechanisms at the organ-system level. Here, we identify a member of the ABCC4 family, CG5789, that has the structural and functional properties expected for encoding the equivalent of human CFTR, and thus refer to it as (). We show that knockdown of in the adult intestine disrupts osmotic homeostasis and displays CF-like phenotypes that lead to intestinal stem cell hyperplasia. We also show that expression of wild-type human , but not mutant variants of CFTR that prevent plasma membrane expression, rescues the mutant phenotypes of Furthermore, we performed RNA sequencing (RNA-Seq)-based transcriptomic analysis using fly intestine and identified a mucin gene, , which is required for proper intestinal barrier protection. Altogether, our findings suggest that can be a powerful model organism for studying CF pathophysiology.
Manganese is considered essential for animal growth. Manganese ions serve as cofactors to three mitochondrial enzymes: superoxide dismutase (Sod2), arginase and glutamine synthase, and to glycosyltransferases residing in the Golgi. In Drosophila melanogaster, manganese has also been implicated in the formation of ceramide phosphoethanolamine, the insect's sphingomyelin analogue, a structural component of cellular membranes. Manganese overload leads to neurodegeneration and toxicity in both humans and Drosophila. Here, we report specific absorption and accumulation of manganese during the first week of adulthood in flies, which correlates with an increase in Sod2 activity during the same period. To test the requirement of dietary manganese for this accumulation, we generated a Drosophila model of manganese deficiency. Due to the lack of manganese-specific chelators, we used chemically defined media to grow the flies and deplete them of the metal. Dietary manganese depletion reduced Sod2 activity. We then examined gene and protein expression changes in the intestines of manganese depleted flies. We found adaptive responses to the presumed loss of known manganese-dependent enzymatic activities: less glutamine synthase activity (amination of glutamate to glutamine) was compensated by 50% reduction in glutaminase (deamination of glutamine to glutamate); less glycosyltransferase activity, predicted to reduce protein glycosylation, was compensated by 30% reduction in lysosomal mannosidases (protein deglycosylating enzymes); less ceramide phosphoethanolamine synthase activity was compensated by 30% reduction in the Drosophila sphingomyeline phospodiesterase, which could catabolize ceramide phosphoethanolamine in flies. Reduced Sod2 activity, predicted to cause superoxide-dependent iron-sulphur cluster damage, resulted in cellular iron misregulation.
Studies of the adult midgut have led to many insights in our understanding of cell-type diversity, stem cell regeneration, tissue homeostasis, and cell fate decision. Advances in single-cell RNA sequencing provide opportunities to identify new cell types and molecular features. We used single-cell RNA sequencing to characterize the transcriptome of midgut epithelial cells and identified 22 distinct clusters representing intestinal stem cells, enteroblasts, enteroendocrine cells (EEs), and enterocytes. This unbiased approach recovered most of the known intestinal stem cells/enteroblast and EE markers, highlighting the high quality of the dataset, and led to insights on intestinal stem cell biology, cell type-specific organelle features, the roles of new transcription factors in progenitors and regional variation along the gut, 5 additional EE gut hormones, EE hormonal expression diversity, and paracrine function of EEs. To facilitate mining of this rich dataset, we provide a web-based resource for visualization of gene expression in single cells. Altogether, our study provides a comprehensive resource for addressing functions of genes in the midgut epithelium.
Metabolites are increasingly appreciated for their roles as signaling molecules. To dissect the roles of metabolites, it is essential to understand their signaling pathways and their enzymatic regulations. From an RNA interference (RNAi) screen for regulators of intestinal stem cell (ISC) activity in the midgut, we identified () as a top candidate gene required for ISC proliferation. We demonstrate that Ras/MAPK and Protein Kinase A (PKA) signaling act downstream of AdoR and that Ras/MAPK mediates the major effect of AdoR on ISC proliferation. Extracellular adenosine, the ligand for AdoR, is a small metabolite that can be released by various cell types and degraded in the extracellular space by secreted adenosine deaminase. Interestingly, down-regulation of () from enterocytes is necessary for extracellular adenosine to activate AdoR and induce ISC overproliferation. As expression and its enzymatic activity decrease following tissue damage, our study provides important insights into how the enzymatic regulation of extracellular adenosine levels under tissue-damage conditions facilitates ISC proliferation.
The Transgenic RNAi Project (TRiP), a functional genomics platform at Harvard Medical School, was initiated in 2008 to generate and distribute a genome-scale collection of RNAi fly stocks. To date, the TRiP has generated >15,000 RNAi fly stocks. As this covers most genes, we have largely transitioned to development of new resources based on CRISPR technology. Here, we present an update on our libraries of publicly available RNAi and CRISPR fly stocks, and focus on the TRiP-CRISPR overexpression (TRiP-OE) and TRiP-CRISPR knockout (TRiP-KO) collections. TRiP-OE stocks express sgRNAs targeting upstream of a gene transcription start site. Gene activation is triggered by co-expression of catalytically dead Cas9 (dCas9) fused to an activator domain, either VP64-p65-Rta (VPR) or Synergistic Activation Mediator (SAM). TRiP-KO stocks express one or two sgRNAs targeting the coding sequence of a gene or genes. Cutting is triggered by co-expression of Cas9, allowing for generation of indels in both germline and somatic tissue. To date, we have generated more than 5,000 CRISPR-OE or -KO stocks for the community. These resources provide versatile, transformative tools for gene activation, gene repression, and genome engineering.
CRISPR-Cas9 is a powerful genome editing technology in which a short guide RNA (sgRNA) confers target site specificity to achieve Cas9-mediated genome editing. Numerous sgRNA design tools have been developed based on reference genomes for humans and model organisms. However, existing resources are not optimal as genetic mutations or single nucleotide polymorphisms (SNPs) within the targeting region affect the efficiency of CRISPR-based approaches by interfering with guide-target complementarity. To facilitate identification of sgRNAs (1) in non-reference genomes, (2) across varying genetic backgrounds, or (3) for specific targeting of SNP-containing alleles, for example, disease relevant mutations, we developed a web tool, SNP-CRISPR (https://www.flyrnai.org/tools/snp_crispr/). SNP-CRISPR can be used to design sgRNAs based on public variant data sets or user-identified variants. In addition, the tool computes efficiency and specificity scores for sgRNA designs targeting both the variant and the reference. Moreover, SNP-CRISPR provides the option to upload multiple SNPs and target single or multiple nearby base changes simultaneously with a single sgRNA design. Given these capabilities, SNP-CRISPR has a wide range of potential research applications in model systems and potential applications for design of sgRNAs for disease-associated mutant correction.
Targeted genomic knock-ins are a valuable tool to probe gene function. However, knock-in methods involving homology-directed repair (HDR) can be laborious. Here, we adapt the mammalian CRISPaint [clustered regularly interspaced short palindromic repeat (CRISPR)-assisted insertion tagging] homology-independent knock-in method for , which uses CRISPR/Cas9 and nonhomologous end joining to insert "universal" donor plasmids into the genome. Using this method in cultured S2R+ cells, we efficiently tagged four endogenous proteins with the bright fluorescent protein mNeonGreen, thereby demonstrating that an existing collection of CRISPaint universal donor plasmids is compatible with insect cells. In addition, we inserted the transgenesis marker into seven genes in the fly germ line, producing heritable loss-of-function alleles that were isolated by simple fluorescence screening. Unlike in cultured cells, insertions/deletions always occurred at the genomic insertion site, which prevents predictably matching the insert coding frame to the target gene. Despite this effect, we were able to isolate insertions in four genes that serve as expression reporters. Therefore, homology-independent insertion in is a fast and simple alternative to HDR that will enable researchers to dissect gene function.
The Alliance of Genome Resources (Alliance) is a consortium of the major model organism databases and the Gene Ontology that is guided by the vision of facilitating exploration of related genes in human and well-studied model organisms by providing a highly integrated and comprehensive platform that enables researchers to leverage the extensive body of genetic and genomic studies in these organisms. Initiated in 2016, the Alliance is building a central portal (www.alliancegenome.org) for access to data for the primary model organisms along with gene ontology data and human data. All data types represented in the Alliance portal (e.g. genomic data and phenotype descriptions) have common data models and workflows for curation. All data are open and freely available via a variety of mechanisms. Long-term plans for the Alliance project include a focus on coverage of additional model organisms including those without dedicated curation communities, and the inclusion of new data types with a particular focus on providing data and tools for the non-model-organism researcher that support enhanced discovery about human health and disease. Here we review current progress and present immediate plans for this new bioinformatics resource.