Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Mogno I, Kwasnieski JC, Cohen BA. Massively parallel synthetic promoter assays reveal the in vivo effects of binding site variants. Genome Res 2013;23:1908-15. [PMID: 23921661 PMCID: PMC3814890 DOI: 10.1101/gr.157891.113] [Citation(s) in RCA: 76] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

For:	Mogno I, Kwasnieski JC, Cohen BA. Massively parallel synthetic promoter assays reveal the in vivo effects of binding site variants. Genome Res 2013;23:1908-15. [PMID: 23921661 PMCID: PMC3814890 DOI: 10.1101/gr.157891.113] [Citation(s) in RCA: 76] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Number

Cited by Other Article(s)

Andreani V, South EJ, Dunlop MJ. Generating information-dense promoter sequences with optimal string packing. PLoS Comput Biol 2024;20:e1012276. [PMID: 39047028 PMCID: PMC11268586 DOI: 10.1371/journal.pcbi.1012276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2024] [Accepted: 06/25/2024] [Indexed: 07/27/2024] Open

Abstract

Dense arrangements of binding sites within nucleotide sequences can collectively influence downstream transcription rates or initiate biomolecular interactions. For example, natural promoter regions can harbor many overlapping transcription factor binding sites that influence the rate of transcription initiation. Despite the prevalence of overlapping binding sites in nature, rapid design of nucleotide sequences with many overlapping sites remains a challenge. Here, we show that this is an NP-hard problem, coined here as the nucleotide String Packing Problem (SPP). We then introduce a computational technique that efficiently assembles sets of DNA-protein binding sites into dense, contiguous stretches of double-stranded DNA. For the efficient design of nucleotide sequences spanning hundreds of base pairs, we reduce the SPP to an Orienteering Problem with integer distances, and then leverage modern integer linear programming solvers. Our method optimally packs sets of 20-100 binding sites into dense nucleotide arrays of 50-300 base pairs in 0.05-10 seconds. Unlike approximation algorithms or meta-heuristics, our approach finds provably optimal solutions. We demonstrate how our method can generate large sets of diverse sequences suitable for library generation, where the frequency of binding site usage across the returned sequences can be controlled by modulating the objective function. As an example, we then show how adding additional constraints, like the inclusion of sequence elements with fixed positions, allows for the design of bacterial promoters. The nucleotide string packing approach we present can accelerate the design of sequences with complex DNA-protein interactions. When used in combination with synthesis and high-throughput screening, this design strategy could help interrogate how complex binding site arrangements impact either gene expression or biomolecular mechanisms in varied cellular contexts.

Collapse

Posfai A, Zhou J, McCandlish DM, Kinney JB. Gauge fixing for sequence-function relationships. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.12.593772. [PMID: 38798671 PMCID: PMC11118547 DOI: 10.1101/2024.05.12.593772] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]

He J, Huo X, Pei G, Jia Z, Yan Y, Yu J, Qu H, Xie Y, Yuan J, Zheng Y, Hu Y, Shi M, You K, Li T, Ma T, Zhang MQ, Ding S, Li P, Li Y. Dual-role transcription factors stabilize intermediate expression levels. Cell 2024;187:2746-2766.e25. [PMID: 38631355 DOI: 10.1016/j.cell.2024.03.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 12/08/2023] [Accepted: 03/18/2024] [Indexed: 04/19/2024]

Affiliation(s)

Jinnan He The IDG/McGovern Institute for Brain Research, MOE Key Laboratory of Bioinformatics, State Key Lab of Molecular Oncology, Center for Synthetic and Systems Biology, Tsinghua University, Beijing 100084, China; School of Pharmaceutical Sciences, Tsinghua University, Beijing 100084, China
Xiangru Huo The IDG/McGovern Institute for Brain Research, MOE Key Laboratory of Bioinformatics, State Key Lab of Molecular Oncology, Center for Synthetic and Systems Biology, Tsinghua University, Beijing 100084, China; School of Pharmaceutical Sciences, Tsinghua University, Beijing 100084, China
Gaofeng Pei State Key Laboratory of Membrane Biology, Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua University-Peking University Joint Center for Life Sciences, Beijing 100084, China
Zeran Jia The IDG/McGovern Institute for Brain Research, MOE Key Laboratory of Bioinformatics, State Key Lab of Molecular Oncology, Center for Synthetic and Systems Biology, Tsinghua University, Beijing 100084, China; School of Pharmaceutical Sciences, Tsinghua University, Beijing 100084, China
Yiming Yan The IDG/McGovern Institute for Brain Research, MOE Key Laboratory of Bioinformatics, State Key Lab of Molecular Oncology, Center for Synthetic and Systems Biology, Tsinghua University, Beijing 100084, China; School of Pharmaceutical Sciences, Tsinghua University, Beijing 100084, China
Jiawei Yu The IDG/McGovern Institute for Brain Research, MOE Key Laboratory of Bioinformatics, State Key Lab of Molecular Oncology, Center for Synthetic and Systems Biology, Tsinghua University, Beijing 100084, China; School of Pharmaceutical Sciences, Tsinghua University, Beijing 100084, China
Haozhi Qu The IDG/McGovern Institute for Brain Research, MOE Key Laboratory of Bioinformatics, State Key Lab of Molecular Oncology, Center for Synthetic and Systems Biology, Tsinghua University, Beijing 100084, China; School of Pharmaceutical Sciences, Tsinghua University, Beijing 100084, China
Yunxin Xie The IDG/McGovern Institute for Brain Research, MOE Key Laboratory of Bioinformatics, State Key Lab of Molecular Oncology, Center for Synthetic and Systems Biology, Tsinghua University, Beijing 100084, China; School of Pharmaceutical Sciences, Tsinghua University, Beijing 100084, China
Junsong Yuan The IDG/McGovern Institute for Brain Research, MOE Key Laboratory of Bioinformatics, State Key Lab of Molecular Oncology, Center for Synthetic and Systems Biology, Tsinghua University, Beijing 100084, China; School of Pharmaceutical Sciences, Tsinghua University, Beijing 100084, China
Yuan Zheng The IDG/McGovern Institute for Brain Research, MOE Key Laboratory of Bioinformatics, State Key Lab of Molecular Oncology, Center for Synthetic and Systems Biology, Tsinghua University, Beijing 100084, China; School of Pharmaceutical Sciences, Tsinghua University, Beijing 100084, China
Yanyan Hu School of Pharmaceutical Sciences, Tsinghua University, Beijing 100084, China; Tsinghua University-Peking University Joint Center for Life Sciences, Beijing 100084, China
Minglei Shi Bioinformatics Division, National Research Center for Information Science and Technology, School of Medicine, Tsinghua University, Beijing 100084, China
Kaiqiang You Department of Biomedical Informatics, School of Basic Medical Sciences, Peking University Health Science Center, Beijing 100191, China
Tingting Li Department of Biomedical Informatics, School of Basic Medical Sciences, Peking University Health Science Center, Beijing 100191, China
Tianhua Ma School of Pharmaceutical Sciences, Tsinghua University, Beijing 100084, China; Tsinghua University-Peking University Joint Center for Life Sciences, Beijing 100084, China
Michael Q Zhang Bioinformatics Division, National Research Center for Information Science and Technology, School of Medicine, Tsinghua University, Beijing 100084, China; Department of Biological Sciences, Center for Systems Biology, The University of Texas, Dallas, TX 75080-3021, USA
Sheng Ding School of Pharmaceutical Sciences, Tsinghua University, Beijing 100084, China; Tsinghua University-Peking University Joint Center for Life Sciences, Beijing 100084, China
Pilong Li State Key Laboratory of Membrane Biology, Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua University-Peking University Joint Center for Life Sciences, Beijing 100084, China.
Yinqing Li The IDG/McGovern Institute for Brain Research, MOE Key Laboratory of Bioinformatics, State Key Lab of Molecular Oncology, Center for Synthetic and Systems Biology, Tsinghua University, Beijing 100084, China; School of Pharmaceutical Sciences, Tsinghua University, Beijing 100084, China.

Collapse

Liu J, Ashuach T, Inoue F, Ahituv N, Yosef N, Kreimer A. Optimizing sequence design strategies for perturbation MPRAs: a computational evaluation framework. Nucleic Acids Res 2024;52:1613-1627. [PMID: 38296821 PMCID: PMC10939410 DOI: 10.1093/nar/gkae012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Revised: 12/26/2023] [Accepted: 01/12/2024] [Indexed: 02/02/2024] Open

Kwak IY, Kim BC, Lee J, Kang T, Garry DJ, Zhang J, Gong W. Proformer: a hybrid macaron transformer model predicts expression values from promoter sequences. BMC Bioinformatics 2024;25:81. [PMID: 38378442 PMCID: PMC10877777 DOI: 10.1186/s12859-024-05645-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Accepted: 01/08/2024] [Indexed: 02/22/2024] Open

Andreani V, South EJ, Dunlop MJ. Generating information-dense promoter sequences with optimal string packing. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.11.01.565124. [PMID: 37961203 PMCID: PMC10635063 DOI: 10.1101/2023.11.01.565124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]

Abstract

Dense arrangements of binding sites within nucleotide sequences can collectively influence downstream transcription rates or initiate biomolecular interactions. For example, natural promoter regions can harbor many overlapping transcription factor binding sites that influence the rate of transcription initiation. Despite the prevalence of overlapping binding sites in nature, rapid design of nucleotide sequences with many overlapping sites remains a challenge. Here, we show that this is an NP-hard problem, coined here as the nucleotide String Packing Problem (SPP). We then introduce a computational technique that efficiently assembles sets of DNA-protein binding sites into dense, contiguous stretches of double-stranded DNA. For the efficient design of nucleotide sequences spanning hundreds of base pairs, we reduce the SPP to an Orienteering Problem with integer distances, and then leverage modern integer linear programming solvers. Our method optimally packs libraries of 20-100 binding sites into dense nucleotide arrays of 50-300 base pairs in 0.05-10 seconds. Unlike approximation algorithms or meta-heuristics, our approach finds provably optimal solutions. We demonstrate how our method can generate large sets of diverse sequences suitable for library generation, where the frequency of binding site usage across the returned sequences can be controlled by modulating the objective function. As an example, we then show how adding additional constraints, like the inclusion of sequence elements with fixed positions, allows for the design of bacterial promoters. The nucleotide string packing approach we present can accelerate the design of sequences with complex DNA-protein interactions. When used in combination with synthesis and high-throughput screening, this design strategy could help interrogate how complex binding site arrangements impact either gene expression or biomolecular mechanisms in varied cellular contexts.

Author Summary

The way protein binding sites are arranged on DNA can control the regulation and transcription of downstream genes. Areas with a high concentration of binding sites can enable complex interplay between transcription factors, a feature that is exploited by natural promoters. However, designing synthetic promoters that contain dense arrangements of binding sites is a challenge. The task involves overlapping many binding sites, each typically about 10 nucleotides long, within a constrained sequence area, which becomes increasingly difficult as sequence length decreases, and binding site variety increases. We introduce an approach to design nucleotide sequences with optimally packed protein binding sites, which we call the nucleotide String Packing Problem (SPP). We show that the SPP can be solved efficiently using integer linear programming to identify the densest arrangements of binding sites for a specified sequence length. We show how adding additional constraints, like the inclusion of sequence elements with fixed positions, allows for the design of bacterial promoters. The presented approach enables the rapid design and study of nucleotide sequences with complex, dense binding site architectures.

Collapse

Loell KJ, Friedman RZ, Myers CA, Corbo JC, Cohen BA, White MA. Transcription factor interactions explain the context-dependent activity of CRX binding sites. PLoS Comput Biol 2024;20:e1011802. [PMID: 38227575 PMCID: PMC10817189 DOI: 10.1371/journal.pcbi.1011802] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 01/26/2024] [Accepted: 01/06/2024] [Indexed: 01/18/2024] Open

Kleinschmidt H, Xu C, Bai L. Using Synthetic DNA Libraries to Investigate Chromatin and Gene Regulation. Chromosoma 2023;132:167-189. [PMID: 37184694 PMCID: PMC10542970 DOI: 10.1007/s00412-023-00796-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2023] [Revised: 04/25/2023] [Accepted: 04/26/2023] [Indexed: 05/16/2023]

Tack DS, Tonner PD, Pressman A, Olson ND, Levy SF, Romantseva EF, Alperovich N, Vasilyeva O, Ross D. Precision engineering of biological function with large-scale measurements and machine learning. PLoS One 2023;18:e0283548. [PMID: 36989327 PMCID: PMC10057847 DOI: 10.1371/journal.pone.0283548] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Accepted: 03/11/2023] [Indexed: 03/30/2023] Open

Cooper YA, Guo Q, Geschwind DH. Multiplexed functional genomic assays to decipher the noncoding genome. Hum Mol Genet 2022;31:R84-R96. [PMID: 36057282 PMCID: PMC9585676 DOI: 10.1093/hmg/ddac194] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2022] [Revised: 08/08/2022] [Accepted: 08/09/2022] [Indexed: 11/14/2022] Open

Shahein A, López-Malo M, Istomin I, Olson EJ, Cheng S, Maerkl SJ. Systematic analysis of low-affinity transcription factor binding site clusters in vitro and in vivo establishes their functional relevance. Nat Commun 2022;13:5273. [PMID: 36071116 PMCID: PMC9452512 DOI: 10.1038/s41467-022-32971-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Accepted: 08/25/2022] [Indexed: 11/10/2022] Open

Perkins ML, Gandara L, Crocker J. A synthetic synthesis to explore animal evolution and development. Philos Trans R Soc Lond B Biol Sci 2022;377:20200517. [PMID: 35634925 PMCID: PMC9149795 DOI: 10.1098/rstb.2020.0517] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open

Tareen A, Kooshkbaghi M, Posfai A, Ireland WT, McCandlish DM, Kinney JB. MAVE-NN: learning genotype-phenotype maps from multiplex assays of variant effect. Genome Biol 2022;23:98. [PMID: 35428271 PMCID: PMC9011994 DOI: 10.1186/s13059-022-02661-7] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Accepted: 03/24/2022] [Indexed: 12/17/2022] Open

He N, Wang W, Fang C, Tan Y, Li L, Hou C. Integration of Count Difference and Curve Similarity in Negative Regulatory Element Detection. Front Genet 2022;13:818344. [PMID: 35251128 PMCID: PMC8896116 DOI: 10.3389/fgene.2022.818344] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Accepted: 01/20/2022] [Indexed: 12/05/2022] Open

Anderson DA, Voigt CA. Competitive dCas9 binding as a mechanism for transcriptional control. Mol Syst Biol 2021;17:e10512. [PMID: 34747560 PMCID: PMC8574044 DOI: 10.15252/msb.202110512] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Revised: 10/10/2021] [Accepted: 10/11/2021] [Indexed: 12/24/2022] Open

Shih CH, Fay J. Cis-regulatory variants affect gene expression dynamics in yeast. eLife 2021;10:e68469. [PMID: 34369376 PMCID: PMC8367379 DOI: 10.7554/elife.68469] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2021] [Accepted: 08/06/2021] [Indexed: 12/14/2022] Open

Lee D, Kapoor A, Lee C, Mudgett M, Beer MA, Chakravarti A. Sequence-based correction of barcode bias in massively parallel reporter assays. Genome Res 2021;31:1638-1645. [PMID: 34285053 PMCID: PMC8415370 DOI: 10.1101/gr.268599.120] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2020] [Accepted: 07/07/2021] [Indexed: 11/24/2022]

Letiagina AE, Omelina ES, Ivankin AV, Pindyurin AV. MPRAdecoder: Processing of the Raw MPRA Data With a priori Unknown Sequences of the Region of Interest and Associated Barcodes. Front Genet 2021;12:618189. [PMID: 34046055 PMCID: PMC8148044 DOI: 10.3389/fgene.2021.618189] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Accepted: 03/25/2021] [Indexed: 11/13/2022] Open

Yu TC, Liu WL, Brinck MS, Davis JE, Shek J, Bower G, Einav T, Insigne KD, Phillips R, Kosuri S, Urtecho G. Multiplexed characterization of rationally designed promoter architectures deconstructs combinatorial logic for IPTG-inducible systems. Nat Commun 2021;12:325. [PMID: 33436562 PMCID: PMC7804116 DOI: 10.1038/s41467-020-20094-3] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2020] [Accepted: 11/04/2020] [Indexed: 12/21/2022] Open

Affiliation(s)

Timothy C Yu Department of Bioengineering, University of California, Los Angeles, CA, 90095, USA
Winnie L Liu Department of Molecular, Cell, and Developmental Biology, University of California, Los Angeles, CA, 90095, USA
Marcia S Brinck Department of Microbiology, Immunology, and Molecular Genetics, University of California, Los Angeles, CA, 90095, USA
Jessica E Davis Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, 90095, USA
Jeremy Shek Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, 90095, USA
Grace Bower Department of Molecular, Cell, and Developmental Biology, University of California, Los Angeles, CA, 90095, USA
Tal Einav Department of Physics, California Institute of Technology, Pasadena, CA, 91125, USA
Kimberly D Insigne Bioinformatics Interdepartmental Graduate Program, University of California, Los Angeles, CA, 90095, USA
Rob Phillips Department of Physics, California Institute of Technology, Pasadena, CA, 91125, USA Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, 91125, USA Department of Applied Physics, California Institute of Technology, Pasadena, CA, 91125, USA
Sriram Kosuri Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, 90095, USA. UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA, 90095, USA. Institute for Quantitative and Computational Biosciences (QCB), University of California, Los Angeles, Los Angeles, CA, 90095, USA. Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, University of California, Los Angeles, Los Angeles, CA, 90095, USA. Jonsson Comprehensive Cancer Center, University of California, Los Angeles, CA, 90095, USA. Molecular Biology Interdepartmental Doctoral Program, University of California, Los Angeles, CA, 90095, USA.
Guillaume Urtecho Molecular Biology Interdepartmental Doctoral Program, University of California, Los Angeles, CA, 90095, USA.

Collapse

Mulvey B, Lagunas T, Dougherty JD. Massively Parallel Reporter Assays: Defining Functional Psychiatric Genetic Variants Across Biological Contexts. Biol Psychiatry 2021;89:76-89. [PMID: 32843144 PMCID: PMC7938388 DOI: 10.1016/j.biopsych.2020.06.011] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/11/2020] [Revised: 06/09/2020] [Accepted: 06/10/2020] [Indexed: 12/18/2022]

Abstract

Neuropsychiatric phenotypes have long been known to be influenced by heritable risk factors, directly confirmed by the past decade of genetic studies that have revealed specific genetic variants enriched in disease cohorts. However, the initial hope that a small set of genes would be responsible for a given disorder proved false. The more complex reality is that a given disorder may be influenced by myriad small-effect noncoding variants and/or by rare but severe coding variants, many de novo. Noncoding genomic sequences-for which molecular functions cannot usually be inferred-harbor a large portion of these variants, creating a substantial barrier to understanding higher-order molecular and biological systems of disease. Fortunately, novel genetic technologies-scalable oligonucleotide synthesis, RNA sequencing, and CRISPR (clustered regularly interspaced short palindromic repeats)-have opened novel avenues to experimentally identify biologically significant variants en masse. Massively parallel reporter assays (MPRAs) are an especially versatile technique resulting from such innovations. MPRAs are powerful molecular genetics tools that can be used to screen thousands of untranscribed or untranslated sequences and their variants for functional effects in a single experiment. This approach, though underutilized in psychiatric genetics, has several useful features for the field. We review methods for assaying putatively functional genetic variants and regions, emphasizing MPRAs and the opportunities they hold for dissection of psychiatric polygenicity. We discuss literature applying functional assays in neurogenetics, highlighting strengths, caveats, and design considerations-especially regarding disease-relevant variables (cell type, neurodevelopment, and sex), and we ultimately propose applications of MPRA to both computational and experimental neurogenetics of polygenic disease risk.

Collapse

Renganaath K, Chong R, Day L, Kosuri S, Kruglyak L, Albert FW. Systematic identification of cis-regulatory variants that cause gene expression differences in a yeast cross. eLife 2020;9:e62669. [PMID: 33179598 PMCID: PMC7685706 DOI: 10.7554/elife.62669] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Accepted: 11/11/2020] [Indexed: 02/06/2023] Open

Fuqua T, Jordan J, van Breugel ME, Halavatyi A, Tischer C, Polidoro P, Abe N, Tsai A, Mann RS, Stern DL, Crocker J. Dense and pleiotropic regulatory information in a developmental enhancer. Nature 2020;587:235-239. [PMID: 33057197 DOI: 10.1038/s41586-020-2816-5] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Accepted: 07/22/2020] [Indexed: 01/08/2023]

Hammelman J, Krismer K, Banerjee B, Gifford DK, Sherwood RI. Identification of determinants of differential chromatin accessibility through a massively parallel genome-integrated reporter assay. Genome Res 2020;30:1468-1480. [PMID: 32973041 PMCID: PMC7605270 DOI: 10.1101/gr.263228.120] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2020] [Accepted: 08/26/2020] [Indexed: 12/20/2022]

Ray JP, de Boer CG, Fulco CP, Lareau CA, Kanai M, Ulirsch JC, Tewhey R, Ludwig LS, Reilly SK, Bergman DT, Engreitz JM, Issner R, Finucane HK, Lander ES, Regev A, Hacohen N. Prioritizing disease and trait causal variants at the TNFAIP3 locus using functional and genomic features. Nat Commun 2020;11:1237. [PMID: 32144282 PMCID: PMC7060350 DOI: 10.1038/s41467-020-15022-4] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2020] [Accepted: 02/17/2020] [Indexed: 12/19/2022] Open

Affiliation(s)

John P Ray Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
Carl G de Boer Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Klarman Cell Observatory, Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
Charles P Fulco Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Department of Systems Biology, Harvard Medical School, Boston, MA, 02115, USA
Caleb A Lareau Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, MA, 02115, USA
Masahiro Kanai Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA Program in Bioinformatics and Integrative Genomics, Harvard Medical School, Boston, MA, 02115, USA
Jacob C Ulirsch Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, MA, 02115, USA
Ryan Tewhey Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, 02138, USA
Leif S Ludwig Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
Steven K Reilly Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, 02138, USA
Drew T Bergman Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
Jesse M Engreitz Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Harvard Society of Fellows, Harvard University, Cambridge, MA, 02138, USA
Robbyn Issner Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
Hilary K Finucane Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
Eric S Lander Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA Department of Systems Biology, Harvard Medical School, Boston, MA, 02115, USA Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, 02142, USA
Aviv Regev Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA. Klarman Cell Observatory, Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA. Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, 02142, USA. Howard Hughes Medical Institute, Cambridge, MA, 02142, USA.
Nir Hacohen Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA. Center for Cancer Research, Massachusetts General Hospital, Boston, MA, 02114, USA.

Collapse

King DM, Hong CKY, Shepherdson JL, Granas DM, Maricque BB, Cohen BA. Synthetic and genomic regulatory elements reveal aspects of cis-regulatory grammar in mouse embryonic stem cells. eLife 2020;9:41279. [PMID: 32043966 PMCID: PMC7077988 DOI: 10.7554/elife.41279] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2018] [Accepted: 02/07/2020] [Indexed: 01/08/2023] Open

Abstract

In embryonic stem cells (ESCs), a core transcription factor (TF) network establishes the gene expression program necessary for pluripotency. To address how interactions between four key TFs contribute to cis-regulation in mouse ESCs, we assayed two massively parallel reporter assay (MPRA) libraries composed of binding sites for SOX2, POU5F1 (OCT4), KLF4, and ESRRB. Comparisons between synthetic cis-regulatory elements and genomic sequences with comparable binding site configurations revealed some aspects of a regulatory grammar. The expression of synthetic elements is influenced by both the number and arrangement of binding sites. This grammar plays only a small role for genomic sequences, as the relative activities of genomic sequences are best explained by the predicted occupancy of binding sites, regardless of binding site identity and positioning. Our results suggest that the effects of transcription factor binding sites (TFBS) are influenced by the order and orientation of sites, but that in the genome the overall occupancy of TFs is the primary determinant of activity.

Transcription factors are proteins that flip genetic switches; their role is to control when and where genes are active. They do this by binding to short stretches of DNA called cis-regulatory sequences. Each sequence can have several binding sites for different transcription factors, but it is largely unclear whether the transcription factors binding to the same regulatory sequence actually work together.

It is possible that each transcription factor may work independently and there only needs to be critical mass of transcription factors bound to throw the genetic switch. If this is the case, the most important features of a cis-regulatory sequence should be the number of binding sites it contains, and how tightly the transcription factors bind to those sites. The more transcription factors and the more strongly they bind, the more active the gene should be. An alternative option is that certain transcription factors may work better together, enhancing each other's effects such that the total effect is more than the sum of its parts. If this is true, the order, orientation and spacing of the binding sites within a sequence should matter more than the number.

One way to investigate to distinguish between these possibilities is to study mouse embryonic stem cells, which have a core set of four transcription factors. Looking directly at a real genome, however, can be confusing and it is difficult to measure the effects of different cis-regulatory sequences because genes differ in so many other ways. To tackle this problem, King et al. created a synthetic set of cis-regulatory sequences based on the four core transcription factors found in mouse stem cells.

The synthetic set had every combination of two, three or four of the binding sites, with each site either facing forwards or backwards along the DNA strand. King et al. attached each of the synthetic cis-regulatory sequences to a reporter gene to find out how well each sequence performed. This revealed that the cis-regulatory sequences with the most binding sites and the tightest binding affinities work best, suggesting that transcription factors mainly work independently.

There was evidence of some interaction between some transcription factors, because, of the synthetic sequences with four binding sites, some worked better than others, and there were patterns in the most effective binding site combinations. However, these effects were small and when King et al. went on to test sequences from the real mouse genome, the most important factor by far was the number of binding sites.

Synthetic libraries of DNA sequences allow researchers to examine gene regulation more clearly than is possible in real genomes. Yet this approach does have its limitations and it is impossible to capture every type of cis-regulatory sequence in one library. The next step to extend this work is to combine the two approaches, taking sequences from the real genome and manipulating them one by one. This could help to unravel the rules that govern how cis-regulatory sequences work in real cells.

Collapse

Esposito D, Weile J, Shendure J, Starita LM, Papenfuss AT, Roth FP, Fowler DM, Rubin AF. MaveDB: an open-source platform to distribute and interpret data from multiplexed assays of variant effect. Genome Biol 2019;20:223. [PMID: 31679514 PMCID: PMC6827219 DOI: 10.1186/s13059-019-1845-6] [Citation(s) in RCA: 112] [Impact Index Per Article: 22.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2019] [Accepted: 10/01/2019] [Indexed: 11/10/2022] Open

Affiliation(s)

Daniel Esposito Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, Australia
Jochen Weile The Donnelly Centre, University of Toronto, Toronto, ON, Canada Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON, Canada Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada Department of Computer Science, University of Toronto, Toronto, ON, Canada
Jay Shendure Department of Genome Sciences, University of Washington, Seattle, WA, USA Brotman Baty Institute for Precision Medicine, Seattle, WA, USA Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
Lea M Starita Department of Genome Sciences, University of Washington, Seattle, WA, USA Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
Anthony T Papenfuss Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, Australia Department of Medical Biology, University of Melbourne, Melbourne, VIC, Australia Bioinformatics and Cancer Genomics Laboratory, Peter MacCallum Cancer Centre, Melbourne, VIC, Australia Sir Peter MacCallum Department of Oncology, University of Melbourne, Melbourne, VIC, Australia Department of Mathematics and Statistics, University of Melbourne, Melbourne, VIC, Australia
Frederick P Roth The Donnelly Centre, University of Toronto, Toronto, ON, Canada. Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON, Canada. Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada. Department of Computer Science, University of Toronto, Toronto, ON, Canada. Canadian Institute for Advanced Research, Toronto, ON, Canada.
Douglas M Fowler Department of Genome Sciences, University of Washington, Seattle, WA, USA. Canadian Institute for Advanced Research, Toronto, ON, Canada. Department of Bioengineering, University of Washington, Seattle, WA, USA.
Alan F Rubin Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, Australia. Department of Medical Biology, University of Melbourne, Melbourne, VIC, Australia. Bioinformatics and Cancer Genomics Laboratory, Peter MacCallum Cancer Centre, Melbourne, VIC, Australia.

Collapse

Penzar DD, Zinkevich AO, Vorontsov IE, Sitnik VV, Favorov AV, Makeev VJ, Kulakovskiy IV. What Do Neighbors Tell About You: The Local Context of Cis-Regulatory Modules Complicates Prediction of Regulatory Variants. Front Genet 2019;10:1078. [PMID: 31737053 PMCID: PMC6834773 DOI: 10.3389/fgene.2019.01078] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2019] [Accepted: 10/09/2019] [Indexed: 02/05/2023] Open

Abstract

Many problems of modern genetics and functional genomics require the assessment of functional effects of sequence variants, including gene expression changes. Machine learning is considered to be a promising approach for solving this task, but its practical applications remain a challenge due to the insufficient volume and diversity of training data. A promising source of valuable data is a saturation mutagenesis massively parallel reporter assay, which quantitatively measures changes in transcription activity caused by sequence variants. Here, we explore the computational predictions of the effects of individual single-nucleotide variants on gene transcription measured in the massively parallel reporter assays, based on the data from the recent "Regulation Saturation" Critical Assessment of Genome Interpretation challenge. We show that the estimated prediction quality strongly depends on the structure of the training and validation data. Particularly, training on the sequence segments located next to the validation data results in the "information leakage" caused by the local context. This information leakage allows reproducing the prediction quality of the best CAGI challenge submissions with a fairly simple machine learning approach, and even obtaining notably better-than-random predictions using irrelevant genomic regions. Validation scenarios preventing such information leakage dramatically reduce the measured prediction quality. The performance at independent regulatory regions entirely excluded from the training set appears to be much lower than needed for practical applications, and even the performance estimation will become reliable only in the future with richer data from multiple reporters. The source code and data are available at https://bitbucket.org/autosomeru_cagi2018/cagi2018_regsat and https://genomeinterpretation.org/content/expression-variants.

Collapse

Empirical measures of mutational effects define neutral models of regulatory evolution in Saccharomyces cerevisiae. Proc Natl Acad Sci U S A 2019;116:21085-21093. [PMID: 31570626 DOI: 10.1073/pnas.1902823116] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Vainberg Slutskin I, Weinberger A, Segal E. Sequence determinants of polyadenylation-mediated regulation. Genome Res 2019;29:1635-1647. [PMID: 31530582 PMCID: PMC6771402 DOI: 10.1101/gr.247312.118] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2018] [Accepted: 08/13/2019] [Indexed: 12/31/2022]

Kreimer A, Yan Z, Ahituv N, Yosef N. Meta-analysis of massively parallel reporter assays enables prediction of regulatory function across cell types. Hum Mutat 2019;40:1299-1313. [PMID: 31131957 PMCID: PMC6771677 DOI: 10.1002/humu.23820] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2019] [Revised: 05/18/2019] [Accepted: 05/24/2019] [Indexed: 01/01/2023]

Wollman AJM, Hedlund EG, Shashkova S, Leake MC. Towards mapping the 3D genome through high speed single-molecule tracking of functional transcription factors in single living cells. Methods 2019;170:82-89. [PMID: 31252059 PMCID: PMC6971689 DOI: 10.1016/j.ymeth.2019.06.021] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2019] [Accepted: 06/22/2019] [Indexed: 10/26/2022] Open

Wang X, Zhou T, Wunderlich Z, Maurano MT, DePace AH, Nuzhdin SV, Rohs R. Analysis of Genetic Variation Indicates DNA Shape Involvement in Purifying Selection. Mol Biol Evol 2019;35:1958-1967. [PMID: 29850830 PMCID: PMC6063282 DOI: 10.1093/molbev/msy099] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open

Kinney JB, McCandlish DM. Massively Parallel Assays and Quantitative Sequence-Function Relationships. Annu Rev Genomics Hum Genet 2019;20:99-127. [PMID: 31091417 DOI: 10.1146/annurev-genom-083118-014845] [Citation(s) in RCA: 76] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Qiu C, Kaplan CD. Functional assays for transcription mechanisms in high-throughput. Methods 2019;159-160:115-123. [PMID: 30797033 PMCID: PMC6589137 DOI: 10.1016/j.ymeth.2019.02.017] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2019] [Accepted: 02/18/2019] [Indexed: 01/12/2023] Open

Swank Z, Laohakunakorn N, Maerkl SJ. Cell-free gene-regulatory network engineering with synthetic transcription factors. Proc Natl Acad Sci U S A 2019;116:5892-5901. [PMID: 30850530 PMCID: PMC6442555 DOI: 10.1073/pnas.1816591116] [Citation(s) in RCA: 42] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open

Myint L, Avramopoulos DG, Goff LA, Hansen KD. Linear models enable powerful differential activity analysis in massively parallel reporter assays. BMC Genomics 2019;20:209. [PMID: 30866806 PMCID: PMC6417258 DOI: 10.1186/s12864-019-5556-x] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2018] [Accepted: 02/22/2019] [Indexed: 12/15/2022] Open

Hartl D, Krebs AR, Grand RS, Baubec T, Isbel L, Wirbelauer C, Burger L, Schübeler D. CG dinucleotides enhance promoter activity independent of DNA methylation. Genome Res 2019;29:554-563. [PMID: 30709850 PMCID: PMC6442381 DOI: 10.1101/gr.241653.118] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2018] [Accepted: 01/24/2019] [Indexed: 11/24/2022]

Shapshak P, Balaji S, Kangueane P, Chiappelli F, Somboonwit C, Menezes LJ, Sinnott JT. Innovative Technologies for Advancement of WHO Risk Group 4 Pathogens Research. GLOBAL VIROLOGY III: VIROLOGY IN THE 21ST CENTURY 2019. [PMCID: PMC7122670 DOI: 10.1007/978-3-030-29022-1_15] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Forcier TL, Ayaz A, Gill MS, Jones D, Phillips R, Kinney JB. Measuring cis-regulatory energetics in living cells using allelic manifolds. eLife 2018;7:40618. [PMID: 30570483 PMCID: PMC6301791 DOI: 10.7554/elife.40618] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2018] [Accepted: 11/27/2018] [Indexed: 12/04/2022] Open

Chakravorty S, Hegde M. Inferring the effect of genomic variation in the new era of genomics. Hum Mutat 2018;39:756-773. [PMID: 29633501 DOI: 10.1002/humu.23427] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2017] [Revised: 03/20/2018] [Accepted: 03/28/2018] [Indexed: 12/11/2022]

Park J, Wang HH. Systematic and synthetic approaches to rewire regulatory networks. CURRENT OPINION IN SYSTEMS BIOLOGY 2018;8:90-96. [PMID: 30637352 PMCID: PMC6329604 DOI: 10.1016/j.coisb.2017.12.009] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]

Iyer S, Acharya KR, Subramanian V. A comparative bioinformatic analysis of C9orf72. PeerJ 2018;6:e4391. [PMID: 29479499 PMCID: PMC5822839 DOI: 10.7717/peerj.4391] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2017] [Accepted: 01/29/2018] [Indexed: 12/12/2022] Open

Unraveling the determinants of microRNA mediated regulation using a massively parallel reporter assay. Nat Commun 2018;9:529. [PMID: 29410437 PMCID: PMC5802814 DOI: 10.1038/s41467-018-02980-z] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2017] [Accepted: 01/11/2018] [Indexed: 12/16/2022] Open

Gan KA, Carrasco Pro S, Sewell JA, Fuxman Bass JI. Identification of Single Nucleotide Non-coding Driver Mutations in Cancer. Front Genet 2018;9:16. [PMID: 29456552 PMCID: PMC5801294 DOI: 10.3389/fgene.2018.00016] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2017] [Accepted: 01/12/2018] [Indexed: 12/14/2022] Open

A Simple Grammar Defines Activating and Repressing cis-Regulatory Elements in Photoreceptors. Cell Rep 2017;17:1247-1254. [PMID: 27783940 DOI: 10.1016/j.celrep.2016.09.066] [Citation(s) in RCA: 49] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2016] [Revised: 08/06/2016] [Accepted: 09/20/2016] [Indexed: 12/22/2022] Open

Hartl D, Krebs AR, Jüttner J, Roska B, Schübeler D. Cis-regulatory landscapes of four cell types of the retina. Nucleic Acids Res 2017;45:11607-11621. [PMID: 29059322 PMCID: PMC5714137 DOI: 10.1093/nar/gkx923] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2017] [Revised: 07/28/2017] [Accepted: 10/02/2017] [Indexed: 12/18/2022] Open

Brown AJ, Gibson SJ, Hatton D, James DC. In silico design of context-responsive mammalian promoters with user-defined functionality. Nucleic Acids Res 2017;45:10906-10919. [PMID: 28977454 PMCID: PMC5737543 DOI: 10.1093/nar/gkx768] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2017] [Accepted: 08/22/2017] [Indexed: 12/19/2022] Open

Kreimer A, Zeng H, Edwards MD, Guo Y, Tian K, Shin S, Welch R, Wainberg M, Mohan R, Sinnott-Armstrong NA, Li Y, Eraslan G, AMIN TB, Goke J, Mueller NS, Kellis M, Kundaje A, Beer MA, Keles S, Gifford DK, Yosef N. Predicting gene expression in massively parallel reporter assays: A comparative study. Hum Mutat 2017;38:1240-1250. [PMID: 28220625 PMCID: PMC5560998 DOI: 10.1002/humu.23197] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2016] [Revised: 01/19/2017] [Accepted: 02/12/2017] [Indexed: 02/03/2023]

Affiliation(s)

Anat Kreimer Department of Electrical Engineering and Computer Science and Center for Computational Biology, University of California, Berkeley, Berkeley, CA 94720, USA Department of Bioengineering and Therapeutic Sciences, Institute for Human Genetics, University of California, San Francisco, San Francisco, California, USA
Haoyang Zeng Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
Matthew D. Edwards Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
Yuchun Guo Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
Kevin Tian Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
Sunyoung Shin Department of Statistics, Department of Biostatistics and Medical Informatics University of Wisconsin-Madison, Madison, Wisconsin, USA
Rene Welch Department of Statistics, Department of Biostatistics and Medical Informatics University of Wisconsin-Madison, Madison, Wisconsin, USA
Michael Wainberg Department of Genetics, Stanford University School of Medicine, Department of Computer Science, Stanford, California 94305, USA
Rahul Mohan Department of Genetics, Stanford University School of Medicine, Department of Computer Science, Stanford, California 94305, USA
Nicholas A. Sinnott-Armstrong Department of Genetics, Stanford University School of Medicine, Department of Computer Science, Stanford, California 94305, USA
Yue Li Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA
Gökcen Eraslan Computational Cell Maps, Institute of Computational Biology, Helmholtz Zentrum München, Ingolstädter Landstr. 1 85764 Neuherberg, Germany
Talal Bin AMIN Computational and Systems Biology, Genome Institute of Singapore, Singapore 138672, Singapore
Jonathan Goke Computational and Systems Biology, Genome Institute of Singapore, Singapore 138672, Singapore
Nikola S. Mueller Computational Cell Maps, Institute of Computational Biology, Helmholtz Zentrum München, Ingolstädter Landstr. 1 85764 Neuherberg, Germany
Manolis Kellis Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, Massachusetts 02139, USA
Anshul Kundaje Department of Genetics, Stanford University School of Medicine, Department of Computer Science, Stanford, California 94305, USA
Michael A Beer McKusick-Nathans Institute of Genetic Medicine, Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
Sunduz Keles Department of Statistics, Department of Biostatistics and Medical Informatics University of Wisconsin-Madison, Madison, Wisconsin, USA
David K. Gifford Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
Nir Yosef Department of Electrical Engineering and Computer Science and Center for Computational Biology, University of California, Berkeley, Berkeley, CA 94720, USA Ragon Institute of Massachusetts General Hospital, MIT and Harvard, Cambridge, MA, 02139

Collapse

Huminiecki Ł, Horbańczuk J. Can We Predict Gene Expression by Understanding Proximal Promoter Architecture? Trends Biotechnol 2017;35:530-546. [PMID: 28377102 DOI: 10.1016/j.tibtech.2017.03.007] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2016] [Revised: 02/14/2017] [Accepted: 03/09/2017] [Indexed: 10/19/2022]

Inukai S, Kock KH, Bulyk ML. Transcription factor-DNA binding: beyond binding site motifs. Curr Opin Genet Dev 2017;43:110-119. [PMID: 28359978 PMCID: PMC5447501 DOI: 10.1016/j.gde.2017.02.007] [Citation(s) in RCA: 189] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2016] [Revised: 02/02/2017] [Accepted: 02/07/2017] [Indexed: 12/12/2022]