Difference between revisions of "Research"

From Zhang Laboratory

Jump to: navigation, search
Line 1: Line 1:
We are working on RNA at the interface of Systems Biology, Computer Science and Molecular Neuroscience, currently sponsored by NIH, Simons Foundation, and Columbia University Medical Center.
+
__NOTOC__
  
The importance of post-transcriptional regulation at the RNA level for orchestrated gene expression in mammalian systems is increasingly recognized.  Such regulation is dictated by interaction of at least several hundred RNA-binding proteins (RBPs) with their target transcripts, or RNA-regulatory networks, and has profound impact on the output of the transcriptome, especial for the development and function of the nervous system. 
 
[[File:splicing.png|right|border|300px]]
 
To understand the function of RNA regulatory network, a first step is to infer the structure of such networks, which is challenging due in part to the fact that most RBPs recognize very short and generate sequence motifs with limited information content.  In the past few years, we took advantage of advances in high-throughput biochemical and molecular biology assays (most of which are related to next-generation sequencing), such as HITS-CLIP (or CLIP-Seq) and RNA-Seq that profile transcriptomes and protein-RNA interactomes.  We applied these assays to specific neuronal RBPs,mostly using mouse brain as a model system, and intersect them with statistical and machine learning approaches to identify exons under tissue-specific regulation and predict specific protein-RNA interactions at single nucleotide resolution.  Importantly, we also developed an integrative modeling approach to combine multiple modalities of data to infer direct and functional targets of specific RBPs with high accuracy and sensitivity. Investigation of the resulting networks allow us to make unexpected findings such as combinatorial regulation by multiple RBPs.
 
  
 +
We are working on '''RNA''' at the interface of Systems Biology, Computer Science and Molecular Neuroscience, currently sponsored by NIH, Simons Foundation, and Columbia University Medical Center.
 +
 +
==Overview==
 +
 +
{|class="zebra wikitable" width="100%" style="border:0px solid; border-collapse:collapse;"
 +
 +
|-
 +
|
 +
Post-transcriptional regulation at the RNA level has profound impact on gene expression, especial for the development and function of the nervous system.  Such regulation is dictated by interaction of at least several hundred RNA-binding proteins (RBPs) with their target transcripts, or RNA-regulatory networks.  Our current work focus on '''neuron-specific alternative splicing''', one critical step of RNA regulation.  Our research interests range from '''basic understanding of the specificity of protein-RNA interaction, to organization of splicing regulatory networks, and to function of specific splice variants'''.
 +
 +
 +
|
 +
[[File:splicing.png|right|300px]]
 +
 +
|}
 +
 +
 +
==Protein-RNA interactions at single nucleotide resolution==
 +
 +
{|class="zebra wikitable" width="100%" style="border:0px solid; border-collapse:collapse;"
 +
 +
|-
 +
|
 +
The RBP binding sites are notoriously difficult to map probably because of the flexibility of RNA.  The development of cross linking and immunoprecipitation followed by high-throughput sequencing (HITS-CLIP or CLIP-Seq) has been a revolution to the RNA field. This high-throughput biochemical assay makes it possible to map in vivo protein-RNA interactions on a genome wide scale.  Standard analysis pipelines of CLIP data typically map protein-RNA interaction sites at a resolution of ~30-60 nt.  We developed a new computational method to determine the exact protein-RNA crosslink sites.  This method, termed CIMS and CITS analysis, allows us to '''map protein-RNA interactions at single nucleotide resolution'''.  We are using this powerful approach to obtain unprecedented details about how the specificity of protein-RNA interaction is determined.
 +
 +
*Moore, M.*, Zhang, C.*, Gantman, E.C., Mele, A., Darnell, J.C., Darnell, R.B. 2014. Mapping Argonaute and conventional RNA-binding protein interactions with RNA at single-nucleotide resolution using HITS-CLIP and CIMS analysis. Nat Protocols.  9:263-293. ([[CIMS_Documentation|Software]])
 +
 +
*Zhang, C.†, Darnell, R.B.† 2011. Mapping in vivo protein-RNA interactions at single-nucleotide resolution from HITS-CLIP data. ''Nat Biotech'', 29:607-614.
 +
 +
 +
|
 +
[[File:crosslink.png|right|300px]]
 +
 +
|}
  
 
==Organizational principles and functional impact of neuronal RNA-regulatory networks==
 
==Organizational principles and functional impact of neuronal RNA-regulatory networks==
For our current work, one focus is to investigate systematically how multiple RBPs work together to derive dynamic regulation during neurodevelopment. We aim to find predictive rules of combinatorial regulation, and how groups of RBPs define subsets of target transcripts that are functional distinct.  To this end, we rely on the structure of RBPs defined by integrative modeling approaches, and statistical inference to determine the correlation structure underlying time series transcriptome data.
 
  
Another exciting direction is the use of embryonic stem cells (ESCs) and in vitro differentiation of neurons from (ESCs).  We collaborate with Hynek Wichterle group at Columbia, who developed and optimized the directed, in vitro differentiation of motor neurons, a group of nerve cells through which the brain controls muscle contraction. Motor neurons are also the primary targets in several devastating neuron degenerative diseases including amyotrophic lateral sclerosis (ALS) and spinal muscular satrapy (SMA)This system has two key advantages: 1) the differential process recapitulate many key aspects of in vivo motor neuron development; 2) the use of ES cells makes it possible to leverage genome engineering techniques to perturb RNA regulatory networksBy dissecting RNA regulatory networks in detail, we hope to obtain a deeper understanding of their functional impact and the underlying regulatory mechanisms in this well defined, clinically relevant cell type.
+
{|class="zebra wikitable" width="100%" style="border:0px solid; border-collapse:collapse;"
 +
 
 +
|-
 +
|
 +
To understand the function of RNA regulatory network, a first step is to infer the structure of such networks, which is challenging due in part to the degeneracy and dynamic nature of the splicing code.  We pioneered the development of '''integrative analysis of splicing regulatory networks''', which has enabled us to identify alternative exons regulated by specific RBPs with unprecedented accuracy and sensitivityIn brief, our strategy was to identify altered splicing upon genetic RBP depletion, although such changes can be either direct or indirect.  The latter was distinguished by mapping the precise RBP binding sites in vivo using HITS-CLIP.  These data were then formally combined using a Bayesian network to determine high-confidence, direct target transcripts25,28.  Studies using this paradigm demonstrated the concerted regulation of hundreds of alternative exons by individual neuronal RBPsInvestigation of the resulting networks allow us to make unexpected findings such as coupling of splicing with post-translational modifications.
  
 
*Weyn-Vanhentenryck,S.,M.*, Mele,A.*, Yan,Q.*, Sun,S., Farny,N., Zhang,Z., Xue,C., Herre,M., Silver,P.A., Zhang,M.Q., Krainer,A.R., Darnell,R.B.†, Zhang,C. † 2014.  HITS-CLIP and integrative modeling define the Rbfox splicing-regulatory network linked to brain development and autism. ''Cell Rep'', 6:1139-1152.
 
*Weyn-Vanhentenryck,S.,M.*, Mele,A.*, Yan,Q.*, Sun,S., Farny,N., Zhang,Z., Xue,C., Herre,M., Silver,P.A., Zhang,M.Q., Krainer,A.R., Darnell,R.B.†, Zhang,C. † 2014.  HITS-CLIP and integrative modeling define the Rbfox splicing-regulatory network linked to brain development and autism. ''Cell Rep'', 6:1139-1152.
Line 15: Line 49:
 
*Zhang, C.*, Zhang, Z.*, Castle, J., Sun, S., Johnson, J., Krainer, A.R. and Zhang, M.Q. 2008. Defining the regulatory network of the tissue-specific splicing factors Fox-1 and Fox-2. ''Genes Dev'', 22:2550-2563.
 
*Zhang, C.*, Zhang, Z.*, Castle, J., Sun, S., Johnson, J., Krainer, A.R. and Zhang, M.Q. 2008. Defining the regulatory network of the tissue-specific splicing factors Fox-1 and Fox-2. ''Genes Dev'', 22:2550-2563.
  
 +
|
 +
[[File:bnet.png|right|300px]]
  
 +
|}
  
==Variation of RNA-regulatory networks in evolution, human populations and in neuronal disorders==
+
==RNA regulatory networks in neural development==
  
A second related direction of our lab is to evaluate the impact of mutations on RNA regulation in normal physiology or disease.  Our study spans three contexts: comparison of different species (e.g. rodents and primates), different human populations, and patients affected by neurological diseases and normal controls.  Such study is facilitated by our ability to determine protein-RNA interactions at a high resolution and distinction of functional vs. nonfunctional interactions.  We will apply this strategy to parallel systems in different species that are directly comparable, large transcriptome profiles of human populations generated by consortium efforts, and mutations identified by genomic sequencing compiled from the public domain and collaborators.
+
{|class="zebra wikitable" width="100%" style="border:0px solid; border-collapse:collapse;"
  
*Zhang, C. , Lee, K.-Y., Swanson, M.S., Darnell, R.B. † 2013. Prediction of clustered RNA-binding protein motif sites in the mammalian genome. Nucleic Acids Res. 41:6793-6807. PMCID: PMC3737533.
+
|-
 +
|style="vertical-align: top;"|
 +
For our current work, one focus is to investigate systematically how multiple RBPs work together to drive '''dynamic regulation during neurodevelopment'''. We aim to find predictive rules of combinatorial regulation, and how groups of RBPs define subsets of target transcripts that are functional distinct. To this end, we rely on the structure of RBPs defined by integrative modeling approaches, and statistical inference to determine the correlation structure underlying time series transcriptome data.
  
*Zhang, C.*, Zhang, Z.*, Castle, J., Sun, S., Johnson, J., Krainer, A.R. and Zhang, M.Q. 2008. Defining the regulatory network of the tissue-specific splicing factors Fox-1 and Fox-2. Genes Dev. 22:2550-2563.  
+
Another exciting direction is the use of embryonic stem cells (ESCs) and in vitro differentiation of neurons from (ESCs). We collaborate with Hynek Wichterle group at Columbia, who developed and optimized the directed, in vitro differentiation of motor neurons, a group of nerve cells through which the brain controls muscle contraction. Motor neurons are also the primary targets in several devastating neuron degenerative diseases including amyotrophic lateral sclerosis (ALS) and spinal muscular satrapy (SMA). This system has two key advantages: 1) the differential process recapitulate many key aspects of in vivo motor neuron development; 2) the use of ES cells makes it possible to leverage genome engineering techniques to perturb RNA regulatory networks. By dissecting RNA regulatory networks in detail, we hope to obtain a deeper understanding of their functional impact and the underlying regulatory mechanisms in this well defined, clinically relevant cell type.
  
 +
|
 +
[[File:eb.png|right|300px]]
  
 +
|}
  
==High-throughput transcriptomic data analysis==
+
==Variation of RNA-regulatory networks in evolution, human populations and in neuronal disorders==
 +
{|class="zebra wikitable" width="100%" style="border:0px solid; border-collapse:collapse;"
  
Our work heavily relies on high-throughput technologies which produce enormous amount of data, and on algorithms to transform these data into useful informationWe are interested in developing better algorithms to process transcroptomic data, such as mapping RNA-Seq reads, discovering and quantifying RNA processing in specific conditions, and analyzing CLIP data to map protein-RNA interactions at a single nucleotide resolution.
+
|-
 +
|style="vertical-align: top;"|
 +
Another related direction of our lab is to evaluate the impact of '''mutations on RNA regulation''' in normal physiology or diseaseOur study spans three contexts: comparison of different species (e.g. rodents and primates), different human populations, and patients affected by neurological diseases and normal controls.  Such study is facilitated by our ability to determine protein-RNA interactions at a high resolution and distinction of functional vs. nonfunctional interactions.  We will apply this strategy to parallel systems in different species that are directly comparable, large transcriptome profiles of human populations generated by consortium efforts, and mutations identified by genomic sequencing compiled from the public domain and collaborators.
  
*Moore, M.*, Zhang, C.*, Gantman, E.C., Mele, A., Darnell, J.C., Darnell, R.B. 2014. Mapping Argonaute and conventional RNA-binding protein interactions with RNA at single-nucleotide resolution using HITS-CLIP and CIMS analysis. Nat Protocols.  9:263-293. ([[CIMS_Documentation|Software]])
 
  
*Wu,J., Anczukow,O., Krainer,A.R., Zhang,M.Q. †, Zhang,C. †, 2013. OLego: Fast and sensitive mapping of spliced mRNA-Seq reads using small seeds. ''Nucleic Acids Res.'' , In press. ([[OLego|Software]])
+
|
 +
[[File:evo.png|right|300px]]
  
*Zhang, C.†, Darnell, R.B.† 2011. Mapping in vivo protein-RNA interactions at single-nucleotide resolution from HITS-CLIP data. ''Nat Biotech'', 29:607-614.
+
|}

Revision as of 23:45, 9 March 2016


We are working on RNA at the interface of Systems Biology, Computer Science and Molecular Neuroscience, currently sponsored by NIH, Simons Foundation, and Columbia University Medical Center.

Overview

Post-transcriptional regulation at the RNA level has profound impact on gene expression, especial for the development and function of the nervous system. Such regulation is dictated by interaction of at least several hundred RNA-binding proteins (RBPs) with their target transcripts, or RNA-regulatory networks. Our current work focus on neuron-specific alternative splicing, one critical step of RNA regulation. Our research interests range from basic understanding of the specificity of protein-RNA interaction, to organization of splicing regulatory networks, and to function of specific splice variants.


Splicing.png


Protein-RNA interactions at single nucleotide resolution

The RBP binding sites are notoriously difficult to map probably because of the flexibility of RNA. The development of cross linking and immunoprecipitation followed by high-throughput sequencing (HITS-CLIP or CLIP-Seq) has been a revolution to the RNA field. This high-throughput biochemical assay makes it possible to map in vivo protein-RNA interactions on a genome wide scale. Standard analysis pipelines of CLIP data typically map protein-RNA interaction sites at a resolution of ~30-60 nt. We developed a new computational method to determine the exact protein-RNA crosslink sites. This method, termed CIMS and CITS analysis, allows us to map protein-RNA interactions at single nucleotide resolution. We are using this powerful approach to obtain unprecedented details about how the specificity of protein-RNA interaction is determined.

  • Moore, M.*, Zhang, C.*, Gantman, E.C., Mele, A., Darnell, J.C., Darnell, R.B. 2014. Mapping Argonaute and conventional RNA-binding protein interactions with RNA at single-nucleotide resolution using HITS-CLIP and CIMS analysis. Nat Protocols. 9:263-293. (Software)
  • Zhang, C.†, Darnell, R.B.† 2011. Mapping in vivo protein-RNA interactions at single-nucleotide resolution from HITS-CLIP data. Nat Biotech, 29:607-614.


Crosslink.png

Organizational principles and functional impact of neuronal RNA-regulatory networks

To understand the function of RNA regulatory network, a first step is to infer the structure of such networks, which is challenging due in part to the degeneracy and dynamic nature of the splicing code. We pioneered the development of integrative analysis of splicing regulatory networks, which has enabled us to identify alternative exons regulated by specific RBPs with unprecedented accuracy and sensitivity. In brief, our strategy was to identify altered splicing upon genetic RBP depletion, although such changes can be either direct or indirect. The latter was distinguished by mapping the precise RBP binding sites in vivo using HITS-CLIP. These data were then formally combined using a Bayesian network to determine high-confidence, direct target transcripts25,28. Studies using this paradigm demonstrated the concerted regulation of hundreds of alternative exons by individual neuronal RBPs. Investigation of the resulting networks allow us to make unexpected findings such as coupling of splicing with post-translational modifications.

  • Weyn-Vanhentenryck,S.,M.*, Mele,A.*, Yan,Q.*, Sun,S., Farny,N., Zhang,Z., Xue,C., Herre,M., Silver,P.A., Zhang,M.Q., Krainer,A.R., Darnell,R.B.†, Zhang,C. † 2014. HITS-CLIP and integrative modeling define the Rbfox splicing-regulatory network linked to brain development and autism. Cell Rep, 6:1139-1152.
  • Zhang, C.†, Frias, M.A., Mele, A., Ruggiu, M., Eom, T., Marney, C.B., Wang, H., Licatalosi, D.D., Fak, J.J., Darnell, R.B.† 2010. Integrative modeling defines the Nova splicing-regulatory network and its combinatorial controls. Science, 329: 439-443.
  • Zhang, C.*, Zhang, Z.*, Castle, J., Sun, S., Johnson, J., Krainer, A.R. and Zhang, M.Q. 2008. Defining the regulatory network of the tissue-specific splicing factors Fox-1 and Fox-2. Genes Dev, 22:2550-2563.
Bnet.png

RNA regulatory networks in neural development

For our current work, one focus is to investigate systematically how multiple RBPs work together to drive dynamic regulation during neurodevelopment. We aim to find predictive rules of combinatorial regulation, and how groups of RBPs define subsets of target transcripts that are functional distinct. To this end, we rely on the structure of RBPs defined by integrative modeling approaches, and statistical inference to determine the correlation structure underlying time series transcriptome data.

Another exciting direction is the use of embryonic stem cells (ESCs) and in vitro differentiation of neurons from (ESCs). We collaborate with Hynek Wichterle group at Columbia, who developed and optimized the directed, in vitro differentiation of motor neurons, a group of nerve cells through which the brain controls muscle contraction. Motor neurons are also the primary targets in several devastating neuron degenerative diseases including amyotrophic lateral sclerosis (ALS) and spinal muscular satrapy (SMA). This system has two key advantages: 1) the differential process recapitulate many key aspects of in vivo motor neuron development; 2) the use of ES cells makes it possible to leverage genome engineering techniques to perturb RNA regulatory networks. By dissecting RNA regulatory networks in detail, we hope to obtain a deeper understanding of their functional impact and the underlying regulatory mechanisms in this well defined, clinically relevant cell type.

Eb.png

Variation of RNA-regulatory networks in evolution, human populations and in neuronal disorders

Another related direction of our lab is to evaluate the impact of mutations on RNA regulation in normal physiology or disease. Our study spans three contexts: comparison of different species (e.g. rodents and primates), different human populations, and patients affected by neurological diseases and normal controls. Such study is facilitated by our ability to determine protein-RNA interactions at a high resolution and distinction of functional vs. nonfunctional interactions. We will apply this strategy to parallel systems in different species that are directly comparable, large transcriptome profiles of human populations generated by consortium efforts, and mutations identified by genomic sequencing compiled from the public domain and collaborators.


Evo.png