January 2008 Release NotesAbout this documentMicroRNA.org is a web-accessible information resource produced and maintained at cBio, the Computational Biology Center at Memorial Sloan Kettering Cancer Center (MSKCC). This document describes technical details of parameters and algorithmic details used to produce the January 2008 data release. ReferencemicroRNA target predictions: The microRNA.org resource: targets and expression.
Betel D, Wilson M, Gabow A, Marks DS, Sander C., Nucleic Acids Res. 2008 Jan; 36(Database Issue): D149-53.
ContactSend comments and questions to SummaryThe MicroRNA.org site is a comprehensive resource of microRNA target predictions and expression profiles. Target predictions are based on a development of the miRanda algorithm which incorporates current biological knowledge on target rules and on the use of an up-to-date compendium of mammalian microRNAs. MicroRNA expression profiles are derived from a comprehensive sequencing project of a large set of mammalian tissues and cell lines of normal and disease origin. This website enables users to explore:
DataMicroRNA sequences were collected from MirBase release 10.0. Mammalian 3'UTR sequences from the rat (rno4), mouse (mm7) and human (hg18) genomes were downloaded from the UCSC genome browser. MicroRNA expression profiles were collected from a recently published comprehensive cloning and sequencing effort of 172 human, 64 mouse and 16 rat small RNA libraries extracted from major organs and cell types (Landgraf et al., Cell, 129, (2007), 1401-1414). Expression values represent the number of cloned mature microRNAs that were sequenced in each library and reported as clone counts. The counts are normalized by the total number of microRNAs that were cloned in each library. Prediction parametersPredictions were generated using the 2004 version of the miRanda algorithm (John et al., PLoS Biology, 2, (2004), e363) with the following command-line options: Score cutoff S>= 140
The match value s(i) is multiplied by a position specific weight w(i) before being evaluated in the alignment algorithm.
The position specific weights reflect the non-homogeneous effect of different positions,
such as the importance of the 'seed' or 'nucleus' generally defined as positions 2-8.
Thus the total score S for a particular alignment is
Note that a perfect heptamer match in positions 2-8 corresponds to a score of 140 (5*4*7), where 5 reflects a match, 4 the positional weight and 7 the number of positions. Caveat:These base-pairing scores were optimized against a limited set of validated targets. We are currently (early 2008) in the process of refining these values and will probably refine the parameters by mid-2008. Therefore, target predictions may differ in subsequent releases. Users will have access to previous, archived releases through links on the site. ConservationWe used PhastCons conservation score, which measures the evolutionary conservation of sequence blocks across multiple vertebrates using a phylogenetic hidden Markov model (Siepel et al., Genome research, 15, (2005), 1034-1050.), to filter out less conserved predicted target sites. Target sites are filtered for PhastCons 0.57, which roughly corresponds to conservation across all mammals. Software and data filesThe miRanda source code (labeled Jan 2008 version) is available on the microRNA.org web site under the LGPL open source license. Data files in tabular form (labeled Jan 2008) are also available for download. Additional References
|
|||||||||
|
|
|||||||||