http://www.snp-nexus.org/index.html

SNPnexus database is designed to simplify and assist in the selection of functionally relevant Single Nucleotide Polymorphisms (SNP) for large-scale genotyping studies of multifactorial disorders.

SNPnexus allows single or batch queries using dbSNP identifiers or in-house SNPs genomic coordinates on clones, contigs or chromosomes.

SNPnexus will be updated on a regular monthly basis and will provide the scientific community with a common friendly web- interface to perform the following analyses:

A wide range of possible SNPs functional consequences is computed on the major gene annotation systems from NCBI RefSeq (Pruitt et al., 2007), UCSC Known Genes (Hsu et al 2006), Ensembl (Hubbard et al, 2007), Vega (Ashurst et al, 2005) and Acembly (Thierry-Mieg et al 2006). SNPs effects fall into 7 categories: intronic, 5’UTR, 3’UTR, 5-upstream, 3-downstream, splicing site or coding (synonymous or non-synonymous). For intronic SNPs, the distance to the splicing site is reported. For 5-upstream and 3-downstream, a distance of 2 kb from the UTR start or end is considered appropriate. For coding SNPs, the coordinates of the position within the cdna, cds, amino acid as well as the peptide change produced (if any) are reported

SNPnexus retrieves links to dbSNP (http://www.ncbi.nlm.nih.gov/projects/SNP/) (Sherry et al 2001) and related genotypes or alleles frequency estimation from the HapMap populations (http://www.hapmap.org/) (The International HapMap Consortium 2007).

Regulatory SNPs can be queried against any overlap with conserved Transcription Factor Binding Sites (TFBS) and First-Exon and Promoter Prediction (FirstEF) (Davuluri et al 2001) available through the UCSC Genome browser. Overlaps with miRBASE 11.0 (Griffiths-Jones 2004), TargetScan miRNA Regulatory Sites (Lewis et al 2003), microRNAs (miRNA Registry) / snoRNAs and scaRNAs (snoRNA-LBME-DB) (Griffiths-Jones 2004 and Weber 2005)

Overlaps with putative copy number polymorphisms (CNP) and sites of detected intermediate-sized structural variation (ISV) determined by various methods can be retrieved. These are derived from the Database of Genomic Variants (DGV) (http://projects.tcag.ca/variation/) via UCSC and cover deletions from genotype analysis using the HapMap Phase I data, release 16a (McCarroll et al 2006 ), HapMap Phase I data, release 16c.1, CEU and YRI samples (Conrad et al 2006) and from haploid hybridization analysis in 24 unrelated individuals from the Polymorphism Discovery Resource, selected for SNP LD study (Hinds et al 2006). This also include Copy Number Polymorphism (CNP) from BAC microarray analysis in a first population of 55 individuals (Iafrate et al 2004), a second population of 47 individuals (Sharp et al 2005) as well in the HapMap Phase II data (Redon et al 2006). Further CNP regions are identified using array CGH in the HapMap populations (269 individuals) (Locke et al 2006) and using Representational oligonucleotide microarray analysis (ROMA) (Sebat et al 2004) in a population of 20 normal individuals. ISV sites are detected by mapping paired-end sequences from a human fosmid DNA library (Tuzun et al 2005).

 

文章標籤
全站熱搜
創作者介紹
創作者 tear2001 的頭像
tear2001

La Folie, Quo Vadis, Evo-Devo?

tear2001 發表在 痞客邦 留言(0) 人氣(32)