RiceVarMap is a database that contains 17,397,026 variations (including 14,541,446 SNPs and 2,855,580 small INDELs ) of 4,726 cultivars all over the world.
The database provides various variation query functions (like 'Search for Variation by Region', 'Search for Variation in Gene' and 'Search for Genotype With Variation ID' et. al. ) and many useful analysis tools (like primer design tools, 'Design Primer by Region', 'Design Primer by Variation ID' and 'Haplotype Network Analysis' et. al.). All the variations are annotated by snpEff, CooVar, and ployphen2. The GWAS results and chromatin accessibility data from ATAC-seq are also used to curate the variation effects, all the significantly associated variations are saved in the database. Moreover, the query results are available for users to download through our website.
You can visit this version of RiceVarMap by entering its address http://ricevarmap.ncpgr.cn/v2. Background of data collection, processing and evaluation could be found in Notes and Data Evaluation page.
You can click this link, specify a chromosome and input a range (required), and also you can filter Variations by major allele frequencies (optional). For results display, you can select only output SNP or INDEL and populations major allele frequencies in this region.
The results show like this:
You can click this link, enter one MSU Osal Rice Loci (e.g. LOC_Os01g01070), and if you want to search for variations in the upstream or downstream region, you can enter the upstream or downstream distance between the gene (e.g. upstream: 0.5kb, downstream 0.2kb, optional).
First, you can see a figure of variations distribution in this region, it can be zoom in or download.
And the results table is also can be sorted and searched.
Firstly, you should collect variation IDs you need (you can find that in "search for Variations by region" page), and select "Variation Information" and click "Search for Genotype with Variation ID" (the link). Search and select cultivar name, click ">" (only add selected cultivars) or click ">>" (add all cultivars), enter variation IDs (e.g. vg0500004123,vg0500073814,vg0500152745,vg0500208833) , you can select only get the raw genotype (from the VCF file), only get imputed genotype (Imputed by an LD-KNN algorithm) or output imputed and raw genotype, and then click submit to get results.
In the result table, each column represents one variation and chromosome, position, variation type (SNP?INDEL), reference allele, major allele and minor allele information are also listed out. The allele in red color means it is a minor allele, background color in green means it is the raw genotype (not imputed).
You can click this link, the cultivar can be searched by input the key word to reduce the selection time. The detailed steps are in the figure below:
The results are two tables, the first is the cultivars information table, including cultivar name, ID name in the database, subpopulation information, and location information. The second table contains the genotype of two selected cultivars.
In this link, you just need input one variation ID and can get the result.
Results include the Flanking Sequence (20bp) around the variation, allele frequencies of each population. variation effect and the GWAS results of the variation.
In this link, cultivars can be selected and the results will output the cultivars location image and the detailed information table.
By using this link, first, you should select one phenotype name, and then choose one population. Up to now, we collected 14 agronomic traits and three populations (All, Indica All and Japonica All).
For each phenotype, we first draw one phenotype distribution histogram plot.
And you can download phenotype information table for each cultivar:
If the GWAS data available, the LMM model results will be plotted ( p values <= 1e-5 and in top 20000):
The Significant Candidate Loci information can be downloaded also.
'Design Primer by Variation ID' is intended for researchers to pick PCR-primers to validate SNPs/INDELs or develop molecular markers, 'Design Primer by Region' is intended for researchers to pick PCR-primers to amplify genomic regions avoiding to overlap with known SNPs/INDELs. They all Primer3 as a backend engine.
In result, the primer positions and variation position are marked out.
Haplotype network is frequently used in population genetic analysis, you can input selected variation IDs (must be SNP, INDELs will be filtered out) and select one population classification for haplotype analysis. and users can download CSV format and SVG file for further analysis (the page link).
You can just input one variation ID or the chromosome position or convert the variation ID on this page.