This is assuming you are starting from a gene symbol and don't have the sequence or accession ID yet.
Step 1: search for your gene in NCBI's databases
Start with either NCBI Nucleotide or NCBI Gene.
Search results for noncoding gene "XIST" in NCBI Gene, then clicking the "+" to expand the "RefSeq Sequences" section:
Luckily, there is only 1 reference transcript for XIST. For genes with several transcripts, either continue with all or pick some based on another criteria (e.g. which one is longest, which ones code for protein, which one is suggested as "RefSeq Select" or "canonical").
Search results for protein-coding gene "UTY" in NCBI Gene, after expanding the RefSeq Sequences section, but before clicking on any result:
Step 2: Select an NCBI accession ID to BLAST
- NR_ = RNA transcript (noncoding or partial or undescribed)
- NM_ = mRNA transcript (polyadenylated RNA transcript)
- NP_ = protein sequence
- Change the Database drop-down selection to one of the reference sequences databases.
- Reference RNA sequences (refseq_rna)
- RefSeq Genome Database (refseq_genomes)
- Human RefSeqGene sequences (RefSeq_Gene)
- Adjust the Program Selection in the BLAST options (previous page) and rerun the results.
- Try "somewhat similar sequences (blastn)".
- Add multiple accession IDs from the same gene to the Query Sequence text box.
- Get these from NCBI Gene or NCBI Nucleotides
- Get only RNA or only protein sequence accession IDs (don't mix them up)
No comments:
Post a Comment