ensembl-hive  2.8.1
XrefParser::RGDParser Class Reference

Public Member Functions

public run ()
 
public List sort_refseq_accessions ()
 
public Int process_synonyms ()
 

Detailed Description

Description

Designed to parse the Rat Genome Database (RGD) download file, historically hosted at
ftp://ftp.rgd.mcw.edu/pub/data_release/GENES_RAT.txt . The file is tab-separated and
has more than 40 columns.

It contains RGD IDs (which are numeric), and associates them either with Ensembl genes or
RefSeq records (mainly transcripts).

Definition at line 14 of file RGDParser.pm.

Member Function Documentation

◆ process_synonyms()

public Int XrefParser::RGDParser::process_synonyms ( )
Arg [1]     : Xref dbID to attach synonyms to
Arg [2]     : Synonym string as read from file
Description : Split the synonym column into its individual items (potentially
              many) and add them to the synonym table. Synonyms are separated by ';'.
Returntype  : Int - the count of synonyms added
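
The splitting step described above can be sketched roughly as follows. This is a minimal, standalone illustration and not the parser's actual code: the name process_synonyms_sketch, the %synonym_table hash standing in for the synonym table, and the whitespace trimming are all assumptions made for the example.

```perl
use strict;
use warnings;

# Hypothetical stand-in for the synonym table: xref dbID => list of synonyms.
my %synonym_table;

# Hypothetical helper, not the real method: split a ';'-separated string
# and record each non-empty item against the given xref dbID.
sub process_synonyms_sketch {
    my ($xref_id, $synonym_string) = @_;
    return 0 unless defined $synonym_string && length $synonym_string;

    my $added = 0;
    for my $synonym (split /;/, $synonym_string) {
        $synonym =~ s/^\s+|\s+$//g;     # assumption: trim surrounding whitespace
        next unless length $synonym;    # ignore empty items such as in "a;;b"
        push @{ $synonym_table{$xref_id} }, $synonym;
        $added++;
    }
    return $added;    # the count of synonyms added
}

my $count = process_synonyms_sketch(42, 'Abc1a;Abc1b; Old-Abc1');
print "Added $count synonyms\n";
```

Returning the count lets the caller keep a running total of synonyms added across all records.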
 

◆ run()

public XrefParser::RGDParser::run ( )
Description: Triggers the parsing of the RGD file specified in the files parameter.
             It uses Text::CSV to consume the source file.
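
As an illustration of consuming a tab-separated file with Text::CSV, here is a minimal sketch. It is not the body of run(): the sample data, the read_records helper, the column names, and the assumption that the file starts with '#' comment lines followed by a header line are all hypothetical.

```perl
use strict;
use warnings;
use Text::CSV;

# Hypothetical sample: '#' comment lines, a header line, then tab-separated records.
my $sample = join "\n",
    '# RGD gene report (illustrative data only)',
    join("\t", qw(GENE_RGD_ID SYMBOL ENSEMBL_ID)),
    join("\t", '1234', 'Abc1', 'ENSRNOG00000000001'),
    join("\t", '5678', 'Xyz2', ''),
    '';

# Hypothetical helper: return one hashref per record, keyed by column name.
sub read_records {
    my ($text) = @_;
    open my $fh, '<', \$text or die "Cannot open in-memory file: $!";

    my $csv = Text::CSV->new({ sep_char => "\t", binary => 1, auto_diag => 1 });

    # Skip leading comment lines; the first non-comment line is taken as the header.
    my $header;
    while (defined(my $line = <$fh>)) {
        next if $line =~ /^#/;
        chomp $line;
        $header = $line;
        last;
    }
    die "No header line found" unless defined $header;
    $csv->parse($header) or die "Cannot parse header line";
    $csv->column_names($csv->fields);

    # Read the remaining lines as hashes keyed by the header names.
    my @records;
    while (my $row = $csv->getline_hr($fh)) {
        push @records, $row;
    }
    close $fh;
    return @records;
}

my @records = read_records($sample);
printf "%s\t%s\n", $_->{GENE_RGD_ID}, $_->{SYMBOL} for @records;
```

Keying each row by column name keeps the code readable when a file has dozens of columns, since no positional indexes are needed.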
 

◆ sort_refseq_accessions()

public List XrefParser::RGDParser::sort_refseq_accessions ( )
Arg [1..n]  : Original list of accessions
Description : Filter out any accessions that are not in the "normal" set of
              genomic features, and sort the rest. The column in question contains
              EMBL accessions as well as other things, and we cannot make
              Xrefs to all sources.
Returntype  : List of sorted and filtered accessions
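
A filter-then-sort step of this kind might look like the sketch below. The documentation does not say which accessions count as "normal" or how they are ordered, so the RefSeq-style prefix pattern (NM_, NR_, NP_, XM_, XR_, XP_) and the plain string sort are assumptions, and sort_refseq_accessions_sketch is a hypothetical name.

```perl
use strict;
use warnings;

# Hypothetical helper, not the real method: keep only accessions with a
# RefSeq-style prefix (assumed pattern) and return them in string order.
sub sort_refseq_accessions_sketch {
    my (@accessions) = @_;

    # Assumption: "normal" means NM_, NR_, NP_, XM_, XR_ or XP_ accessions.
    my @kept = grep { defined($_) && /^[NX][MRP]_\d+/ } @accessions;

    # Assumption: plain string comparison is an acceptable ordering.
    my @sorted = sort { $a cmp $b } @kept;
    return @sorted;
}

my @input  = ('AB123456', 'XM_002', 'NM_001', 'BC000001', 'NR_010');
my @result = sort_refseq_accessions_sketch(@input);
print join(', ', @result), "\n";
```

Accessions that fail the prefix check, such as the EMBL-style entries in the sample input, are dropped rather than reported.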
 

The documentation for this class was generated from the following file:
RGDParser.pm