[UGENE-6591] Parsing of LOCUS line in GenBank should be token-based Created: 20/Sep/19  Updated: 23/Jun/21  Resolved: 23/Jun/21

Status: Closed
Project: UGENE
Component/s: Basic-Nucl, Basic-Phy
Affects Version/s: 1.32
Fix Version/s: None

Type: Bug Priority: Trivial
Reporter: Olga Golosova Assignee: Unassigned
Resolution: Fixed  
Labels: formats
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Attachments: Text File gbrel.txt    
Affect Type: Userdefined

 Description   

Currently, data specified in the LOCUS line must start from position 13 in the UGENE implementation of the GenBank format. However, it is not essential according to the GenBank format specification (ftp://ftp.ncbi.nih.gov/genbank/gbrel.txt):

3.4.4 LOCUS Format
3.4.4.1 : Important notice about parsing the LOCUS line
Users who process the data elements of the LOCUS line should use a
token-based parsing approach rather than parsing its content based on
fixed column positions.

When a file is read, implement a token-based, not positional, parting of the LOCUS line. On writing a GenBank file keep the historical indentation, if possible.


Generated at Fri Mar 29 13:41:19 NOVT 2024 using Jira 8.5.0#805000-sha1:facbf8be6a56ed8ab71dea158b6e159962506101.