[UGENE-5981] GZ: RefSeq for CLARK Created: 23/Jan/18  Updated: 31/Oct/18  Resolved: 25/Jul/18

Status: Closed
Project: UGENE
Component/s: NGS, Workflow
Affects Version/s: virogenesis
Fix Version/s: 1.31

Type: Improvement Priority: Critical
Reporter: Olga Golosova Assignee: Aleksey Tiunov [X] (Inactive)
Resolution: Fixed  
Labels: formats, usability
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Issue Links:
Blocks
is blocked by: UGENE-6010 "Multi-FASTA: RefSeq for CLARK" (Closed)
Story Points: 5
Assigned Tester: Dmitrii Sukhomlinov
Epic Link: VIROGENESIS-GZ
Sprint: DEV-30-6, DEV-30-7, DEV-30-8-RELEASE, DEV-31-2, DEV-31-3, DEV-31-4
Affect Type: Userdefined

 Description   

Support the GZ format as input for CLARK. Make sure an archived multi-FASTA can also be processed (see UGENE-6010).
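
For illustration only: one way to accept both plain and gzip-compressed multi-FASTA input is to check for the gzip magic bytes (0x1f 0x8b) and pick the reader accordingly. The Python sketch below is an assumption about how such handling could look; it is not the UGENE or CLARK implementation, and the names open_maybe_gzip and read_multi_fasta are hypothetical.

```python
import gzip
from typing import Iterator, Tuple

# First two bytes of every gzip stream (RFC 1952).
GZIP_MAGIC = b"\x1f\x8b"


def open_maybe_gzip(path):
    """Open a text file, transparently decompressing it if it is gzipped.

    Hypothetical helper: detection relies on the magic bytes, not on the
    file extension, so a mis-named archive is still handled.
    """
    with open(path, "rb") as probe:
        is_gzip = probe.read(2) == GZIP_MAGIC
    if is_gzip:
        return gzip.open(path, "rt")
    return open(path, "r")


def read_multi_fasta(path) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from a plain or gzipped multi-FASTA file."""
    header = None
    chunks = []
    with open_maybe_gzip(path) as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue  # skip blank lines between records
            if line.startswith(">"):
                # A new record starts; emit the previous one, if any.
                if header is not None:
                    yield header, "".join(chunks)
                header = line[1:]
                chunks = []
            elif header is not None:
                chunks.append(line)
    # Emit the last record in the file.
    if header is not None:
        yield header, "".join(chunks)
```

With this approach the same call, for example list(read_multi_fasta("refseq.fa.gz")), would work for archived and non-archived files alike; the file name here is a placeholder.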



 Comments   
Comment by Alexey Varlamov [ 10/Apr/18 ]

Implemented GZ support for reference data (i.e. the data used to build the database) and fixed the most prominent performance bottlenecks.
This part can be tested (a CLARK rebuild is required).

Comment by Dmitrii Sukhomlinov [ 24/Jul/18 ]

An archived sequence doesn't pass through CLARK, while the same sequence in non-archived form passes correctly.

Comment by Aleksey Tiunov [X] (Inactive) [ 25/Jul/18 ]

CLARK can process compressed files during database building. It can't classify compressed reads yet; that is tracked in a separate issue, UGENE-6041.

Generated at Tue Mar 04 03:41:38 NOVT 2025 using Jira 8.5.0#805000-sha1:facbf8be6a56ed8ab71dea158b6e159962506101.