  1. UGENE
  2. UGENE-1101

Read assembly worker: sort unsorted BAMs before using

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.12
    • Component/s: Workflow
    • Labels:
    • Affect Type:
      Userdefined

      Description

      If a BAM file is sorted, it can be opened in the Workflow Designer (WD) with the samtools-based DBI, because an index can be built only for sorted BAMs.

      If it is unsorted, it can only be imported to ugenedb, which is very slow.

      For example, timings for a 100 Mb unsorted BAM:
      1) import to ugenedb = 2 h 30 min.
      2) sort using samtools = 29 sec!!!
      3) build index for sorted one using samtools = 2 sec.
      4) import sorted one without index = 13 min 20 sec!!!
      5) import sorted one with index = 1 min 34 sec!!!
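
      As a rough check of what these numbers imply, the short sketch below adds up the sort, index, and indexed-import times and compares the total with the direct import of the unsorted file. The variable names are illustrative, and it assumes that item 5 is the final step of the sort-then-index route; the ticket does not say so explicitly.

```python
def to_seconds(hours=0, minutes=0, seconds=0):
    """Convert an h/min/sec timing into seconds."""
    return hours * 3600 + minutes * 60 + seconds

# Timings copied from the list above.
import_unsorted = to_seconds(hours=2, minutes=30)            # item 1
sort_time = to_seconds(seconds=29)                           # item 2
index_time = to_seconds(seconds=2)                           # item 3
import_sorted_no_index = to_seconds(minutes=13, seconds=20)  # item 4
import_sorted_indexed = to_seconds(minutes=1, seconds=34)    # item 5

# Assumption: the fast route is sort + index + indexed import.
pipeline_total = sort_time + index_time + import_sorted_indexed
speedup = import_unsorted / pipeline_total

print(f"direct import: {import_unsorted} s")
print(f"sort + index + import: {pipeline_total} s")
print(f"speedup: {speedup:.0f}x")
```

      Under that assumption the whole route takes about two minutes instead of two and a half hours.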

      So it is better to sort an unsorted BAM file with samtools first and then use the sorted file in WD.
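
      A minimal sketch of the sort-then-index step, run outside UGENE, might look like the following. It is not the fix implemented for this ticket. It assumes samtools 1.x is available on PATH (older 0.1.x releases use `samtools sort in.bam out_prefix` instead of `-o`), and the function names `build_commands` and `sort_and_index` are hypothetical.

```python
import shutil
import subprocess
from pathlib import Path


def build_commands(unsorted_bam, sorted_bam):
    """Return the samtools command lines for sorting, then indexing."""
    # samtools 1.x syntax: write the coordinate-sorted BAM to the -o path.
    sort_cmd = ["samtools", "sort", "-o", str(sorted_bam), str(unsorted_bam)]
    # The index can only be built for the sorted file.
    index_cmd = ["samtools", "index", str(sorted_bam)]
    return [sort_cmd, index_cmd]


def sort_and_index(unsorted_bam, sorted_bam):
    """Sort an unsorted BAM and build its index; return the sorted path."""
    if shutil.which("samtools") is None:
        raise RuntimeError("samtools was not found on PATH")
    for cmd in build_commands(unsorted_bam, sorted_bam):
        # check=True raises CalledProcessError if samtools fails.
        subprocess.run(cmd, check=True)
    return Path(sorted_bam)
```

      Calling `sort_and_index("reads.bam", "reads.sorted.bam")` (placeholder file names) would produce the sorted file plus its index, which can then be used in WD in place of the unsorted one.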

            People

            Assignee:
            ggrekhov German Grekhov
            Reporter:
            ggrekhov German Grekhov