  1. UGENE
  2. UGENE-6178

Include sample name into the TopHat result files

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 1.30
    • Fix Version/s: 1.31
    • Component/s: NGS, Workflow
    • Labels:
    • Environment: Linux, Mac OS X
    • Story Points: 2
    • Epic Link:
    • Sprint: DEV-31-4, DEV-31-5, DEV-31-6-RELEASE
    • Affect Type: Userdefined

      Description

      TopHat writes, among others, the following output files to the folder specified in the "Output folder" parameter of the element:

      • accepted_hits.bam
      • junctions.bed
      • insertions.bed
      • deletions.bed

      In 1.31, automatically rename these output files as follows (this may be improved further in the future):

      • samplename.bam
      • samplename_junctions.bed
      • samplename_insertions.bed
      • samplename_deletions.bed

      Here "samplename" is generated automatically from the input file names. Both single-end (SE) and paired-end (PE) read inputs must be handled.
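
      The renaming described above could be sketched roughly as follows. This is only an illustration in Python, not the UGENE implementation: the issue does not specify how "samplename" is derived, so the rule used here (file base name for SE reads, common prefix of the two mate file names for PE reads) is an assumption, as are the function names and the list of read-file extensions.

```python
import os
import re
from pathlib import Path

# Default TopHat output names from this issue, mapped to the suffix
# that is appended to the sample name.
RENAMES = {
    "accepted_hits.bam": ".bam",
    "junctions.bed": "_junctions.bed",
    "insertions.bed": "_insertions.bed",
    "deletions.bed": "_deletions.bed",
}

# Assumed read-file extensions (not taken from the issue).
READ_EXTENSIONS = (".fastq.gz", ".fq.gz", ".fastq", ".fq", ".fasta", ".fa")


def base_name(path):
    """Return the file name without its folder and read-file extension."""
    name = os.path.basename(path)
    for ext in READ_EXTENSIONS:
        if name.lower().endswith(ext):
            return name[: -len(ext)]
    return name


def sample_name(reads, mates=None):
    """Hypothetical rule: SE uses the base name, PE uses the common prefix."""
    first = base_name(reads)
    if mates is None:
        return first
    prefix = os.path.commonprefix([first, base_name(mates)])
    # Drop a trailing separator and an optional "R" left over from
    # mate markers such as "_R1"/"_R2" or "_1"/"_2".
    prefix = re.sub(r"[._-]+[Rr]?$", "", prefix)
    # Fall back to the first file's name if the mates share no prefix.
    return prefix or first


def rename_outputs(output_dir, sample):
    """Rename the TopHat outputs that exist in output_dir; return new names."""
    out = Path(output_dir)
    renamed = []
    for old, suffix in RENAMES.items():
        src = out / old
        if src.exists():
            dst = out / (sample + suffix)
            src.rename(dst)
            renamed.append(dst.name)
    return renamed
```

      With this rule, the PE inputs liver_R1.fastq.gz and liver_R2.fastq.gz would give the sample name "liver", so accepted_hits.bam would become liver.bam. A real implementation would also need to decide what to do when two samples resolve to the same name.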

      This issue is important for the usability of the "RNA-Seq analysis with TopHat and StringTie" workflow: the report generated by "StringTie Gene Abundance Report" uses input file names to distinguish between samples.

            People

            Assignee:
            atiunov Aleksey Tiunov [X] (Inactive)
            Reporter:
            oigl Olga Golosova
            Assigned Tester:
            Kirill Rasputin
            Watchers:
            1
