NAME

CracTools::SAMReader::SAMline - The object for manipulation a SAM line.

VERSION

version 1.25

SYNOPSIS

use CracTools::SAMReader::SAMline;

$sam_line = CracTools::SAMReader::SAMline->new($line);

DESCRIPTION

An object for easy acces to SAM line fields. See SAM Specifications for more informations : http://samtools.sourceforge.net/SAM1.pdf

Variables

%flags

SAM flags :

  • MULTIPLE_SEGMENTS => 1

  • PROPERLY_ALIGNED => 2

  • UNMAPPED => 4,

  • NEXT_UNMAPPED => 8,

  • REVERSE_COMPLEMENTED => 16,

  • NEXT_REVERSE_COMPLEMENTED => 32,

  • FIRST_SEGMENT => 64,

  • LAST_SEGMENT => 128,

  • SECONDARY_ALIGNMENT => 256,

  • QUALITY_CONTROLS_FAILED => 512,

  • PCR_DUPLICATED => 1024,

  • CHIMERIC_ALIGNMENT => 2048,

STATIC PARSING METHODS

These methods can be used without creating an CracTools::SAMReader::SAMline object. They are designed to provided efficient performance when parsing huge SAM files, because creating object in Perl can be long and useless for some purposes.

hasEvent

Arg [1] : String - SAM line
Arg [2] : eventType

Methods

new

Arg [1] : String - SAM line in TAB-separated format.

Example     : $sam_line = CracTools::SAMline->new$($line);
Description : Create a new CracTools::SAMline obect.
ReturnType  : CracTools::SAMline
Exceptions  : none

isFlagged

Arg [1] : Integer - The flag to test (1,2,4,8, ... ,1024)

Example     : if($SAMline->isFlagged($fags{unmapped}) {
                DO_SOMETHING... 
              };
Description : Test if the line has the flag in parameter setted.
ReturnType  : Boolean
Exceptions  : none

getStrand

Example     : $strand = $SAMline->getStrand(); 
Description : Return the strand of the SAMline :
              - "1" if forward strand
              - "-1" if reverse strand
ReturnType  : 1 or -1
Exceptions  : none

getOriginalSeq

Descrition   : Return the original sequence as it was in the FASTQ file.
               In fact we reverse complemente the sequence if flag 16 is raised.

getLocAsCracFormat

Example     : $loc = $SAMline->getLocAsCracFormat(); 
Description : Return the location of the sequence using CRAC format : "chr|strand,position".
              For example : X|-1,2154520
ReturnType  : String
Exceptions  : none

getPatch

Description : If the SAMline has been modified, this method will generate
              a patch in UnifiedDiff format that represent the changes.
ReturnType  : String (patch) if line has changed, False (0) either.
Exceptions  : none

GETTERS AND SETTERS

line

Description : Getter for the whole SAMline as a string.
ReturnType  : String
Exceptions  : none

updatedLine

Description : Getter/Setter for the updated line.
              If there is not updated line, this method return
              the original SAM line.
RetrunType  : String

qname

Description : Getter/Setter for attribute qname
ReturnType  : String
Exceptions  : none

flag

Description : Getter/Setter for attribute flag
ReturnType  : String
Exceptions  : none

rname

Description : Getter/Setter for attribute rname (chromosome for eucaryotes)
ReturnType  : String
Exceptions  : none

chr

Description : Getter/Setter for attribute rname (Alias)
ReturnType  : String
Exceptions  : none

pos

Description : Getter/Setter for attribute pos (position of the sequence)
ReturnType  : String
Exceptions  : none

mapq

Description : Getter/Setter for attribute mapq (mapping quality)
ReturnType  : String
Exceptions  : none

cigar

Description : Getter/Setter for attribute cigar (see SAM doc)
ReturnType  : String
Exceptions  : none

rnext

Description : Getter/Setter for attribute rnext (see SAM doc)
ReturnType  : String
Exceptions  : none

pnext

Description : Getter/Setter for attribute pnext (see SAM doc)
ReturnType  : Integer
Exceptions  : none

tlen

Description : Getter/Setter for attribute tlen (sequence length)
ReturnType  : Integer
Exceptions  : none

seq

Description : Getter/Setter for attribute seq (the sequence).
              Please use getOriginalSeq if you want to retrieve the oriented
              sequence, that what you need in most cases.
ReturnType  : String
Exceptions  : none

qual

Description : Getter/Setter for attribute qual (sequence quality)
ReturnType  : String
Exceptions  : none

getOptionalField

Example     : 
Description : 
ReturnType  : 

getChimericAlignments

Description : Parser of SA fields of SAM file in order to find chimeric reads
ReturnType  : Array reference
              Elements are hash [ chr    => String, 
                                  pos    => int, 
                                  strand => 1/-1, 
                                  cigar  => String,
                                  mapq   => int,
                                  edist  => int
                                ]

getCigarOperatorsCount

Example     : my %cigar_counts = %{ $sam_line->getCigarOperatorsCount() };
              print "nb mismatches; ",$cigar_counts{X},"\n";
Description : Return a hash reference where the keys are the cigar operators and the values
              the sum of length associated for each operator.
              For cigar 5S3M1X2M10S, getCigarOperatorsCounts() will retrun :
              { 'S' => 15,
                'M' => 5,
                'X' => 1,
              };
ReturnType  : Hash reference 

pSupport

Description : Return the support profile of the read if the SAM file has been generated with
              CRAC option --detailed
ReturnType  : String

pLoc

Description : Return the location profile of the read if the SAM file has been generated with
              CRAC option --detailed
ReturnType  : String

pairedChimera

Description : return the chimeric coordinates of the paired chimera associated to this read if there is one

ReturnType  : array(chr1,pos1,strand1,chr2,pos2,strand2) or undef

isPairedClassified

Arg [1] : String - The class to test :
          - "unique"
          - "duplicated"
          - "multiple"

Description : Test paired-end read clasification

ReturnType  : Boolean

genericInfo

[1] : Key of the generic info
[2] : (Optional) Value of the generic info

Description : Getter/Setter enable to store additional (generic) information 
              about the SAMline as a Key/Value. 
Example : # Set a generic info
          $read->genericInfo("foo","bar")

          # Get a generic info
          print $read->genericInfo("foo"); # this will print "bar"
ReturnType : ?
Exceptions : none

isClassified

Arg [1] : String - The class to test :
          - "unique"
          - "duplicated"
          - "multiple"
          - "normal"
          - "almostNormal"

Example     : if($sam_line->isClassified('normal')) {
                DO_SOMETHING;
              }
Description : Test if the line is classified according to the parameter value.
ReturnType  : Boolean
Exceptions  : none

events

Arg [1] : String - The event type to return :
          - Junction
          - Ins
          - Del
          - SNP
          - Error
          - Chimera
          - Undetermined
          - BioUndetermined
          - ... (see CRAC SAM format specifications for more informations).
Example     : my @junctions = @{$line->events('Junction')};
              foreach my $junction (@junctions) {
                print "Foud Junction : [type : $junction->{type}, loc : $junction->{loc}, gap : $junction->{gap}]\n";
              } 
Description : Return all events of the type specified in parameter
ReturnType  : Array reference
Exceptions  : none

PRIVATE METHODS

loadEvents

Example     : $sam_line->loadEvents();
Description : Loading of events attributes
ReturnType  : none
Exceptions  : none

addEvent

Arg [1] : String - The event type
Arg [2] : Hash reference - The event object
Example     : $line->addEvent($event_type,\%event); 
Description : Return all events of the type specified in parameter
ReturnType  : none
Exceptions  : none

removeEvent

Arg [1] : Hash reference - The event object

Description : Remove the event from the event hash and from the line.

updateEvent

loadSamDetailed

Example     : $sam_line->loadSamDetailed();
Description : Loading of sam detaileds attributes
ReturnType  : none
Exceptions  : none

loadPaired

Example     : $sam_line->loadPaired();
Description : Loading of sam detaileds attributes
ReturnType  : none
Exceptions  : none

expandCracLoc

Arg [1] : String - Localisation in crac format : Chromosome|strand,position
          Ex : X|-1,2332377

Description : Extract Chromosme, position and strand as separated variable from
              the localisation in CRAC format.
ReturnType  : Array($chromosome,$position,$strand)

compressCracLoc

Arg [1] : String - Chromosome
Arg [2] : Integer - Postition
Arg [3] : Integer (1,-1) - Strand

Description : Reverse function of "expandCracLoc"
ReturnType  : String (localisation in CRAC format)

AUTHORS

  • Nicolas PHILIPPE <nphilippe.research@gmail.com>

  • Jérôme AUDOUX <jaudoux@cpan.org>

  • Sacha BEAUMEUNIER <sacha.beaumeunier@gmail.com>

COPYRIGHT AND LICENSE

This software is Copyright (c) 2017 by IRMB/INSERM (Institute for Regenerative Medecine and Biotherapy / Institut National de la Santé et de la Recherche Médicale) and AxLR/SATT (Lanquedoc Roussilon / Societe d'Acceleration de Transfert de Technologie).

This is free software, licensed under:

The GNU Affero General Public License, Version 3, November 2007