gatb.core-API-0.0.0
Interface for kmers distribution management.
#include <IHistogram.hpp>
Public Member Functions
virtual ~IHistogram ()
virtual size_t getLength ()=0
virtual size_t getLength2 ()=0
virtual void inc (u_int16_t index)=0
virtual void inc2D (u_int16_t index1, u_int16_t index2)=0
virtual void save (tools::storage::impl::Group &group)=0
virtual void compute_threshold (int min_auto_threshold)=0
virtual u_int16_t get_solid_cutoff ()=0
virtual u_int64_t get_nbsolids_auto ()=0
virtual float get_ratio_weak ()=0
virtual u_int16_t get_first_peak ()=0
virtual u_int64_t & get (u_int16_t idx)=0
virtual u_int64_t & get2D (u_int16_t idx1, u_int16_t idx2)=0
Public Member Functions inherited from ISmartPointer
virtual ~ISmartPointer ()
virtual void use ()=0
virtual void forget ()=0
Interface for kmers distribution management.
This interface describes the function y(x), where x is the number of occurrences of a kmer and y is the number of kmers occurring x times.
It is often interesting to have a graphical display of this kind of distribution; for instance, it may give an estimation of the coverage of NGS data.
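As an illustration of what y(x) holds, here is a minimal sketch that builds the distribution from a list of per-kmer occurrence counts. The function name `buildHistogram`, the input vector, and the choice to clamp large counts into the last bin are assumptions made for this example; they are not taken from the gatb.core implementation.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Build y(x): y[x] is the number of distinct kmers occurring exactly x times.
// 'abundances' holds one occurrence count per distinct kmer (hypothetical input).
// Counts above 'length' are clamped into the last bin (an assumption of this sketch).
std::vector<std::uint64_t> buildHistogram(const std::vector<std::uint16_t>& abundances,
                                          std::size_t length)
{
    assert(length > 0);
    std::vector<std::uint64_t> y(length + 1, 0);
    for (std::uint16_t x : abundances) {
        std::size_t bin = std::min<std::size_t>(x, length);
        ++y[bin];
    }
    return y;
}
```

With the counts {1, 1, 2, 5, 9} and a length of 5, bin 1 receives two kmers and the last bin receives the kmers counted 5 and 9 times.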
We can also find x0 at the first minimum of y(x): kmers with x<x0 are likely to come from sequencing errors. The first maximum at x1 (x1>x0) is also interesting because it provides an estimate of the read coverage.
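The search for x0 and x1 can be sketched as a naive linear scan over the distribution. This is only an illustration of the idea: the struct `Peaks` and the function `findFirstMinAndMax` are hypothetical names, and the implementations behind compute_threshold may well use a different, more robust procedure (for example smoothing noisy data first).

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Result of the scan: x0 = first minimum, x1 = first maximum after x0.
struct Peaks {
    std::size_t x0;
    std::size_t x1;
};

// Naive scan over y(x), starting at x = 1 because a kmer occurs at least once.
// y must contain at least bins 0 and 1.
Peaks findFirstMinAndMax(const std::vector<std::uint64_t>& y)
{
    assert(y.size() >= 2);
    std::size_t x = 1;
    // Walk down while the curve is strictly decreasing: we stop at the first minimum.
    while (x + 1 < y.size() && y[x + 1] < y[x]) { ++x; }
    std::size_t x0 = x;
    // Walk up while the curve is non-decreasing: we stop at the first maximum.
    while (x + 1 < y.size() && y[x + 1] >= y[x]) { ++x; }
    return Peaks{x0, x};
}
```

For the distribution {0, 100, 40, 10, 5, 8, 20, 35, 30, 12}, the scan stops descending at x0 = 4 and stops climbing at x1 = 7.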
This interface is mainly used by the SortingCountAlgorithm.
Here is a command line for showing the histogram with gnuplot from the hdf5 file 'graph.h5'
For the sum of the distribution, you can use:
virtual ~IHistogram () [inline, virtual]
Destructor.
virtual void compute_threshold (int min_auto_threshold) [pure virtual]
Compute the first minimum at x0 and the first maximum at x1 (x1>x0).
Implemented in HistogramCache, HistogramNull, and Histogram.
virtual u_int64_t & get (u_int16_t idx) [pure virtual]
Retrieve the value for x.
Parameters:
[in] idx : x value.
Implemented in HistogramCache, HistogramNull, and Histogram.
virtual u_int64_t & get2D (u_int16_t idx1, u_int16_t idx2) [pure virtual]
Retrieve the value for x and y of the 2D histogram.
Parameters:
[in] idx1 : x value.
[in] idx2 : y value.
Implemented in HistogramCache, HistogramNull, and Histogram.
virtual u_int16_t get_first_peak () [pure virtual]
Get the x1 value at the first maximum after x0.
Implemented in HistogramCache, HistogramNull, and Histogram.
virtual u_int64_t get_nbsolids_auto () [pure virtual]
Get the number of kmers for x>x0, i.e. the solid kmers for the threshold x0.
Implemented in HistogramCache, HistogramNull, and Histogram.
virtual float get_ratio_weak () [pure virtual]
Get the ratio of weak kmers in the total volume.
Implemented in HistogramCache, HistogramNull, and Histogram.
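To make the two quantities above concrete, here is a hedged sketch that derives them from a distribution y(x) and a cutoff x0. It assumes that "volume" means the number of kmer occurrences, x * y(x), and that weak kmers are those with x <= x0; both points are interpretations of the descriptions above, not confirmed details of the library. The names `SolidStats` and `computeSolidStats` are hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

struct SolidStats {
    std::uint64_t nbSolids;  // number of distinct kmers with x > x0
    double ratioWeak;        // weak volume divided by total volume
};

// Assumption: the volume of bin x is x * y[x], i.e. the kmer occurrences in that bin.
SolidStats computeSolidStats(const std::vector<std::uint64_t>& y, std::size_t x0)
{
    assert(x0 < y.size());
    std::uint64_t nbSolids = 0;
    std::uint64_t volWeak = 0;
    std::uint64_t volTotal = 0;
    for (std::size_t x = 1; x < y.size(); ++x) {
        std::uint64_t volume = static_cast<std::uint64_t>(x) * y[x];
        volTotal += volume;
        if (x > x0) {
            nbSolids += y[x];   // solid kmer: seen more than x0 times
        } else {
            volWeak += volume;  // weak kmer: likely a sequencing error
        }
    }
    double ratio = (volTotal == 0)
        ? 0.0
        : static_cast<double>(volWeak) / static_cast<double>(volTotal);
    return SolidStats{nbSolids, ratio};
}
```

For y = {0, 10, 2, 0, 3, 5} and x0 = 2, the sketch counts 8 solid kmers and a weak volume of 14 out of a total volume of 51.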
virtual u_int16_t get_solid_cutoff () [pure virtual]
Get the solid cutoff, i.e. the x0 at the first minimum.
Implemented in HistogramCache, HistogramNull, and Histogram.
virtual size_t getLength () [pure virtual]
Return the maximum allowed for X.
Implemented in HistogramCache, HistogramNull, and Histogram.
virtual size_t getLength2 () [pure virtual]
Return the maximum allowed for Y in case of 2D histogram.
Implemented in HistogramCache, HistogramNull, and Histogram.
virtual void inc (u_int16_t index) [pure virtual]
Increase the number of kmers occurring X times.
Parameters:
[in] index : the X value.
Implemented in HistogramCache, HistogramNull, and Histogram.
virtual void inc2D (u_int16_t index1, u_int16_t index2) [pure virtual]
Increase the number of kmers occurring X times in the genome and Y times in the reads.
Parameters:
[in] index1 : the X value.
[in] index2 : the Y value.
Implemented in HistogramCache, HistogramNull, and Histogram.
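The 2D accessors can be pictured with a small stand-alone class. This is a sketch under assumptions, not the layout used by Histogram or HistogramCache: the class name `Histogram2DSketch`, the row-major flat storage, and the bounds check via `assert` are all choices made for the example. It mirrors the interface in that get2D returns a reference, so the same cell can be read or updated in place.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical 2D histogram: cell (x, y) counts kmers occurring
// x times in the genome and y times in the reads.
class Histogram2DSketch {
public:
    Histogram2DSketch(std::size_t lengthX, std::size_t lengthY)
        : lenX_(lengthX), lenY_(lengthY),
          cells_((lengthX + 1) * (lengthY + 1), 0) {}

    std::size_t getLength() const { return lenX_; }   // maximum allowed X
    std::size_t getLength2() const { return lenY_; }  // maximum allowed Y

    // Return a reference to the cell so callers can read or modify it.
    std::uint64_t& get2D(std::uint16_t x, std::uint16_t y)
    {
        assert(x <= lenX_ && y <= lenY_);
        // Row-major layout: one row of (lenY_ + 1) cells per x value.
        return cells_[static_cast<std::size_t>(x) * (lenY_ + 1) + y];
    }

    // Count one more kmer for the pair (x, y).
    void inc2D(std::uint16_t x, std::uint16_t y) { ++get2D(x, y); }

private:
    std::size_t lenX_;
    std::size_t lenY_;
    std::vector<std::uint64_t> cells_;
};
```

Calling `inc2D(1, 2)` twice on a fresh instance leaves `get2D(1, 2)` at 2 and every other cell at 0.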
virtual void save (tools::storage::impl::Group &group) [pure virtual]
Save the distribution. It is saved into the bag provided at construction.
Implemented in HistogramCache, HistogramNull, and Histogram.