Title: | Calculate Chromosomal Instability Metrics |
---|---|
Description: | Implement various chromosomal instability metrics. 'CINmetrics' (Chromosomal INstability metrics) provides functions to calculate various chromosomal instability metrics on masked Copy Number Variation(CNV) data at individual sample level. The chromosomal instability metrics have been implemented as described in the following studies: Baumbusch LO et al. 2013 <doi:10.1371/journal.pone.0054356>, Davidson JM et al. 2014 <doi:10.1371/journal.pone.0079079>, Chin SF et al. 2007 <doi:10.1186/gb-2007-8-10-r215>. |
Authors: | Vishal H. Oza, Roshan Darji, Brittany N. Lasseigne |
Maintainer: | Vishal H. Oza <[email protected]> |
License: | GPL-3 |
Version: | 0.1.0 |
Built: | 2024-11-14 03:28:19 UTC |
Source: | https://github.com/cran/CINmetrics |
Calculate all CINmetrics on a given dataframe
CINmetrics( cnvData, segmentMean_tai = 0.2, segmentMean_cna = (log(1.7, 2) - 1), segmentMean_base_segments = 0.2, segmentMean_break_points = 0.2, segmentMean_fga = 0.2, numProbes = NA, segmentDistance_cna = 0.2, minSegSize_cna = 10, genomeSize_fga = 2873203431 )
CINmetrics( cnvData, segmentMean_tai = 0.2, segmentMean_cna = (log(1.7, 2) - 1), segmentMean_base_segments = 0.2, segmentMean_break_points = 0.2, segmentMean_fga = 0.2, numProbes = NA, segmentDistance_cna = 0.2, minSegSize_cna = 10, genomeSize_fga = 2873203431 )
cnvData |
dataframe containing following columns: Sample, Start, End, Num_Probes, Segment_Mean |
segmentMean_tai |
numerical value for the minimum segment_mean cutoff/ threshold for Total Aberration Index calculation. Default is 0.2 |
segmentMean_cna |
numerical value for the minimum segment_mean cutoff/ threshold for Copy Number Aberration calculation. Default is 0.2 |
segmentMean_base_segments |
numerical value for the minimum segment_mean cutoff/ threshold for Base segments calculation. Default is 0.2 |
segmentMean_break_points |
numerical value for the minimum segment_mean cutoff/ threshold for Break points calculation. Default is 0.2 |
segmentMean_fga |
numerical value for the minimum segment_mean cutoff/ threshold for Fraction of genome altered calculation. Default is 0.2 |
numProbes |
Number of Probes |
segmentDistance_cna |
Segment distance threshold |
minSegSize_cna |
Minimum segment size |
genomeSize_fga |
Size of the genome derived from Affymetrix 6.0 array probe. Default is 2873203431 calculated based on hg38 **excluding sex chromosomes** |
All Chromosomal INstability metrics
CINmetrics(cnvData = maskCNV_BRCA)
CINmetrics(cnvData = maskCNV_BRCA)
Calculates the number of copy number aberrations
cna( cnvData, segmentMean = (log(1.7, 2) - 1), numProbes = NA, segmentDistance = 0.2, minSegSize = 10 )
cna( cnvData, segmentMean = (log(1.7, 2) - 1), numProbes = NA, segmentDistance = 0.2, minSegSize = 10 )
cnvData |
dataframe containing following columns: Sample, Start, End, Num_Probes, Segment_Mean |
segmentMean |
numerical value for the minimum segment_mean cutoff/ threshold. Default is 0.2 |
numProbes |
Number of Probes |
segmentDistance |
Segment distance threshold |
minSegSize |
Minimum segment size |
Copy Number Aberrations (CNA) (Davidson JM, et al), are defined as a segment with copy number outside the pre-defined range of 1.7-2.3
that is not contiguous with an adjacent independent CNA of identical copy number. For our purposes, we have adapted the range to be
, which is only slightly larger than the original.
It is nearly identical to countingBreakPoints, except this one calculates breaks as adjacent segments that have a difference in segment means of .
Number of copy number aberrations between segments
cna(cnvData = maskCNV_BRCA)
cna(cnvData = maskCNV_BRCA)
Function for counting altered base segments
countingBaseSegments(cnvData, segmentMean = 0.2, numProbes = NA)
countingBaseSegments(cnvData, segmentMean = 0.2, numProbes = NA)
cnvData |
dataframe containing following columns: Sample, Start, End, Num_Probes, Segment_Mean |
segmentMean |
numerical value for the minimum segment_mean cutoff/ threshold. Default is 0.2 |
numProbes |
Number of Probes |
The Altered Base Segment calculation takes all the CNV data for a single patient and first filters it for a segmentation mean of > 0.2 and, if specified, the minimum number of probes covering that area. Then, it calculates the sums of the lengths of each segment for a particular patient and outputs that.
Number of Base segments for each unique sample
countingBaseSegments(cnvData = maskCNV_BRCA)
countingBaseSegments(cnvData = maskCNV_BRCA)
The Break Point calculation takes all the CNV data for a single patient and first filters it for segmentation mean of > 0.2 and, if specified, the minimum number of probes covering that area. Then it counts the number of rows of data and multiplies it by 2. This represents the break points at the 5' and 3' ends of each segment.
countingBreakPoints(cnvData, segmentMean = 0.2, numProbes = NA)
countingBreakPoints(cnvData, segmentMean = 0.2, numProbes = NA)
cnvData |
dataframe containing following columns: Sample, Start, End, Num_Probes, Segment_Mean |
segmentMean |
numerical value for the minimum segment_mean cutoff/ threshold. Default is 0.2 |
numProbes |
Number of Probes |
Number of Break points for each unique sample
Fraction Genome Altered looks at the fraction of the genome that deviates from a diploid state fga calculates the fraction of the genome altered (FGA; [Chin SF, et. al.](https://www.ncbi.nlm.nih.gov/pubmed/17925008)), measured by taking the sum of the number of bases altered and dividing it by the genome length covered ($G$). Genome length covered was calculated by summing the lengths of each probe on the Affeymetrix 6.0 array. This calculation **excludes** sex chromosomes.
fga(cnvData, segmentMean = 0.2, numProbes = NA, genomeSize = 2873203431)
fga(cnvData, segmentMean = 0.2, numProbes = NA, genomeSize = 2873203431)
cnvData |
dataframe containing following columns: Sample, Start, End, Num_Probes, Segment_Mean |
segmentMean |
numerical value for the minimum segment_mean cutoff/ threshold. Default is 0.2 |
numProbes |
Number of Probes |
genomeSize |
Size of the genome derived from Affymetrix 6.0 array probe. Default is 2873203431 calculated based on hg38 **excluding sex chromosomes** |
Fraction of the genome altered
fga(cnvData = maskCNV_BRCA)
fga(cnvData = maskCNV_BRCA)
Breast Cancer Data from TCGA Data Release 25.0 GDC Product: Data Release Date: July 22, 2020 Masked Copy Number variation data for Breast Cancer for 10 unique samples selected randomly from TCGA
data(maskCNV_BRCA)
data(maskCNV_BRCA)
An object of class dataframe
https://portal.gdc.cancer.gov/
Koboldt, D., Fulton, R., McLellan, M. et al. (2012) Nature 490, 61–70 https://www.nature.com/articles/nature11412
data(maskCNV_BRCA) tai <- tai(maskCNV_BRCA)
data(maskCNV_BRCA) tai <- tai(maskCNV_BRCA)
Total Aberration Index calculation takes the sum of lengths of each segment times its segmentation mean for each sample and divides it by the sum of the lengths of each sample.
tai(cnvData, segmentMean = 0.2, numProbes = NA)
tai(cnvData, segmentMean = 0.2, numProbes = NA)
cnvData |
dataframe containing following columns: Sample, Start, End, Num_Probes, Segment_Mean |
segmentMean |
numerical value for the minimum segment_mean cutoff/ threshold. Default is 0.2 |
numProbes |
Number of Probes |
The Total Aberration Index (TAI) (Baumbusch LO, et. al.) is “a measure of the abundance of genomic size of copy number changes in a tumour". It is defined as a weighted sum of the segment means
Average of lengths weighted by segmentation mean for each unique sample
tai(cnvData = maskCNV_BRCA)
tai(cnvData = maskCNV_BRCA)
Modified Total Aberration Index calculation takes the sum of lengths of each segment times its segmentation mean for each sample and divides it by the sum of the lengths of each sample.
taiModified(cnvData, segmentMean = 0, numProbes = NA)
taiModified(cnvData, segmentMean = 0, numProbes = NA)
cnvData |
dataframe containing following columns: Sample, Start, End, Num_Probes, Segment_Mean |
segmentMean |
numerical value for the minimum segment_mean cutoff/ threshold. Default is 0.2 |
numProbes |
Number of Probes |
Modified Total Aberration Index uses all sample values instead of those in aberrant copy number state, thus does not remove the directionality from the score.
Average of lengths weighted by segmentation mean for each unique sample
taiModified(cnvData = maskCNV_BRCA)
taiModified(cnvData = maskCNV_BRCA)