R/preProcessing.R
validateKinaseTable.Rd
This function is designed to identify kinases and phosphatases by matching the peptide IDs. Identified kinases or phosphatases are required to accurately infer a signaling network.
validateKinaseTable(
netPhorceData = netPhorceData,
defaultKinaseTable = TRUE,
userKinaseTable = NULL,
abbrev = NULL,
species = NULL,
verbose = TRUE
)
(Required). netPhorceData calculated from processData
function.
(Required). If TRUE, this function will search the preloaded kinase/phosphatase table, which contains the kinase and/or phosphatase IDs of 25 species. Please use checkPreloadKinaseTable
function to check details of these 25 species.
(Optional). A user-provided kinase/phosphatase table, which is required to include an "ID" and "Family" column (CASE-SENSITIVE column names). The ID column includes the peptide's universal ID that matches the protein ID column from the raw MaxQuant data. The Family column will include information regarding whether the peptide is a "Kinase" or "Phosphatase" (CASE-SENSITIVE).
(Optional). A three character string of the species abbreviation from the preloaded kinases/phosphatase table. This parameter will be evaluated if the species parameter is NULL.
(Required). A string of the species name as listed in the preloaded kinases/phosphatase table. If the species and abbrev parameter are NULL, the function will assume the user is using customized kinase phosphatase data.
(Optional). If TRUE, print a summary table of matched/unmatched kinases and phosphatases.
A list of data.frames
if (FALSE) {
## Loading One Condition Data
data("oneConditionExample")
## Identify the Key Columns
identifiedCols <- confirmColumnNames(rawMaxQuant = oneConditionExample,
positionCol = "Position",
reverseCol = "Reverse",
localizationProbCol = "Localization prob",
potentialContaminationCol = "Potential contaminant",
aminoAcidCol = "Amino acid",
uniqueIDCol = "Protein",
seqWindowIDCol = "Sequence window",
fastaIDCol = "Fasta headers")
## Identify the Intensity Columns with Condition, Time Point and Replication Information
intensityCols <- confirmIntensityColumns(rawMaxQuant = oneConditionExample,
intensityPattern = "con_time_rep",
verbose = TRUE)
## Process the data based on the identified columns
netPhorceData <- processData(rawMaxQuant = oneConditionExample,
processedColNames = identifiedCols,
processedIntensity = intensityCols,
minReplication = 3,
minLocalProb = 0.75)
## Validating the Kinase/Phosphatase Information
netPhorceData <- validateKinaseTable(netPhorceData = netPhorceData,
defaultKinaseTable = TRUE,
abbrev = "Ath")
}