Prediction Results
Class prediction with nominal gene set selected with the random variance t-test (P=0.001):
Number of classes: 2 (SSc vs normal)
Based on 5000 random permutations,
the compound covariate predictor has p-value of 0.002
the 1-nearest neighbor classifier has p-value of < 2e-04
the 3-nearest neighbors classifier has p-value of < 2e-04
the nearest centroid classifier has p-value of < 2e-04
the support vector machines classifier has p-value of < 2e-04
the linear discriminant analysis classifier has p-value of 0.003
Note: t-values used for the compound covariate predictor were truncated at abs(t)=10 level.
Performance of classifiers during cross-validation:
| Pair ID | Number of genes in classifier | Compound Covariate Predictor Correct? | Diagonal Linear Discriminant Analysis Correct? | 1-Nearest Neighbor Correct? | 3-Nearest Neighbors Correct? | Nearest Centroid Correct? | Support Vector Machines Correct? | |
|---|---|---|---|---|---|---|---|---|
| 1 | 1 | 22 | YES | YES | YES | YES | YES | YES |
| 2 | 10 | 18 | YES | YES | YES | YES | YES | YES |
| 3 | 11 | 23 | YES | YES | YES | YES | YES | YES |
| 4 | 12 | 23 | YES | YES | YES | YES | YES | YES |
| 5 | 13 | 23 | YES | YES | YES | YES | YES | YES |
| 6 | 14 | 15 | YES | YES | YES | YES | YES | YES |
| 7 | 15 | 19 | YES | YES | YES | YES | YES | YES |
| 8 | 16 | 25 | YES | YES | YES | YES | YES | YES |
| 9 | 17 | 22 | YES | YES | YES | YES | YES | YES |
| 10 | 18 | 24 | YES | YES | YES | YES | YES | YES |
| 11 | 19 | 19 | YES | YES | YES | YES | YES | YES |
| 12 | 2 | 19 | YES | YES | YES | YES | YES | YES |
| 13 | 20 | 25 | YES | YES | YES | YES | YES | YES |
| 14 | 21 | 29 | YES | YES | YES | YES | YES | YES |
| 15 | 22 | 22 | YES | YES | YES | YES | YES | YES |
| 16 | 23 | 19 | YES | YES | YES | YES | YES | YES |
| 17 | 24 | 25 | YES | YES | YES | YES | YES | YES |
| 18 | 25 | 23 | YES | YES | YES | YES | YES | YES |
| 19 | 3 | 24 | YES | YES | YES | YES | YES | YES |
| 20 | 4 | 27 | NO | NO | YES | YES | YES | YES |
| 21 | 5 | 18 | YES | YES | YES | YES | YES | YES |
| 22 | 6 | 18 | YES | YES | YES | YES | YES | YES |
| 23 | 7 | 17 | YES | YES | YES | YES | YES | YES |
| 24 | 8 | 18 | YES | YES | YES | YES | YES | YES |
| 25 | 9 | 28 | YES | YES | YES | YES | YES | YES |
| Percent correctly classified: | 96 | 96 | 100 | 100 | 100 | 100 |
Composition of classifier (26 genes significant at the 1e-04 level):
Table – Sorted by t -value:
| t-value | Parametric p-value | % CV support | Geometric mean of ratios (class Disease /class Normal) | Qiagen oligo ID | Description | GB acc | UG cluster | Gene symbol | |
|---|---|---|---|---|---|---|---|---|---|
| 1 | -6.61 | p < 0.000001 | 100 | 0.621 | H003528_01 | Decay accelerating factor for complement (CD55, Cromer blood group system) | M30142 | 1369 | DAF |
| 2 | -5.92 | 1e-06 | 100 | 0.548 | H003827_01 | Serum/glucocorticoid regulated kinase |
AJ000512 | 296323 | SGK |
| 3 | -5.91 | 2e-06 | 100 | 0.548 | H016371_01 | Hypothetical protein FLJ21212 | NM_024642 | 47099 | FLJ21212 |
| 4 | -5.48 | 5e-06 | 100 | 0.584 | H009347_01 | Neuronal cell adhesion molecule | AB002341 | 7912 | NRCAM |
| 5 | -4.86 | 3.1e-05 | 100 | 0.811 | H005438_01 | Homo sapiens mRNA; cDNA DKFZp434B1620 (from clone DKFZp434B1620) |
AL137548 | 43112 | |
| 6 | -4.86 | 3.1e-05 | 100 | 0.728 | H004078_01 | Heme-binding protein | NM_015987 | 108675 | HEBP |
| 7 | -4.85 | 3.2e-05 | 100 | 0.465 | H001509_01 | Aldo-keto reductase family 1, member C3 (3-alpha hydroxysteroid dehydrogenase, type II) |
D17793 | 78183 | AKR1C3 |
| 8 | -4.8 | 3.8e-05 | 100 | 0.708 | H002860_01 | Inositol polyphosphate-1-phosphatase |
L08488 | 32309 | INPP1 |
| 9 | -4.79 | 3.8e-05 | 100 | 0.62 | H010655_01 | Hypothetical protein FLJ20546 | AK000953 | 279896 | FLJ20546 |
| 10 | -4.78 | 3.9e-05 | 100 | 0.768 | H002574_01 | Alcohol dehydrogenase 5 (class III), chi polypeptide |
M81118 | 78989 | ADH5 |
| 11 | -4.78 | 3.9e-05 | 100 | 0.517 | H003858_01 | Aldo-keto reductase family 1, member C2 (dihydrodiol dehydrogenase 2; bile acid binding protein; 3-a |
U05598 | 201967 | AKR1C2 |
| 12 | -4.72 | 4.6e-05 | 76 | 0.751 | H004767_01 | Tetraspan 3 | AK001326 | 100090 | TSPAN-3 |
| 13 | -4.53 | 8e-05 | 24 | 0.771 | H016494_01 | Hypothetical protein FLJ12436 | NM_024661 | 69485 | FLJ12436 |
| 14 | -4.52 | 8.4e-05 | 36 | 0.721 | H003965_01 | Cellular repressor of E1A-stimulated genes |
AF084523 | 5710 | CREG |
| 15 | -4.51 | 8.4e-05 | 32 | 0.724 | H002462_01 | Receptor tyrosine kinase-like orphan receptor 1 |
M97675 | 274243 | ROR1 |
| 16 | -4.47 | 9.6e-05 | 32 | 0.689 | H000959_01 | Glycophorin C (Gerbich blood group) |
NM_002101 | 81994 | GYPC |
| 17 | -4.46 | 9.8e-05 | 36 | 0.742 | H008089_01 | KIAA0469 gene product | AB007938 | 7764 | KIAA0469 |
| 18 | 4.53 | 8.1e-05 | 36 | 1.689 | H016341_01 | Platelet derived growth factor C | NM_016205 | 43080 | PDGFC |
| 19 | 4.53 | 8.1e-05 | 40 | 1.757 | H002994_01 | Collagen, type XVIII, alpha 1 | AF018081 | 78409 | COL18A1 |
| 20 | 4.55 | 7.7e-05 | 36 | 1.314 | H003383_01 | Ras-related C3 botulinum toxin substrate 2 (rho family, small GTP binding protein Rac2) |
Z82188 | 173466 | RAC2 |
| 21 | 4.6 | 6.6e-05 | 36 | 1.249 | H003776_01 | Aldehyde dehydrogenase 2 family (mitochondrial) |
X05409 | 195432 | ALDH2 |
| 22 | 4.64 | 5.9e-05 | 40 | 1.352 | H000498_01 | Desmoplakin (DPI, DPII) | AL031058 | 74316 | DSP |
| 23 | 4.69 | 5.1e-05 | 60 | 1.251 | H011854_01 | Heterogeneous nuclear ribonucleoprotein C (C1/C2) |
M16342 | 182447 | HNRPC |
| 24 | 4.71 | 4.8e-05 | 60 | 1.422 | H006183_01 | Metallothionein 1X | X65607 | 278462 | MT1X |
| 25 | 6.23 | p < 0.000001 | 100 | 1.453 | H007688_01 | 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase 3 | AF109735 | 195471 | PFKFB3 |
| 26 | 7.26 | p < 0.000001 | 100 | 1.553 | H002655_01 | Collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive) |
L02870 | 1640 | COL7A1 |
Table – Sorted by mean difference:
| t-value | Parametric p-value | % CV support | Geometric mean of ratios (class Disease /class Normal ) | Qiagen oligo ID | Description | GB acc | UG cluster | Gene symbol | |
|---|---|---|---|---|---|---|---|---|---|
| 19 | 4.53 | 8.1e-05 | 40 | 1.757 | H002994_01 | Collagen, type XVIII, alpha 1 | AF018081 | 78409 | COL18A1 |
| 18 | 4.53 | 8.1e-05 | 36 | 1.689 | H016341_01 | Platelet derived growth factor C | NM_016205 | 43080 | PDGFC |
| 26 | 7.26 | p < 0.000001 | 100 | 1.553 | H002655_01 | Collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive) |
L02870 | 1640 | COL7A1 |
| 25 | 6.23 | p < 0.000001 | 100 | 1.453 | H007688_01 | 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase 3 |
AF109735 | 195471 | PFKFB3 |
| 24 | 4.71 | 4.8e-05 | 60 | 1.422 | H006183_01 | Metallothionein 1X | X65607 | 278462 | MT1X |
| 22 | 4.64 | 5.9e-05 | 40 | 1.352 | H000498_01 | Desmoplakin (DPI, DPII) | AL031058 | 74316 | DSP |
| 20 | 4.55 | 7.7e-05 | 36 | 1.314 | H003383_01 | Ras-related C3 botulinum toxin substrate 2 (rho family, small GTP binding protein Rac2) |
Z82188 | 173466 | RAC2 |
| 23 | 4.69 | 5.1e-05 | 60 | 1.251 | H011854_01 | Heterogeneous nuclear ribonucleoprotein C (C1/C2) |
M16342 | 182447 | HNRPC |
| 21 | 4.6 | 6.6e-05 | 36 | 1.249 | H003776_01 | Aldehyde dehydrogenase 2 family (mitochondrial) |
X05409 | 195432 | ALDH2 |
| 5 | -4.86 | 3.1e-05 | 100 | 0.811 | H005438_01 | Homo sapiens mRNA; cDNA DKFZp434B1620 (from clone DKFZp434B1620) |
AL137548 | 43112 | |
| 13 | -4.53 | 8e-05 | 24 | 0.771 | H016494_01 | Hypothetical protein FLJ12436 | NM_024661 | 69485 | FLJ12436 |
| 10 | -4.78 | 3.9e-05 | 100 | 0.768 | H002574_01 | Alcohol dehydrogenase 5 (class III), chi polypeptide |
M81118 | 78989 | ADH5 |
| 12 | -4.72 | 4.6e-05 | 76 | 0.751 | H004767_01 | Tetraspan 3 | AK001326 | 100090 | TSPAN-3 |
| 17 | -4.46 | 9.8e-05 | 36 | 0.742 | H008089_01 | KIAA0469 gene product | AB007938 | 7764 | KIAA0469 |
| 6 | -4.86 | 3.1e-05 | 100 | 0.728 | H004078_01 | Heme-binding protein | NM_015987 | 108675 | HEBP |
| 15 | -4.51 | 8.4e-05 | 32 | 0.724 | H002462_01 | Receptor tyrosine kinase-like orphan receptor 1 |
M97675 | 274243 | ROR1 |
| 14 | -4.52 | 8.4e-05 | 36 | 0.721 | H003965_01 | Cellular repressor of E1A-stimulated genes |
AF084523 | 5710 | CREG |
| 8 | -4.8 | 3.8e-05 | 100 | 0.708 | H002860_01 | Inositol polyphosphate-1-phosphatase | L08488 | 32309 | INPP1 |
| 16 | -4.47 | 9.6e-05 | 32 | 0.689 | H000959_01 | Glycophorin C (Gerbich blood group) | NM_002101 | 81994 | GYPC |
| 1 | -6.61 | p < 0.000001 | 100 | 0.621 | H003528_01 | Decay accelerating factor for complement (CD55, Cromer blood group system) |
M30142 | 1369 | DAF |
| 9 | -4.79 | 3.8e-05 | 100 | 0.62 | H010655_01 | Hypothetical protein FLJ20546 | AK000953 | 279896 | FLJ20546 |
| 4 | -5.48 | 5e-06 | 100 | 0.584 | H009347_01 | Neuronal cell adhesion molecule | AB002341 | 7912 | NRCAM |
| 3 | -5.91 | 2e-06 | 100 | 0.548 | H016371_01 | Hypothetical protein FLJ21212 | NM_024642 | 47099 | FLJ21212 |
| 2 | -5.92 | 1e-06 | 100 | 0.548 | H003827_01 | Serum/glucocorticoid regulated kinase | AJ000512 | 296323 | SGK |
| 11 | -4.78 | 3.9e-05 | 100 | 0.517 | H003858_01 | Aldo-keto reductase family 1, member C2 (dihydrodiol dehydrogenase 2; bile acid binding protein; 3-a |
U05598 | 201967 | AKR1C2 |
| 7 | -4.85 | 3.2e-05 | 100 | 0.465 | H001509_01 | Aldo-keto reductase family 1, member C3 (3-alpha hydroxysteroid dehydrogenase, type II) |
D17793 | 78183 | AKR1C3 |
‘Observed v. Expected’ table of GO classes and parent classes, in list of 26 genes shown above:
Only GO classes and parent classes with at least 5 observations in the selected subset and with an ‘Observed vs. Expected’ ratio of at least 2 are shown.
Biological Process
| GO id | GO classification | Observed in selected subset | Expected in selected subset | Observed/Expected |
|---|---|---|---|---|
| 0009887 | organogenesis | 5 | 1.24 | 4.02 |
| 0009653 | morphogenesis | 5 | 1.57 | 3.19 |
| 0007275 | development | 7 | 2.37 | 2.95 |