Metam
ORF

Logo

Gene

Gene

ID CARD

ID

HGNC:2529

Aliases

CLN10
CPSD, CTSD, ENSG00000117984
HGNC:2529, NCBI:1509, OFF:CTSD

Chromosome

11

Transcripts

67 ORFs
8 known transcripts
73 ORF to known transcript associations

MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

ORFs

Display all the transcripts related to the entry.

2154 ENST00000637381 CTSD-210 protein_coding
MetamORF
ORF ID

The MetamORF ID of the ORF. This is an unique ID referring to the ORF.

ORF
length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.

Nucleic
sequence

The nucleic sequence of the ORF.

Amino acid
sequence

The amino acid sequence of the ORF.

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Start codon

The start codon sequence of the ORF.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

ORF annotations

A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of the cell types in which the ORF has already been identified on the transcript.

633513 48

TTG ...

LPK ...

Ribo-seq
TTG

weak

4 sORFs_org_Human
    sORF
loayza_puch_2016, MDA-MB-231, RPE-1,
THP-1
903353 54

CAA ...

QVQ ...

Ribo-seq
CAA

1 sORFs_org_Human
    sORF
Jurkat
903357 234

AGT ...

STT ...

Ribo-seq
AGT

1 sORFs_org_Human
    sORF
Jurkat
2153 30

CTG ...

LQT ...

Ribo-seq
CTG

23 sORFs_org_Human
    sORF
BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS
903349 42

CGA ...

RQV ...

Ribo-seq
CGA

3 sORFs_org_Human
    sORF
HeLa, Jurkat, THP-1
903351 48

CAA ...

QQR ...

Ribo-seq
CAA

3 sORFs_org_Human
    sORF
HeLa, Jurkat, THP-1
2156 48

GTG ...

VGP ...

Ribo-seq
GTG

20 sORFs_org_Human
    sORF
BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS
1296804 219

TTG ...

LAS ...

Ribo-seq
TTG

1 sORFs_org_Human
    sORF
HEK293T
2158 87

GTG ...

VLH ...

Ribo-seq
GTG

24 sORFs_org_Human
    sORF
BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, HFF,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, Monocyte, RPE-1,
THP-1, U2OS
903355 78

AGT ...

SRL ...

Ribo-seq
AGT

3 sORFs_org_Human
    sORF
HeLa, Jurkat, THP-1
3883589 ENST00000433655 CTSD-204 nonsense_mediated_decay
MetamORF
ORF ID

The MetamORF ID of the ORF. This is an unique ID referring to the ORF.

ORF
length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.

Nucleic
sequence

The nucleic sequence of the ORF.

Amino acid
sequence

The amino acid sequence of the ORF.

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Start codon

The start codon sequence of the ORF.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

ORF annotations

A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of the cell types in which the ORF has already been identified on the transcript.

282253 54

ATG ...

MVP ...

Ribo-seq
ATG

strong

1 Johnstone2016
    Alternative
    InCDS
    NMD
    Overlapping
    sORF
Brain_tumor, HEK293, HeLa,
HFF
3311971 831

ATG ...

MQP ...

Ribo-seq
ATG

1 Johnstone2016
    CDS
    NMD
Brain_tumor, HEK293, HeLa,
HFF
2144 69

ATG ...

MAS ...

Ribo-seq
ATG

moderate

1 Johnstone2016
    Alternative
    InCDS
    NMD
    Overlapping
    sORF
Brain_tumor, HEK293, HeLa,
HFF
2129 45

ATG ...

MRC ...

Ribo-seq
ATG

weak

1 Johnstone2016
    Downstream
    NMD
    sORF
Brain_tumor, HEK293, HeLa,
HFF
9445 63

ATG ...

MSP ...

Ribo-seq
ATG

1 Johnstone2016
    Downstream
    NMD
    Overlapping
    sORF
Brain_tumor, HEK293, HeLa,
HFF
2145 ENST00000367196 CTSD-202 protein_coding
MetamORF
ORF ID

The MetamORF ID of the ORF. This is an unique ID referring to the ORF.

ORF
length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.

Nucleic
sequence

The nucleic sequence of the ORF.

Amino acid
sequence

The amino acid sequence of the ORF.

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Start codon

The start codon sequence of the ORF.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

ORF annotations

A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of the cell types in which the ORF has already been identified on the transcript.

533482 270

CTG ...

LGT ...

Ribo-seq
CTG

14 sORFs_org_Human
    sORF
BJ, Brain, HEK293,
HEK293T, HeLa, hES,
loayza_puch_2016, MM1S, Monocyte,
RPE-1, THP-1, U2OS
2149 144

GTG ...

VSK ...

Ribo-seq
GTG

moderate

21 sORFs_org_Human
    sORF
BJ, Blood, Brain,
HEK293, HEK293T, HeLa,
hES, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS
526664 42

TTG ...

LTS ...

Ribo-seq
TTG

weak

15 sORFs_org_Human
    sORF
Blood, Brain, HAP1,
HEK293, HEK293T, HeLa,
HFF, loayza_puch_2016, MDA-MB-231,
MM1S, RPE-1, U2OS
915743 81

CCT ...

PPS ...

Ribo-seq
CCT

1 sORFs_org_Human
    sORF
Jurkat
7839 195

CTG ...

LPG ...

Ribo-seq
CTG

25 sORFs_org_Human
    sORF
BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, HFF,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, Monocyte, RPE-1,
THP-1, U2OS
2144 69

ATG ...

MAS ...

Ribo-seq
ATG

moderate

17 sORFs_org_Human
    sORF
BJ, Blood, HEK293,
HeLa, hES, HFF,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, Monocyte, RPE-1,
THP-1, U2OS
2616873 153

AGA ...

RLA ...

Ribo-seq
AGA

2 sORFs_org_Human
    sORF
HeLa, THP-1
2147 120

TTG ...

LGR ...

Ribo-seq
TTG

moderate

20 sORFs_org_Human
    sORF
BJ, Blood, Brain,
HEK293, HeLa, hES,
HFF, LCL, loayza_puch_2016,
MDA-MB-231, MM1S, Monocyte,
RPE-1, THP-1, U2OS
936108 30

ATG ...

MAR ...

Ribo-seq
ATG

strong

10 sORFs_org_Human
    sORF
Blood, Brain, HAP1,
HEK293, HeLa, MDA-MB-231,
RPE-1
2153 30

CTG ...

LQT ...

Ribo-seq
CTG

1 sORFs_org_Human
    sORF
Flp-In_T-REx-293
2151 156

CTG ...

LPW ...

Ribo-seq
CTG

weak

20 sORFs_org_Human
    sORF
BJ, Blood, Brain,
HEK293, HeLa, hES,
HFF, LCL, loayza_puch_2016,
MDA-MB-231, MM1S, Monocyte,
RPE-1, THP-1, U2OS
915747 165

AGT ...

STT ...

Ribo-seq
AGT

1 sORFs_org_Human
    sORF
Jurkat
2616118 171

CAG ...

QRR ...

Ribo-seq
CAG

moderate

2 sORFs_org_Human
    sORF
HeLa, THP-1
2616116 42

AGC ...

SRS ...

Ribo-seq
AGC

weak

2 sORFs_org_Human
    sORF
HeLa, THP-1
533484 276

TTG ...

LAL ...

Ribo-seq
TTG

12 sORFs_org_Human
    sORF
BJ, Brain, HEK293T,
HeLa, hES, loayza_puch_2016,
MM1S, Monocyte, RPE-1,
THP-1, U2OS
282253 54

ATG ...

MVP ...

Ribo-seq
ATG

strong

21 sORFs_org_Human
    sORF
BJ, Blood, Brain,
Flp-In_T-REx-293, HAP1, HEK293,
HEK293T, HeLa, hES,
HFF, LCL, loayza_puch_2016,
MDA-MB-231, MM1S, Monocyte,
RPE-1, U2OS
7841 183

CTG ...

LSP ...

Ribo-seq
CTG

24 sORFs_org_Human
    sORF
BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, HFF,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, Monocyte, RPE-1,
THP-1, U2OS
533486 300

ATG ...

MPL ...

Ribo-seq
ATG

9 sORFs_org_Human
    sORF
BJ, Brain, HEK293T,
HeLa, hES, loayza_puch_2016,
MM1S, Monocyte, U2OS
7837 150

TTG ...

LAS ...

Ribo-seq
TTG

21 sORFs_org_Human
    sORF
BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, HFF,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, Monocyte, RPE-1,
THP-1, U2OS
3199897 36

AGC ...

SWW ...

Ribo-seq
AGC

moderate

1 sORFs_org_Human
    sORF
HEK293
3199899 81

CAG ...

QPS ...

Ribo-seq
CAG

weak

1 sORFs_org_Human
    sORF
HEK293
915745 114

TCT ...

SST ...

Ribo-seq
TCT

1 sORFs_org_Human
    sORF
Jurkat
9437 ENST00000637937 CTSD-214 processed_transcript
MetamORF
ORF ID

The MetamORF ID of the ORF. This is an unique ID referring to the ORF.

ORF
length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.

Nucleic
sequence

The nucleic sequence of the ORF.

Amino acid
sequence

The amino acid sequence of the ORF.

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Start codon

The start codon sequence of the ORF.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

ORF annotations

A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of the cell types in which the ORF has already been identified on the transcript.

9436 222

CTG ...

LKL ...

Ribo-seq
CTG

32 sORFs_org_Human
    ncRNA
    sORF
BJ, Blood, Brain,
Flp-In_T-REx-293, guo_2014, HAP1,
HEK293, HEK293T, HeLa,
hES, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS
917610 210

GGC ...

GKG ...

Ribo-seq
GGC

3 sORFs_org_Human
    ncRNA
    sORF
HeLa, Jurkat, THP-1
9447 195

CTG ...

LSP ...

Ribo-seq
CTG

32 sORFs_org_Human
    ncRNA
    sORF
BJ, Blood, Brain,
Flp-In_T-REx-293, guo_2014, HAP1,
HEK293, HEK293T, HeLa,
hES, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS
917608 57

CCC ...

PAR ...

Ribo-seq
CCC

1 sORFs_org_Human
    ncRNA
    sORF
Jurkat
9443 246

GTG ...

VST ...

Ribo-seq
GTG

32 sORFs_org_Human
    ncRNA
    sORF
BJ, Blood, Brain,
Flp-In_T-REx-293, guo_2014, HAP1,
HEK293, HEK293T, HeLa,
hES, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS
2617008 225

ACA ...

TLK ...

Ribo-seq
ACA

2 sORFs_org_Human
    ncRNA
    sORF
HeLa, THP-1
9445 63

ATG ...

MSP ...

Ribo-seq
ATG

20 sORFs_org_Human
    ncRNA
    sORF
BJ, Blood, Brain,
HEK293, HEK293T, HeLa,
hES, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS
9439 216

CTG ...

LGG ...

Ribo-seq
CTG

32 sORFs_org_Human
    ncRNA
    sORF
BJ, Blood, Brain,
Flp-In_T-REx-293, guo_2014, HAP1,
HEK293, HEK293T, HeLa,
hES, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS
9441 237

CTG ...

LPA ...

Ribo-seq
CTG

32 sORFs_org_Human
    ncRNA
    sORF
BJ, Blood, Brain,
Flp-In_T-REx-293, guo_2014, HAP1,
HEK293, HEK293T, HeLa,
hES, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS
409852 261

ATC ...

IPC ...

Ribo-seq
ATC

1 sORFs_org_Human
    ncRNA
    sORF
HCT116
2125 ENST00000429746 CTSD-203 protein_coding
MetamORF
ORF ID

The MetamORF ID of the ORF. This is an unique ID referring to the ORF.

ORF
length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.

Nucleic
sequence

The nucleic sequence of the ORF.

Amino acid
sequence

The amino acid sequence of the ORF.

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Start codon

The start codon sequence of the ORF.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

ORF annotations

A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of the cell types in which the ORF has already been identified on the transcript.

2124 48

CTG ...

LDP ...

Ribo-seq
CTG

strong

23 sORFs_org_Human
    sORF
BJ, Blood, Brain,
HAP1, HEK293, HeLa,
hES, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS
2616869 78

TGG ...

WEA ...

Ribo-seq
TGG

2 sORFs_org_Human
    sORF
HeLa, THP-1
2127 99

CTG ...

LPE ...

Ribo-seq
CTG

weak

25 sORFs_org_Human
    sORF
BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, HFF,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, RPE-1, THP-1,
U2OS
903335 108

GAA ...

EDP ...

Ribo-seq
GAA

moderate

2 sORFs_org_Human
    sORF
HEK293, Jurkat
2130 ENST00000497544 CTSD-206 retained_intron
MetamORF
ORF ID

The MetamORF ID of the ORF. This is an unique ID referring to the ORF.

ORF
length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.

Nucleic
sequence

The nucleic sequence of the ORF.

Amino acid
sequence

The amino acid sequence of the ORF.

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Start codon

The start codon sequence of the ORF.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

ORF annotations

A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of the cell types in which the ORF has already been identified on the transcript.

2134 93

GTG ...

VRP ...

Ribo-seq
GTG

moderate

24 sORFs_org_Human
    Intronic
    sORF
BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, HFF,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, Monocyte, RPE-1,
THP-1, U2OS
903341 102

ACT ...

TEA ...

Ribo-seq
ACT

1 sORFs_org_Human
    Intronic
    sORF
Jurkat
2132 84

TTG ...

LWT ...

Ribo-seq
TTG

weak

24 sORFs_org_Human
    Intronic
    sORF
BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, HFF,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, Monocyte, RPE-1,
THP-1, U2OS
2140 237

GTG ...

VAS ...

Ribo-seq
GTG

strong

16 sORFs_org_Human
    Intronic
    sORF
BJ, Brain, guo_2014,
HEK293, HeLa, hES,
HFF, loayza_puch_2016, MM1S,
U2OS
2142 243

GTG ...

VEV ...

Ribo-seq
GTG

moderate

18 sORFs_org_Human
    Intronic
    sORF
BJ, Brain, guo_2014,
HEK293, HeLa, hES,
HFF, loayza_puch_2016, MM1S,
THP-1, U2OS
2136 219

CTG ...

LCK ...

Ribo-seq
CTG

moderate

12 sORFs_org_Human
    Intronic
    sORF
BJ, Brain, guo_2014,
HEK293, HeLa, hES,
HFF, loayza_puch_2016
481628 48

CTG ...

LNV ...

Ribo-seq
CTG

6 sORFs_org_Human
    Intronic
    sORF
Flp-In_T-REx-293, HEK293, HEK293T,
HeLa
2616111 30

AGG ...

RCP ...

Ribo-seq
AGG

moderate

2 sORFs_org_Human
    Intronic
    sORF
HeLa, THP-1
2138 225

CTG ...

LTL ...

Ribo-seq
CTG

moderate

13 sORFs_org_Human
    Intronic
    sORF
BJ, Brain, guo_2014,
HEK293, HeLa, hES,
HFF, loayza_puch_2016, MM1S
1007345 192

GTG ...

VDT ...

Ribo-seq
GTG

strong

3 sORFs_org_Human
    Intronic
    sORF
guo_2014, HEK293, HeLa
282241 36

GTG ...

VRR ...

Ribo-seq
GTG

weak

21 sORFs_org_Human
    Intronic
    sORF
BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, HFF,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, Monocyte, RPE-1,
THP-1, U2OS
903339 51

TGG ...

WEA ...

Ribo-seq
TGG

strong

3 sORFs_org_Human
    Intronic
    sORF
HeLa, Jurkat, THP-1
1007343 168

GTG ...

VGP ...

Ribo-seq
GTG

strong

1 sORFs_org_Human
    Intronic
    sORF
guo_2014
481630 57

CTG ...

LSY ...

Ribo-seq
CTG

5 sORFs_org_Human
    Intronic
    sORF
Flp-In_T-REx-293, HEK293, HEK293T,
HeLa
903337 78

GGC ...

GKG ...

Ribo-seq
GGC

3 sORFs_org_Human
    Intronic
    sORF
HeLa, Jurkat, THP-1
2616109 93

ACA ...

TLK ...

Ribo-seq
ACA

2 sORFs_org_Human
    Intronic
    sORF
HeLa, THP-1
2129 45

ATG ...

MRC ...

Ribo-seq
ATG

weak

21 sORFs_org_Human
    Intronic
    sORF
BJ, Blood, Brain,
guo_2014, HAP1, HEK293,
HeLa, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
RPE-1, THP-1, U2OS
282247 ENST00000438213 CTSD-205 protein_coding
MetamORF
ORF ID

The MetamORF ID of the ORF. This is an unique ID referring to the ORF.

ORF
length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.

Nucleic
sequence

The nucleic sequence of the ORF.

Amino acid
sequence

The amino acid sequence of the ORF.

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Start codon

The start codon sequence of the ORF.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

ORF annotations

A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of the cell types in which the ORF has already been identified on the transcript.

282246 45

GTG ...

VAQ ...

Ribo-seq
GTG

moderate

7 sORFs_org_Human
    sORF
LCL, loayza_puch_2016, MDA-MB-231,
Monocyte, RPE-1, THP-1
2616114 36

ACT ...

TPS ...

Ribo-seq
ACT

weak

2 sORFs_org_Human
    sORF
HeLa, THP-1
633503 48

ATG ...

MSP ...

Ribo-seq
ATG

11 sORFs_org_Human
    sORF
BJ, Blood, Brain,
HeLa, hES, loayza_puch_2016,
MDA-MB-231, MM1S, RPE-1,
THP-1
292929 ENST00000636843 CTSD-208 protein_coding
MetamORF
ORF ID

The MetamORF ID of the ORF. This is an unique ID referring to the ORF.

ORF
length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.

Nucleic
sequence

The nucleic sequence of the ORF.

Amino acid
sequence

The amino acid sequence of the ORF.

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Start codon

The start codon sequence of the ORF.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

ORF annotations

A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of the cell types in which the ORF has already been identified on the transcript.

9445 63

ATG ...

MSP ...

Ribo-seq
ATG

1 sORFs_org_Human
    Alternative
    InCDS
    Overlapping
    sORF
HeLa
292928 96

CTG ...

LHP ...

Ribo-seq
CTG

12 sORFs_org_Human
    Alternative
    InCDS
    Overlapping
    sORF
BJ, Blood, HEK293,
HEK293T, HeLa, LCL,
loayza_puch_2016, MDA-MB-231, RPE-1,
THP-1, U2OS

ORFs

67 ORFs
8 known transcripts
73 ORF to known transcript associations

MetamORF
ORF ID

The MetamORF ID of the ORF. This is an unique ID referring to the ORF.

ORF Length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.

Nucleic
sequence

The nucleic sequence of the ORF.

Amino acid
sequence

The amino acid sequence of the ORF.

Start codon

The start codon sequence of the ORF.

ORF annotations

A comma-separated list of all the annotations computed by our algorithm for the ORF. This list includes the annotations computed for the ORF for all transcripts. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of all the cell types in which the ORF has already been identified.

Transcripts

Display all the transcripts related to the entry.

633513 48

TTG...

LPK...

TTG sORF
loayza_puch_2016, MDA-MB-231, RPE-1,
THP-1
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2154 ENST00000637381 CTSD-210 protein_coding Ribo-seq
weak 4 sORFs_org_Human
282253 54

ATG...

MVP...

ATG Alternative
InCDS
NMD
Overlapping
sORF
Brain_tumor, HEK293, HeLa,
HFF, BJ, Blood,
Brain, Flp-In_T-REx-293, HAP1,
HEK293T, hES, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

3883589 ENST00000433655 CTSD-204 nonsense_mediated_decay Ribo-seq
strong 1 Johnstone2016
2145 ENST00000367196 CTSD-202 protein_coding Ribo-seq
strong 21 sORFs_org_Human
3311971 831

ATG...

MQP...

ATG CDS
NMD

Brain_tumor, HEK293, HeLa,
HFF
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

3883589 ENST00000433655 CTSD-204 nonsense_mediated_decay Ribo-seq
1 Johnstone2016
3311963 UNKNOWN_TRANSCRIPT Ribo-seq
1 Erhard2018
533482 270

CTG...

LGT...

CTG sORF
BJ, Brain, HEK293,
HEK293T, HeLa, hES,
loayza_puch_2016, MM1S, Monocyte,
RPE-1, THP-1, U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2145 ENST00000367196 CTSD-202 protein_coding Ribo-seq
14 sORFs_org_Human
9436 222

CTG...

LKL...

CTG ncRNA
sORF
BJ, Blood, Brain,
Flp-In_T-REx-293, guo_2014, HAP1,
HEK293, HEK293T, HeLa,
hES, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

9437 ENST00000637937 CTSD-214 processed_transcript Ribo-seq
32 sORFs_org_Human
2124 48

CTG...

LDP...

CTG sORF
BJ, Blood, Brain,
HAP1, HEK293, HeLa,
hES, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2125 ENST00000429746 CTSD-203 protein_coding Ribo-seq
strong 23 sORFs_org_Human
2134 93

GTG...

VRP...

GTG Intronic
sORF
BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, HFF,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, Monocyte, RPE-1,
THP-1, U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2130 ENST00000497544 CTSD-206 retained_intron Ribo-seq
moderate 24 sORFs_org_Human
903353 54

CAA...

QVQ...

CAA sORF
Jurkat
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2154 ENST00000637381 CTSD-210 protein_coding Ribo-seq
1 sORFs_org_Human
2149 144

GTG...

VSK...

GTG sORF
BJ, Blood, Brain,
HEK293, HEK293T, HeLa,
hES, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2145 ENST00000367196 CTSD-202 protein_coding Ribo-seq
moderate 21 sORFs_org_Human
526664 42

TTG...

LTS...

TTG sORF
Blood, Brain, HAP1,
HEK293, HEK293T, HeLa,
HFF, loayza_puch_2016, MDA-MB-231,
MM1S, RPE-1, U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2145 ENST00000367196 CTSD-202 protein_coding Ribo-seq
weak 15 sORFs_org_Human
917610 210

GGC...

GKG...

GGC ncRNA
sORF
HeLa, Jurkat, THP-1
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

9437 ENST00000637937 CTSD-214 processed_transcript Ribo-seq
3 sORFs_org_Human
9447 195

CTG...

LSP...

CTG ncRNA
sORF
BJ, Blood, Brain,
Flp-In_T-REx-293, guo_2014, HAP1,
HEK293, HEK293T, HeLa,
hES, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

9437 ENST00000637937 CTSD-214 processed_transcript Ribo-seq
32 sORFs_org_Human
903357 234

AGT...

STT...

AGT sORF
Jurkat
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2154 ENST00000637381 CTSD-210 protein_coding Ribo-seq
1 sORFs_org_Human
917608 57

CCC...

PAR...

CCC ncRNA
sORF
Jurkat
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

9437 ENST00000637937 CTSD-214 processed_transcript Ribo-seq
1 sORFs_org_Human
9443 246

GTG...

VST...

GTG ncRNA
sORF
BJ, Blood, Brain,
Flp-In_T-REx-293, guo_2014, HAP1,
HEK293, HEK293T, HeLa,
hES, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

9437 ENST00000637937 CTSD-214 processed_transcript Ribo-seq
32 sORFs_org_Human
915743 81

CCT...

PPS...

CCT sORF
Jurkat
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2145 ENST00000367196 CTSD-202 protein_coding Ribo-seq
1 sORFs_org_Human
7839 195

CTG...

LPG...

CTG sORF
BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, HFF,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, Monocyte, RPE-1,
THP-1, U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2145 ENST00000367196 CTSD-202 protein_coding Ribo-seq
25 sORFs_org_Human
2144 69

ATG...

MAS...

ATG sORF
Alternative
InCDS
NMD
Overlapping
BJ, Blood, HEK293,
HeLa, hES, HFF,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, Monocyte, RPE-1,
THP-1, U2OS, Brain_tumor
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2145 ENST00000367196 CTSD-202 protein_coding Ribo-seq
moderate 17 sORFs_org_Human
3883589 ENST00000433655 CTSD-204 nonsense_mediated_decay Ribo-seq
moderate 1 Johnstone2016
903341 102

ACT...

TEA...

ACT Intronic
sORF
Jurkat
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2130 ENST00000497544 CTSD-206 retained_intron Ribo-seq
1 sORFs_org_Human
2617008 225

ACA...

TLK...

ACA ncRNA
sORF
HeLa, THP-1
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

9437 ENST00000637937 CTSD-214 processed_transcript Ribo-seq
2 sORFs_org_Human
2616873 153

AGA...

RLA...

AGA sORF
HeLa, THP-1
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2145 ENST00000367196 CTSD-202 protein_coding Ribo-seq
2 sORFs_org_Human
9445 63

ATG...

MSP...

ATG ncRNA
sORF
Alternative
InCDS
Overlapping
Downstream
NMD
BJ, Blood, Brain,
HEK293, HEK293T, HeLa,
hES, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS, Brain_tumor
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

9437 ENST00000637937 CTSD-214 processed_transcript Ribo-seq
20 sORFs_org_Human
292929 ENST00000636843 CTSD-208 protein_coding Ribo-seq
1 sORFs_org_Human
3883589 ENST00000433655 CTSD-204 nonsense_mediated_decay Ribo-seq
1 Johnstone2016
2153 30

CTG...

LQT...

CTG sORF
BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS, Flp-In_T-REx-293
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2154 ENST00000637381 CTSD-210 protein_coding Ribo-seq
23 sORFs_org_Human
2145 ENST00000367196 CTSD-202 protein_coding Ribo-seq
1 sORFs_org_Human
2132 84

TTG...

LWT...

TTG Intronic
sORF
BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, HFF,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, Monocyte, RPE-1,
THP-1, U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2130 ENST00000497544 CTSD-206 retained_intron Ribo-seq
weak 24 sORFs_org_Human
2129 45

ATG...

MRC...

ATG Downstream
NMD
sORF
Intronic
Brain_tumor, HEK293, HeLa,
HFF, BJ, Blood,
Brain, guo_2014, HAP1,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, RPE-1, THP-1,
U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

3883589 ENST00000433655 CTSD-204 nonsense_mediated_decay Ribo-seq
weak 1 Johnstone2016
2130 ENST00000497544 CTSD-206 retained_intron Ribo-seq
weak 21 sORFs_org_Human
9439 216

CTG...

LGG...

CTG ncRNA
sORF
BJ, Blood, Brain,
Flp-In_T-REx-293, guo_2014, HAP1,
HEK293, HEK293T, HeLa,
hES, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

9437 ENST00000637937 CTSD-214 processed_transcript Ribo-seq
32 sORFs_org_Human
2147 120

TTG...

LGR...

TTG sORF
BJ, Blood, Brain,
HEK293, HeLa, hES,
HFF, LCL, loayza_puch_2016,
MDA-MB-231, MM1S, Monocyte,
RPE-1, THP-1, U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2145 ENST00000367196 CTSD-202 protein_coding Ribo-seq
moderate 20 sORFs_org_Human
2140 237

GTG...

VAS...

GTG Intronic
sORF
BJ, Brain, guo_2014,
HEK293, HeLa, hES,
HFF, loayza_puch_2016, MM1S,
U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2130 ENST00000497544 CTSD-206 retained_intron Ribo-seq
strong 16 sORFs_org_Human
2142 243

GTG...

VEV...

GTG Intronic
sORF
BJ, Brain, guo_2014,
HEK293, HeLa, hES,
HFF, loayza_puch_2016, MM1S,
THP-1, U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2130 ENST00000497544 CTSD-206 retained_intron Ribo-seq
moderate 18 sORFs_org_Human
936108 30

ATG...

MAR...

ATG sORF
Blood, Brain, HAP1,
HEK293, HeLa, MDA-MB-231,
RPE-1
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2145 ENST00000367196 CTSD-202 protein_coding Ribo-seq
strong 10 sORFs_org_Human
282246 45

GTG...

VAQ...

GTG sORF
LCL, loayza_puch_2016, MDA-MB-231,
Monocyte, RPE-1, THP-1
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

282247 ENST00000438213 CTSD-205 protein_coding Ribo-seq
moderate 7 sORFs_org_Human
2616869 78

TGG...

WEA...

TGG sORF
HeLa, THP-1
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2125 ENST00000429746 CTSD-203 protein_coding Ribo-seq
2 sORFs_org_Human
2136 219

CTG...

LCK...

CTG Intronic
sORF
BJ, Brain, guo_2014,
HEK293, HeLa, hES,
HFF, loayza_puch_2016
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2130 ENST00000497544 CTSD-206 retained_intron Ribo-seq
moderate 12 sORFs_org_Human
481628 48

CTG...

LNV...

CTG Intronic
sORF
Flp-In_T-REx-293, HEK293, HEK293T,
HeLa
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2130 ENST00000497544 CTSD-206 retained_intron Ribo-seq
6 sORFs_org_Human
2616111 30

AGG...

RCP...

AGG Intronic
sORF
HeLa, THP-1
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2130 ENST00000497544 CTSD-206 retained_intron Ribo-seq
moderate 2 sORFs_org_Human
2138 225

CTG...

LTL...

CTG Intronic
sORF
BJ, Brain, guo_2014,
HEK293, HeLa, hES,
HFF, loayza_puch_2016, MM1S
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2130 ENST00000497544 CTSD-206 retained_intron Ribo-seq
moderate 13 sORFs_org_Human
9441 237

CTG...

LPA...

CTG ncRNA
sORF
BJ, Blood, Brain,
Flp-In_T-REx-293, guo_2014, HAP1,
HEK293, HEK293T, HeLa,
hES, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

9437 ENST00000637937 CTSD-214 processed_transcript Ribo-seq
32 sORFs_org_Human
2151 156

CTG...

LPW...

CTG sORF
BJ, Blood, Brain,
HEK293, HeLa, hES,
HFF, LCL, loayza_puch_2016,
MDA-MB-231, MM1S, Monocyte,
RPE-1, THP-1, U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2145 ENST00000367196 CTSD-202 protein_coding Ribo-seq
weak 20 sORFs_org_Human
903349 42

CGA...

RQV...

CGA sORF
HeLa, Jurkat, THP-1
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2154 ENST00000637381 CTSD-210 protein_coding Ribo-seq
3 sORFs_org_Human
903351 48

CAA...

QQR...

CAA sORF
HeLa, Jurkat, THP-1
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2154 ENST00000637381 CTSD-210 protein_coding Ribo-seq
3 sORFs_org_Human
1007345 192

GTG...

VDT...

GTG Intronic
sORF
guo_2014, HEK293, HeLa
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2130 ENST00000497544 CTSD-206 retained_intron Ribo-seq
strong 3 sORFs_org_Human
2616114 36

ACT...

TPS...

ACT sORF
HeLa, THP-1
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

282247 ENST00000438213 CTSD-205 protein_coding Ribo-seq
weak 2 sORFs_org_Human
282241 36

GTG...

VRR...

GTG Intronic
sORF
BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, HFF,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, Monocyte, RPE-1,
THP-1, U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2130 ENST00000497544 CTSD-206 retained_intron Ribo-seq
weak 21 sORFs_org_Human
915747 165

AGT...

STT...

AGT sORF
Jurkat
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2145 ENST00000367196 CTSD-202 protein_coding Ribo-seq
1 sORFs_org_Human
2616118 171

CAG...

QRR...

CAG sORF
HeLa, THP-1
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2145 ENST00000367196 CTSD-202 protein_coding Ribo-seq
moderate 2 sORFs_org_Human
2616116 42

AGC...

SRS...

AGC sORF
HeLa, THP-1
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2145 ENST00000367196 CTSD-202 protein_coding Ribo-seq
weak 2 sORFs_org_Human
903339 51

TGG...

WEA...

TGG Intronic
sORF
HeLa, Jurkat, THP-1
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2130 ENST00000497544 CTSD-206 retained_intron Ribo-seq
strong 3 sORFs_org_Human
533484 276

TTG...

LAL...

TTG sORF
BJ, Brain, HEK293T,
HeLa, hES, loayza_puch_2016,
MM1S, Monocyte, RPE-1,
THP-1, U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2145 ENST00000367196 CTSD-202 protein_coding Ribo-seq
12 sORFs_org_Human
1007343 168

GTG...

VGP...

GTG Intronic
sORF
guo_2014
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2130 ENST00000497544 CTSD-206 retained_intron Ribo-seq
strong 1 sORFs_org_Human
409852 261

ATC...

IPC...

ATC ncRNA
sORF
HCT116
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

9437 ENST00000637937 CTSD-214 processed_transcript Ribo-seq
1 sORFs_org_Human
481630 57

CTG...

LSY...

CTG Intronic
sORF
Flp-In_T-REx-293, HEK293, HEK293T,
HeLa
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2130 ENST00000497544 CTSD-206 retained_intron Ribo-seq
5 sORFs_org_Human
2127 99

CTG...

LPE...

CTG sORF
BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, HFF,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, RPE-1, THP-1,
U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2125 ENST00000429746 CTSD-203 protein_coding Ribo-seq
weak 25 sORFs_org_Human
2156 48

GTG...

VGP...

GTG sORF
BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2154 ENST00000637381 CTSD-210 protein_coding Ribo-seq
20 sORFs_org_Human
903337 78

GGC...

GKG...

GGC Intronic
sORF
HeLa, Jurkat, THP-1
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2130 ENST00000497544 CTSD-206 retained_intron Ribo-seq
3 sORFs_org_Human
7841 183

CTG...

LSP...

CTG sORF
BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, HFF,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, Monocyte, RPE-1,
THP-1, U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2145 ENST00000367196 CTSD-202 protein_coding Ribo-seq
24 sORFs_org_Human
1296804 219

TTG...

LAS...

TTG sORF
HEK293T
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2154 ENST00000637381 CTSD-210 protein_coding Ribo-seq
1 sORFs_org_Human
2616109 93

ACA...

TLK...

ACA Intronic
sORF
HeLa, THP-1
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2130 ENST00000497544 CTSD-206 retained_intron Ribo-seq
2 sORFs_org_Human
2158 87

GTG...

VLH...

GTG sORF
BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, HFF,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, Monocyte, RPE-1,
THP-1, U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2154 ENST00000637381 CTSD-210 protein_coding Ribo-seq
24 sORFs_org_Human
292928 96

CTG...

LHP...

CTG Alternative
InCDS
Overlapping
sORF
BJ, Blood, HEK293,
HEK293T, HeLa, LCL,
loayza_puch_2016, MDA-MB-231, RPE-1,
THP-1, U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

292929 ENST00000636843 CTSD-208 protein_coding Ribo-seq
12 sORFs_org_Human
903355 78

AGT...

SRL...

AGT sORF
HeLa, Jurkat, THP-1
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2154 ENST00000637381 CTSD-210 protein_coding Ribo-seq
3 sORFs_org_Human
533486 300

ATG...

MPL...

ATG sORF
BJ, Brain, HEK293T,
HeLa, hES, loayza_puch_2016,
MM1S, Monocyte, U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2145 ENST00000367196 CTSD-202 protein_coding Ribo-seq
9 sORFs_org_Human
7837 150

TTG...

LAS...

TTG sORF
BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, HFF,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, Monocyte, RPE-1,
THP-1, U2OS
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2145 ENST00000367196 CTSD-202 protein_coding Ribo-seq
21 sORFs_org_Human
3199897 36

AGC...

SWW...

AGC sORF
HEK293
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2145 ENST00000367196 CTSD-202 protein_coding Ribo-seq
moderate 1 sORFs_org_Human
3199899 81

CAG...

QPS...

CAG sORF
HEK293
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2145 ENST00000367196 CTSD-202 protein_coding Ribo-seq
weak 1 sORFs_org_Human
915745 114

TCT...

SST...

TCT sORF
Jurkat
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2145 ENST00000367196 CTSD-202 protein_coding Ribo-seq
1 sORFs_org_Human
903335 108

GAA...

EDP...

GAA sORF
HEK293, Jurkat
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

2125 ENST00000429746 CTSD-203 protein_coding Ribo-seq
moderate 2 sORFs_org_Human
633503 48

ATG...

MSP...

ATG sORF
BJ, Blood, Brain,
HeLa, hES, loayza_puch_2016,
MDA-MB-231, MM1S, RPE-1,
THP-1
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

282247 ENST00000438213 CTSD-205 protein_coding Ribo-seq
11 sORFs_org_Human

Export data

Identification method

Predicted

Ribo-seq

MS

Start Codon

Kozak context

moderate

weak

strong

optimal

ORF length

Transcript biotype

antisense

antisense_RNA

lincRNA

non_stop_decay

nonsense_mediated_decay

polymorphic_pseudogene

processed_pseudogene

processed_transcript

protein_coding

retained_intron

sense_intronic

sense_overlapping

TEC

transcribed_processed_pseudogene

transcribed_unitary_pseudogene

transcribed_unprocessed_pseudogene

unitary_pseudogene

unprocessed_pseudogene

ORF Annotations

Reading frame

Alternative

Relative position

CDS

Downstream

InCDS

Intronic

NewCDS

Overlapping

Upstream

Biotype

Intergenic

ncRNA

NMD

NSD

Pseudogene

Length

sORF

Cell types

B_cell

BJ

Blood

Brain

Brain_tumor

Breast

Flp-In_T-REx-293

guo_2014

HAP1

HCT116

HEK293

HEK293T

HeLa

hES

HFF

Jurkat

LCL

loayza_puch_2016

MDA-MB-231

MM1S

Monocyte

NCCIT

RPE-1

Skeletal_muscle

THP-1

U2OS