Gene HGNC:2529 | MetamORF

Gene

ID CARD

HGNC:2529

Aliases

CLN10, CPSD, CTSD, ENSG00000117984, HGNC:2529, NCBI:1509, OFF:CTSD

Chromosome

Transcripts

67 ORFs
8 known transcripts
73 ORF to known transcript associations
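The association count is higher than the ORF count because a single ORF can be reported on several transcripts. The following is a minimal Python sketch of that relationship, using a handful of (ORF ID, transcript name) pairs taken from the tables on this page; the function name `summarize` is illustrative and is not part of MetamORF.

```python
from typing import Iterable, Tuple


def summarize(pairs: Iterable[Tuple[int, str]]) -> Tuple[int, int, int]:
    """Return (distinct ORFs, distinct transcripts, associations)."""
    unique_pairs = set(pairs)  # one association per (ORF, transcript) pair
    orfs = {orf_id for orf_id, _ in unique_pairs}
    transcripts = {name for _, name in unique_pairs}
    return len(orfs), len(transcripts), len(unique_pairs)


# A small subset of the associations listed on this page.
pairs = [
    (633513, "CTSD-210"),
    (2153, "CTSD-210"),
    (2153, "CTSD-202"),
    (9445, "CTSD-204"),
    (9445, "CTSD-214"),
    (9445, "CTSD-208"),
]

print(summarize(pairs))  # (3, 5, 6)
```

In this subset, ORF 9445 alone contributes three associations, which is why the third number exceeds the first.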

MetamORF transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

ORFs

Display all the ORFs related to the entry.

2154

ENST00000637381

CTSD-210

protein_coding

MetamORF ORF ID

The MetamORF ID of the ORF. This is a unique ID referring to the ORF.

ORF length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of the exons constituting the ORF (thus excluding any introns) and includes both the start and the stop codons.

Nucleic sequence

The nucleic sequence of the ORF.

Amino acid sequence

The amino acid sequence of the ORF.

Identification

The method used to identify the ORF. MetamORF currently integrates data from three main types of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advanced documentation for more information.

Start codon

The start codon sequence of the ORF.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identified the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

ORF annotations

A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of the cell types in which the ORF has already been identified on the transcript.

MetamORF ORF ID	ORF length	Nucleic sequence	Amino acid sequence	Identification	Start codon	Kozak context	Exp. count	Data sources	ORF annotations	Cell types
633513	48	TTG ...	LPK ...	Ribo-seq	TTG	weak	4	sORFs_org_Human	sORF	loayza_puch_2016, MDA-MB-231, RPE-1, THP-1
903353	54	CAA ...	QVQ ...	Ribo-seq	CAA		1	sORFs_org_Human	sORF	Jurkat
903357	234	AGT ...	STT ...	Ribo-seq	AGT		1	sORFs_org_Human	sORF	Jurkat
2153	30	CTG ...	LQT ...	Ribo-seq	CTG		23	sORFs_org_Human	sORF	BJ, Blood, Brain, HAP1, HEK293, HEK293T, HeLa, hES, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, THP-1, U2OS
903349	42	CGA ...	RQV ...	Ribo-seq	CGA		3	sORFs_org_Human	sORF	HeLa, Jurkat, THP-1
903351	48	CAA ...	QQR ...	Ribo-seq	CAA		3	sORFs_org_Human	sORF	HeLa, Jurkat, THP-1
2156	48	GTG ...	VGP ...	Ribo-seq	GTG		20	sORFs_org_Human	sORF	BJ, Blood, Brain, HAP1, HEK293, HEK293T, HeLa, hES, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, THP-1, U2OS
1296804	219	TTG ...	LAS ...	Ribo-seq	TTG		1	sORFs_org_Human	sORF	HEK293T
2158	87	GTG ...	VLH ...	Ribo-seq	GTG		24	sORFs_org_Human	sORF	BJ, Blood, Brain, HAP1, HEK293, HEK293T, HeLa, hES, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, THP-1, U2OS
903355	78	AGT ...	SRL ...	Ribo-seq	AGT		3	sORFs_org_Human	sORF	HeLa, Jurkat, THP-1
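The ORF length column is defined as the sum of the exonic segments of the ORF, start and stop codons included. The following is a minimal Python sketch of that arithmetic, assuming 1-based inclusive genomic coordinates. The function `orf_length` and the segment coordinates are invented for illustration and are not taken from MetamORF.

```python
from typing import List, Tuple


def orf_length(segments: List[Tuple[int, int]]) -> int:
    """Sum the lengths of the exonic segments of an ORF.

    Each segment is a (start, end) pair in 1-based inclusive coordinates,
    so it spans end - start + 1 bases. Introns lying between segments are
    simply never counted.
    """
    total = 0
    for start, end in segments:
        if end < start:
            raise ValueError(f"invalid segment: {(start, end)}")
        total += end - start + 1
    return total


# Hypothetical ORF split over two exons: 20 bp + 28 bp.
segments = [(1001, 1020), (1501, 1528)]
length = orf_length(segments)
print(length)           # 48
print(length // 3 - 1)  # 15 codons encode amino acids; the stop codon does not
```

Because the stop codon is included, a length of 48 bp corresponds to 16 codons, of which 15 are translated.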

3883589

ENST00000433655

CTSD-204

nonsense_mediated_decay

MetamORF ORF ID	ORF length	Nucleic sequence	Amino acid sequence	Identification	Start codon	Kozak context	Exp. count	Data sources	ORF annotations	Cell types
282253	54	ATG ...	MVP ...	Ribo-seq	ATG	strong	1	Johnstone2016	Alternative InCDS NMD Overlapping sORF	Brain_tumor, HEK293, HeLa, HFF
3311971	831	ATG ...	MQP ...	Ribo-seq	ATG		1	Johnstone2016	CDS NMD	Brain_tumor, HEK293, HeLa, HFF
2144	69	ATG ...	MAS ...	Ribo-seq	ATG	moderate	1	Johnstone2016	Alternative InCDS NMD Overlapping sORF	Brain_tumor, HEK293, HeLa, HFF
2129	45	ATG ...	MRC ...	Ribo-seq	ATG	weak	1	Johnstone2016	Downstream NMD sORF	Brain_tumor, HEK293, HeLa, HFF
9445	63	ATG ...	MSP ...	Ribo-seq	ATG		1	Johnstone2016	Downstream NMD Overlapping sORF	Brain_tumor, HEK293, HeLa, HFF
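Each data row in these tables is tab-separated and follows the column order of the header row. The following is a minimal Python sketch for turning one row into a record. The `OrfRow` class, its field names and `parse_row` are hypothetical. The sketch assumes that annotations are separated by whitespace, as they appear in this text, and that empty cells (for instance a missing Kozak context) should be kept as empty strings.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class OrfRow:
    orf_id: int
    length: int              # ORF length in bp
    nucleic_sequence: str    # truncated on the page, stored as shown
    amino_acid_sequence: str
    identification: str
    start_codon: str
    kozak_context: str       # "" when the cell is empty
    exp_count: int
    data_sources: str
    annotations: List[str]
    cell_types: List[str]


def parse_row(line: str) -> OrfRow:
    """Parse one tab-separated ORF row (11 columns) into an OrfRow."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 11:
        raise ValueError(f"expected 11 fields, got {len(fields)}")
    return OrfRow(
        orf_id=int(fields[0]),
        length=int(fields[1]),
        nucleic_sequence=fields[2],
        amino_acid_sequence=fields[3],
        identification=fields[4],
        start_codon=fields[5],
        kozak_context=fields[6],
        exp_count=int(fields[7]),
        data_sources=fields[8],
        annotations=fields[9].split(),
        cell_types=[c.strip() for c in fields[10].split(",") if c.strip()],
    )


row = parse_row(
    "2129\t45\tATG ...\tMRC ...\tRibo-seq\tATG\tweak\t1\tJohnstone2016\t"
    "Downstream NMD sORF\tBrain_tumor, HEK293, HeLa, HFF"
)
print(row.orf_id, row.kozak_context, row.annotations, row.cell_types)
```

Splitting on the tab character rather than on arbitrary whitespace matters here, since empty cells show up as two consecutive tabs and several values contain spaces.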

2145

ENST00000367196

CTSD-202

protein_coding

MetamORF ORF ID	ORF length	Nucleic sequence	Amino acid sequence	Identification	Start codon	Kozak context	Exp. count	Data sources	ORF annotations	Cell types
533482	270	CTG ...	LGT ...	Ribo-seq	CTG		14	sORFs_org_Human	sORF	BJ, Brain, HEK293, HEK293T, HeLa, hES, loayza_puch_2016, MM1S, Monocyte, RPE-1, THP-1, U2OS
2149	144	GTG ...	VSK ...	Ribo-seq	GTG	moderate	21	sORFs_org_Human	sORF	BJ, Blood, Brain, HEK293, HEK293T, HeLa, hES, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, THP-1, U2OS
526664	42	TTG ...	LTS ...	Ribo-seq	TTG	weak	15	sORFs_org_Human	sORF	Blood, Brain, HAP1, HEK293, HEK293T, HeLa, HFF, loayza_puch_2016, MDA-MB-231, MM1S, RPE-1, U2OS
915743	81	CCT ...	PPS ...	Ribo-seq	CCT		1	sORFs_org_Human	sORF	Jurkat
7839	195	CTG ...	LPG ...	Ribo-seq	CTG		25	sORFs_org_Human	sORF	BJ, Blood, Brain, HAP1, HEK293, HEK293T, HeLa, hES, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, THP-1, U2OS
2144	69	ATG ...	MAS ...	Ribo-seq	ATG	moderate	17	sORFs_org_Human	sORF	BJ, Blood, HEK293, HeLa, hES, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, THP-1, U2OS
2616873	153	AGA ...	RLA ...	Ribo-seq	AGA		2	sORFs_org_Human	sORF	HeLa, THP-1
2147	120	TTG ...	LGR ...	Ribo-seq	TTG	moderate	20	sORFs_org_Human	sORF	BJ, Blood, Brain, HEK293, HeLa, hES, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, THP-1, U2OS
936108	30	ATG ...	MAR ...	Ribo-seq	ATG	strong	10	sORFs_org_Human	sORF	Blood, Brain, HAP1, HEK293, HeLa, MDA-MB-231, RPE-1
2153	30	CTG ...	LQT ...	Ribo-seq	CTG		1	sORFs_org_Human	sORF	Flp-In_T-REx-293
2151	156	CTG ...	LPW ...	Ribo-seq	CTG	weak	20	sORFs_org_Human	sORF	BJ, Blood, Brain, HEK293, HeLa, hES, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, THP-1, U2OS
915747	165	AGT ...	STT ...	Ribo-seq	AGT		1	sORFs_org_Human	sORF	Jurkat
2616118	171	CAG ...	QRR ...	Ribo-seq	CAG	moderate	2	sORFs_org_Human	sORF	HeLa, THP-1
2616116	42	AGC ...	SRS ...	Ribo-seq	AGC	weak	2	sORFs_org_Human	sORF	HeLa, THP-1
533484	276	TTG ...	LAL ...	Ribo-seq	TTG		12	sORFs_org_Human	sORF	BJ, Brain, HEK293T, HeLa, hES, loayza_puch_2016, MM1S, Monocyte, RPE-1, THP-1, U2OS
282253	54	ATG ...	MVP ...	Ribo-seq	ATG	strong	21	sORFs_org_Human	sORF	BJ, Blood, Brain, Flp-In_T-REx-293, HAP1, HEK293, HEK293T, HeLa, hES, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, U2OS
7841	183	CTG ...	LSP ...	Ribo-seq	CTG		24	sORFs_org_Human	sORF	BJ, Blood, Brain, HAP1, HEK293, HEK293T, HeLa, hES, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, THP-1, U2OS
533486	300	ATG ...	MPL ...	Ribo-seq	ATG		9	sORFs_org_Human	sORF	BJ, Brain, HEK293T, HeLa, hES, loayza_puch_2016, MM1S, Monocyte, U2OS
7837	150	TTG ...	LAS ...	Ribo-seq	TTG		21	sORFs_org_Human	sORF	BJ, Blood, Brain, HAP1, HEK293, HEK293T, HeLa, hES, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, THP-1, U2OS
3199897	36	AGC ...	SWW ...	Ribo-seq	AGC	moderate	1	sORFs_org_Human	sORF	HEK293
3199899	81	CAG ...	QPS ...	Ribo-seq	CAG	weak	1	sORFs_org_Human	sORF	HEK293
915745	114	TCT ...	SST ...	Ribo-seq	TCT		1	sORFs_org_Human	sORF	Jurkat

9437

ENST00000637937

CTSD-214

processed_transcript

MetamORF ORF ID	ORF length	Nucleic sequence	Amino acid sequence	Identification	Start codon	Kozak context	Exp. count	Data sources	ORF annotations	Cell types
9436	222	CTG ...	LKL ...	Ribo-seq	CTG		32	sORFs_org_Human	ncRNA sORF	BJ, Blood, Brain, Flp-In_T-REx-293, guo_2014, HAP1, HEK293, HEK293T, HeLa, hES, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, THP-1, U2OS
917610	210	GGC ...	GKG ...	Ribo-seq	GGC		3	sORFs_org_Human	ncRNA sORF	HeLa, Jurkat, THP-1
9447	195	CTG ...	LSP ...	Ribo-seq	CTG		32	sORFs_org_Human	ncRNA sORF	BJ, Blood, Brain, Flp-In_T-REx-293, guo_2014, HAP1, HEK293, HEK293T, HeLa, hES, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, THP-1, U2OS
917608	57	CCC ...	PAR ...	Ribo-seq	CCC		1	sORFs_org_Human	ncRNA sORF	Jurkat
9443	246	GTG ...	VST ...	Ribo-seq	GTG		32	sORFs_org_Human	ncRNA sORF	BJ, Blood, Brain, Flp-In_T-REx-293, guo_2014, HAP1, HEK293, HEK293T, HeLa, hES, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, THP-1, U2OS
2617008	225	ACA ...	TLK ...	Ribo-seq	ACA		2	sORFs_org_Human	ncRNA sORF	HeLa, THP-1
9445	63	ATG ...	MSP ...	Ribo-seq	ATG		20	sORFs_org_Human	ncRNA sORF	BJ, Blood, Brain, HEK293, HEK293T, HeLa, hES, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, THP-1, U2OS
9439	216	CTG ...	LGG ...	Ribo-seq	CTG		32	sORFs_org_Human	ncRNA sORF	BJ, Blood, Brain, Flp-In_T-REx-293, guo_2014, HAP1, HEK293, HEK293T, HeLa, hES, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, THP-1, U2OS
9441	237	CTG ...	LPA ...	Ribo-seq	CTG		32	sORFs_org_Human	ncRNA sORF	BJ, Blood, Brain, Flp-In_T-REx-293, guo_2014, HAP1, HEK293, HEK293T, HeLa, hES, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, THP-1, U2OS
409852	261	ATC ...	IPC ...	Ribo-seq	ATC		1	sORFs_org_Human	ncRNA sORF	HCT116

2125

ENST00000429746

CTSD-203

protein_coding

MetamORF ORF ID	ORF length	Nucleic sequence	Amino acid sequence	Identification	Start codon	Kozak context	Exp. count	Data sources	ORF annotations	Cell types
2124	48	CTG ...	LDP ...	Ribo-seq	CTG	strong	23	sORFs_org_Human	sORF	BJ, Blood, Brain, HAP1, HEK293, HeLa, hES, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, THP-1, U2OS
2616869	78	TGG ...	WEA ...	Ribo-seq	TGG		2	sORFs_org_Human	sORF	HeLa, THP-1
2127	99	CTG ...	LPE ...	Ribo-seq	CTG	weak	25	sORFs_org_Human	sORF	BJ, Blood, Brain, HAP1, HEK293, HEK293T, HeLa, hES, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, RPE-1, THP-1, U2OS
903335	108	GAA ...	EDP ...	Ribo-seq	GAA	moderate	2	sORFs_org_Human	sORF	HEK293, Jurkat

2130

ENST00000497544

CTSD-206

retained_intron

MetamORF ORF ID	ORF length	Nucleic sequence	Amino acid sequence	Identification	Start codon	Kozak context	Exp. count	Data sources	ORF annotations	Cell types
2134	93	GTG ...	VRP ...	Ribo-seq	GTG	moderate	24	sORFs_org_Human	Intronic sORF	BJ, Blood, Brain, HAP1, HEK293, HEK293T, HeLa, hES, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, THP-1, U2OS
903341	102	ACT ...	TEA ...	Ribo-seq	ACT		1	sORFs_org_Human	Intronic sORF	Jurkat
2132	84	TTG ...	LWT ...	Ribo-seq	TTG	weak	24	sORFs_org_Human	Intronic sORF	BJ, Blood, Brain, HAP1, HEK293, HEK293T, HeLa, hES, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, THP-1, U2OS
2140	237	GTG ...	VAS ...	Ribo-seq	GTG	strong	16	sORFs_org_Human	Intronic sORF	BJ, Brain, guo_2014, HEK293, HeLa, hES, HFF, loayza_puch_2016, MM1S, U2OS
2142	243	GTG ...	VEV ...	Ribo-seq	GTG	moderate	18	sORFs_org_Human	Intronic sORF	BJ, Brain, guo_2014, HEK293, HeLa, hES, HFF, loayza_puch_2016, MM1S, THP-1, U2OS
2136	219	CTG ...	LCK ...	Ribo-seq	CTG	moderate	12	sORFs_org_Human	Intronic sORF	BJ, Brain, guo_2014, HEK293, HeLa, hES, HFF, loayza_puch_2016
481628	48	CTG ...	LNV ...	Ribo-seq	CTG		6	sORFs_org_Human	Intronic sORF	Flp-In_T-REx-293, HEK293, HEK293T, HeLa
2616111	30	AGG ...	RCP ...	Ribo-seq	AGG	moderate	2	sORFs_org_Human	Intronic sORF	HeLa, THP-1
2138	225	CTG ...	LTL ...	Ribo-seq	CTG	moderate	13	sORFs_org_Human	Intronic sORF	BJ, Brain, guo_2014, HEK293, HeLa, hES, HFF, loayza_puch_2016, MM1S
1007345	192	GTG ...	VDT ...	Ribo-seq	GTG	strong	3	sORFs_org_Human	Intronic sORF	guo_2014, HEK293, HeLa
282241	36	GTG ...	VRR ...	Ribo-seq	GTG	weak	21	sORFs_org_Human	Intronic sORF	BJ, Blood, Brain, HAP1, HEK293, HEK293T, HeLa, hES, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, THP-1, U2OS
903339	51	TGG ...	WEA ...	Ribo-seq	TGG	strong	3	sORFs_org_Human	Intronic sORF	HeLa, Jurkat, THP-1
1007343	168	GTG ...	VGP ...	Ribo-seq	GTG	strong	1	sORFs_org_Human	Intronic sORF	guo_2014
481630	57	CTG ...	LSY ...	Ribo-seq	CTG		5	sORFs_org_Human	Intronic sORF	Flp-In_T-REx-293, HEK293, HEK293T, HeLa
903337	78	GGC ...	GKG ...	Ribo-seq	GGC		3	sORFs_org_Human	Intronic sORF	HeLa, Jurkat, THP-1
2616109	93	ACA ...	TLK ...	Ribo-seq	ACA		2	sORFs_org_Human	Intronic sORF	HeLa, THP-1
2129	45	ATG ...	MRC ...	Ribo-seq	ATG	weak	21	sORFs_org_Human	Intronic sORF	BJ, Blood, Brain, guo_2014, HAP1, HEK293, HeLa, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, RPE-1, THP-1, U2OS

282247

ENST00000438213

CTSD-205

protein_coding

MetamORF ORF ID	ORF length	Nucleic sequence	Amino acid sequence	Identification	Start codon	Kozak context	Exp. count	Data sources	ORF annotations	Cell types
282246	45	GTG ...	VAQ ...	Ribo-seq	GTG	moderate	7	sORFs_org_Human	sORF	LCL, loayza_puch_2016, MDA-MB-231, Monocyte, RPE-1, THP-1
2616114	36	ACT ...	TPS ...	Ribo-seq	ACT	weak	2	sORFs_org_Human	sORF	HeLa, THP-1
633503	48	ATG ...	MSP ...	Ribo-seq	ATG		11	sORFs_org_Human	sORF	BJ, Blood, Brain, HeLa, hES, loayza_puch_2016, MDA-MB-231, MM1S, RPE-1, THP-1

292929

ENST00000636843

CTSD-208

protein_coding

MetamORF ORF ID	ORF length	Nucleic sequence	Amino acid sequence	Identification	Start codon	Kozak context	Exp. count	Data sources	ORF annotations	Cell types
9445		ATG ...	MSP ...	Ribo-seq	ATG			sORFs_org_Human		HeLa
292928		CTG ...	LHP ...	Ribo-seq	CTG			sORFs_org_Human		BJ, Blood, HEK293, HEK293T, HeLa, LCL, loayza_puch_2016, MDA-MB-231, RPE-1, THP-1, U2OS

ORFs

67 ORFs
8 known transcripts
73 ORF to known transcript associations

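MetamORF ORF ID

The MetamORF ID of the ORF. This is a unique ID referring to the ORF.

ORF length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of the exons constituting the ORF (thus excluding any introns) and includes both the start and the stop codons.

Nucleic sequence

The nucleic sequence of the ORF.

Amino acid sequence

The amino acid sequence of the ORF.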

Start codon

The start codon sequence of the ORF.

ORF annotations

A comma-separated list of all the annotations computed by our algorithm for the ORF. This list includes the annotations computed for the ORF for all transcripts. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of all the cell types in which the ORF has already been identified.

Transcripts

Display all the transcripts related to the entry.

633513

TTG...

LPK...

TTG

sORF

loayza_puch_2016, MDA-MB-231, RPE-1, THP-1

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2154	ENST00000637381	CTSD-210	protein_coding	Ribo-seq	weak	4	sORFs_org_Human

282253

ATG...

MVP...

ATG

Alternative InCDS NMD Overlapping sORF

Brain_tumor, HEK293, HeLa, HFF, BJ, Blood, Brain, Flp-In_T-REx-293, HAP1, HEK293T, hES, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
3883589	ENST00000433655	CTSD-204	nonsense_mediated_decay	Ribo-seq	strong	1	Johnstone2016
2145	ENST00000367196	CTSD-202	protein_coding	Ribo-seq	strong	21	sORFs_org_Human

3311971

831

ATG...

MQP...

ATG

CDS NMD

Brain_tumor, HEK293, HeLa, HFF

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
3883589	ENST00000433655	CTSD-204	nonsense_mediated_decay	Ribo-seq		1	Johnstone2016
3311963	UNKNOWN_TRANSCRIPT			Ribo-seq		1	Erhard2018

533482

270

CTG...

LGT...

CTG

sORF

BJ, Brain, HEK293, HEK293T, HeLa, hES, loayza_puch_2016, MM1S, Monocyte, RPE-1, THP-1, U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2145	ENST00000367196	CTSD-202	protein_coding	Ribo-seq		14	sORFs_org_Human

9436

222

CTG...

LKL...

CTG

ncRNA
sORF

BJ, Blood, Brain,
Flp-In_T-REx-293, guo_2014, HAP1,
HEK293, HEK293T, HeLa,
hES, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
9437	ENST00000637937	CTSD-214	processed_transcript	Ribo-seq		32	sORFs_org_Human

2124

CTG...

LDP...

CTG

sORF

BJ, Blood, Brain,
HAP1, HEK293, HeLa,
hES, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2125	ENST00000429746	CTSD-203	protein_coding	Ribo-seq	strong	23	sORFs_org_Human

2134

GTG...

VRP...

GTG

Intronic
sORF

BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, HFF,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, Monocyte, RPE-1,
THP-1, U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2130	ENST00000497544	CTSD-206	retained_intron	Ribo-seq	moderate	24	sORFs_org_Human

903353

CAA...

QVQ...

CAA

sORF

Jurkat

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2154	ENST00000637381	CTSD-210	protein_coding	Ribo-seq		1	sORFs_org_Human

2149

144

GTG...

VSK...

GTG

sORF

BJ, Blood, Brain,
HEK293, HEK293T, HeLa,
hES, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2145	ENST00000367196	CTSD-202	protein_coding	Ribo-seq	moderate	21	sORFs_org_Human

526664

TTG...

LTS...

TTG

sORF

Blood, Brain, HAP1,
HEK293, HEK293T, HeLa,
HFF, loayza_puch_2016, MDA-MB-231,
MM1S, RPE-1, U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2145	ENST00000367196	CTSD-202	protein_coding	Ribo-seq	weak	15	sORFs_org_Human

917610

210

GGC...

GKG...

GGC

ncRNA
sORF

HeLa, Jurkat, THP-1

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
9437	ENST00000637937	CTSD-214	processed_transcript	Ribo-seq		3	sORFs_org_Human

9447

195

CTG...

LSP...

CTG

ncRNA
sORF

BJ, Blood, Brain,
Flp-In_T-REx-293, guo_2014, HAP1,
HEK293, HEK293T, HeLa,
hES, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
9437	ENST00000637937	CTSD-214	processed_transcript	Ribo-seq		32	sORFs_org_Human

903357

234

AGT...

STT...

AGT

sORF

Jurkat

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2154	ENST00000637381	CTSD-210	protein_coding	Ribo-seq		1	sORFs_org_Human

917608

CCC...

PAR...

CCC

ncRNA
sORF

Jurkat

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
9437	ENST00000637937	CTSD-214	processed_transcript	Ribo-seq		1	sORFs_org_Human

9443

246

GTG...

VST...

GTG

ncRNA
sORF

BJ, Blood, Brain,
Flp-In_T-REx-293, guo_2014, HAP1,
HEK293, HEK293T, HeLa,
hES, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
9437	ENST00000637937	CTSD-214	processed_transcript	Ribo-seq		32	sORFs_org_Human

915743

CCT...

PPS...

CCT

sORF

Jurkat

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2145	ENST00000367196	CTSD-202	protein_coding	Ribo-seq		1	sORFs_org_Human

7839

195

CTG...

LPG...

CTG

sORF

BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, HFF,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, Monocyte, RPE-1,
THP-1, U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2145	ENST00000367196	CTSD-202	protein_coding	Ribo-seq		25	sORFs_org_Human

2144

ATG...

MAS...

ATG

sORF
Alternative
InCDS
NMD
Overlapping

BJ, Blood, HEK293,
HeLa, hES, HFF,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, Monocyte, RPE-1,
THP-1, U2OS, Brain_tumor

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2145	ENST00000367196	CTSD-202	protein_coding	Ribo-seq	moderate	17	sORFs_org_Human
3883589	ENST00000433655	CTSD-204	nonsense_mediated_decay	Ribo-seq	moderate	1	Johnstone2016

903341

102

ACT...

TEA...

ACT

Intronic
sORF

Jurkat

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2130	ENST00000497544	CTSD-206	retained_intron	Ribo-seq		1	sORFs_org_Human

2617008

225

ACA...

TLK...

ACA

ncRNA
sORF

HeLa, THP-1

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
9437	ENST00000637937	CTSD-214	processed_transcript	Ribo-seq		2	sORFs_org_Human

2616873

153

AGA...

RLA...

AGA

sORF

HeLa, THP-1

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2145	ENST00000367196	CTSD-202	protein_coding	Ribo-seq		2	sORFs_org_Human

9445

ATG...

MSP...

ATG

ncRNA
sORF
Alternative
InCDS
Overlapping
Downstream
NMD

BJ, Blood, Brain,
HEK293, HEK293T, HeLa,
hES, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS, Brain_tumor

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
9437	ENST00000637937	CTSD-214	processed_transcript	Ribo-seq		20	sORFs_org_Human
292929	ENST00000636843	CTSD-208	protein_coding	Ribo-seq		1	sORFs_org_Human
3883589	ENST00000433655	CTSD-204	nonsense_mediated_decay	Ribo-seq		1	Johnstone2016

2153

CTG...

LQT...

CTG

sORF

BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS, Flp-In_T-REx-293

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2154	ENST00000637381	CTSD-210	protein_coding	Ribo-seq		23	sORFs_org_Human
2145	ENST00000367196	CTSD-202	protein_coding	Ribo-seq		1	sORFs_org_Human

2132

TTG...

LWT...

TTG

Intronic
sORF

BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, HFF,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, Monocyte, RPE-1,
THP-1, U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2130	ENST00000497544	CTSD-206	retained_intron	Ribo-seq	weak	24	sORFs_org_Human

2129

ATG...

MRC...

ATG

Downstream
NMD
sORF
Intronic

Brain_tumor, HEK293, HeLa,
HFF, BJ, Blood,
Brain, guo_2014, HAP1,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, RPE-1, THP-1,
U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
3883589	ENST00000433655	CTSD-204	nonsense_mediated_decay	Ribo-seq	weak	1	Johnstone2016
2130	ENST00000497544	CTSD-206	retained_intron	Ribo-seq	weak	21	sORFs_org_Human

9439

216

CTG...

LGG...

CTG

ncRNA
sORF

BJ, Blood, Brain,
Flp-In_T-REx-293, guo_2014, HAP1,
HEK293, HEK293T, HeLa,
hES, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
9437	ENST00000637937	CTSD-214	processed_transcript	Ribo-seq		32	sORFs_org_Human

2147

120

TTG...

LGR...

TTG

sORF

BJ, Blood, Brain,
HEK293, HeLa, hES,
HFF, LCL, loayza_puch_2016,
MDA-MB-231, MM1S, Monocyte,
RPE-1, THP-1, U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2145	ENST00000367196	CTSD-202	protein_coding	Ribo-seq	moderate	20	sORFs_org_Human

2140

237

GTG...

VAS...

GTG

Intronic
sORF

BJ, Brain, guo_2014,
HEK293, HeLa, hES,
HFF, loayza_puch_2016, MM1S,
U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2130	ENST00000497544	CTSD-206	retained_intron	Ribo-seq	strong	16	sORFs_org_Human

2142

243

GTG...

VEV...

GTG

Intronic
sORF

BJ, Brain, guo_2014,
HEK293, HeLa, hES,
HFF, loayza_puch_2016, MM1S,
THP-1, U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2130	ENST00000497544	CTSD-206	retained_intron	Ribo-seq	moderate	18	sORFs_org_Human

936108

ATG...

MAR...

ATG

sORF

Blood, Brain, HAP1,
HEK293, HeLa, MDA-MB-231,
RPE-1

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2145	ENST00000367196	CTSD-202	protein_coding	Ribo-seq	strong	10	sORFs_org_Human

282246

GTG...

VAQ...

GTG

sORF

LCL, loayza_puch_2016, MDA-MB-231,
Monocyte, RPE-1, THP-1

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
282247	ENST00000438213	CTSD-205	protein_coding	Ribo-seq	moderate	7	sORFs_org_Human

2616869

TGG...

WEA...

TGG

sORF

HeLa, THP-1

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2125	ENST00000429746	CTSD-203	protein_coding	Ribo-seq		2	sORFs_org_Human

2136

219

CTG...

LCK...

CTG

Intronic
sORF

BJ, Brain, guo_2014,
HEK293, HeLa, hES,
HFF, loayza_puch_2016

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2130	ENST00000497544	CTSD-206	retained_intron	Ribo-seq	moderate	12	sORFs_org_Human

481628

CTG...

LNV...

CTG

Intronic
sORF

Flp-In_T-REx-293, HEK293, HEK293T,
HeLa

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2130	ENST00000497544	CTSD-206	retained_intron	Ribo-seq		6	sORFs_org_Human

2616111

AGG...

RCP...

AGG

Intronic
sORF

HeLa, THP-1

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2130	ENST00000497544	CTSD-206	retained_intron	Ribo-seq	moderate	2	sORFs_org_Human

2138

225

CTG...

LTL...

CTG

Intronic
sORF

BJ, Brain, guo_2014,
HEK293, HeLa, hES,
HFF, loayza_puch_2016, MM1S

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2130	ENST00000497544	CTSD-206	retained_intron	Ribo-seq	moderate	13	sORFs_org_Human

9441

237

CTG...

LPA...

CTG

ncRNA
sORF

BJ, Blood, Brain,
Flp-In_T-REx-293, guo_2014, HAP1,
HEK293, HEK293T, HeLa,
hES, HFF, LCL,
loayza_puch_2016, MDA-MB-231, MM1S,
Monocyte, RPE-1, THP-1,
U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
9437	ENST00000637937	CTSD-214	processed_transcript	Ribo-seq		32	sORFs_org_Human

2151

156

CTG...

LPW...

CTG

sORF

BJ, Blood, Brain,
HEK293, HeLa, hES,
HFF, LCL, loayza_puch_2016,
MDA-MB-231, MM1S, Monocyte,
RPE-1, THP-1, U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2145	ENST00000367196	CTSD-202	protein_coding	Ribo-seq	weak	20	sORFs_org_Human

903349

CGA...

RQV...

CGA

sORF

HeLa, Jurkat, THP-1

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2154	ENST00000637381	CTSD-210	protein_coding	Ribo-seq		3	sORFs_org_Human

903351

CAA...

QQR...

CAA

sORF

HeLa, Jurkat, THP-1

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2154	ENST00000637381	CTSD-210	protein_coding	Ribo-seq		3	sORFs_org_Human

1007345

192

GTG...

VDT...

GTG

Intronic
sORF

guo_2014, HEK293, HeLa

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2130	ENST00000497544	CTSD-206	retained_intron	Ribo-seq	strong	3	sORFs_org_Human

2616114

ACT...

TPS...

ACT

sORF

HeLa, THP-1

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
282247	ENST00000438213	CTSD-205	protein_coding	Ribo-seq	weak	2	sORFs_org_Human

282241

GTG...

VRR...

GTG

Intronic
sORF

BJ, Blood, Brain,
HAP1, HEK293, HEK293T,
HeLa, hES, HFF,
LCL, loayza_puch_2016, MDA-MB-231,
MM1S, Monocyte, RPE-1,
THP-1, U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2130	ENST00000497544	CTSD-206	retained_intron	Ribo-seq	weak	21	sORFs_org_Human

915747

165

AGT...

STT...

AGT

sORF

Jurkat

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2145	ENST00000367196	CTSD-202	protein_coding	Ribo-seq		1	sORFs_org_Human

2616118

171

CAG...

QRR...

CAG

sORF

HeLa, THP-1

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2145	ENST00000367196	CTSD-202	protein_coding	Ribo-seq	moderate	2	sORFs_org_Human

2616116

AGC...

SRS...

AGC

sORF

HeLa, THP-1

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2145	ENST00000367196	CTSD-202	protein_coding	Ribo-seq	weak	2	sORFs_org_Human

903339

TGG...

WEA...

TGG

Intronic
sORF

HeLa, Jurkat, THP-1

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2130	ENST00000497544	CTSD-206	retained_intron	Ribo-seq	strong	3	sORFs_org_Human

533484

276

TTG...

LAL...

TTG

sORF

BJ, Brain, HEK293T, HeLa, hES, loayza_puch_2016, MM1S, Monocyte, RPE-1, THP-1, U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2145	ENST00000367196	CTSD-202	protein_coding	Ribo-seq		12	sORFs_org_Human

1007343

168

GTG...

VGP...

GTG

Intronic
sORF

guo_2014

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2130	ENST00000497544	CTSD-206	retained_intron	Ribo-seq	strong	1	sORFs_org_Human

409852

261

ATC...

IPC...

ATC

ncRNA
sORF

HCT116

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
9437	ENST00000637937	CTSD-214	processed_transcript	Ribo-seq		1	sORFs_org_Human

481630

CTG...

LSY...

CTG

Intronic
sORF

Flp-In_T-REx-293, HEK293, HEK293T, HeLa

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2130	ENST00000497544	CTSD-206	retained_intron	Ribo-seq		5	sORFs_org_Human

2127

CTG...

LPE...

CTG

sORF

BJ, Blood, Brain, HAP1, HEK293, HEK293T, HeLa, hES, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, RPE-1, THP-1, U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2125	ENST00000429746	CTSD-203	protein_coding	Ribo-seq	weak	25	sORFs_org_Human

2156

GTG...

VGP...

GTG

sORF

BJ, Blood, Brain, HAP1, HEK293, HEK293T, HeLa, hES, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, THP-1, U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2154	ENST00000637381	CTSD-210	protein_coding	Ribo-seq		20	sORFs_org_Human

903337

GGC...

GKG...

GGC

Intronic
sORF

HeLa, Jurkat, THP-1

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2130	ENST00000497544	CTSD-206	retained_intron	Ribo-seq		3	sORFs_org_Human

7841

183

CTG...

LSP...

CTG

sORF

BJ, Blood, Brain, HAP1, HEK293, HEK293T, HeLa, hES, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, THP-1, U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2145	ENST00000367196	CTSD-202	protein_coding	Ribo-seq		24	sORFs_org_Human

1296804

219

TTG...

LAS...

TTG

sORF

HEK293T

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2154	ENST00000637381	CTSD-210	protein_coding	Ribo-seq		1	sORFs_org_Human

2616109

ACA...

TLK...

ACA

Intronic
sORF

HeLa, THP-1

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2130	ENST00000497544	CTSD-206	retained_intron	Ribo-seq		2	sORFs_org_Human

2158

GTG...

VLH...

GTG

sORF

BJ, Blood, Brain, HAP1, HEK293, HEK293T, HeLa, hES, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, THP-1, U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2154	ENST00000637381	CTSD-210	protein_coding	Ribo-seq		24	sORFs_org_Human

292928

CTG...

LHP...

CTG

Alternative
InCDS
Overlapping
sORF

BJ, Blood, HEK293, HEK293T, HeLa, LCL, loayza_puch_2016, MDA-MB-231, RPE-1, THP-1, U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
292929	ENST00000636843	CTSD-208	protein_coding	Ribo-seq		12	sORFs_org_Human

903355

AGT...

SRL...

AGT

sORF

HeLa, Jurkat, THP-1

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2154	ENST00000637381	CTSD-210	protein_coding	Ribo-seq		3	sORFs_org_Human

533486

300

ATG...

MPL...

ATG

sORF

BJ, Brain, HEK293T, HeLa, hES, loayza_puch_2016, MM1S, Monocyte, U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2145	ENST00000367196	CTSD-202	protein_coding	Ribo-seq		9	sORFs_org_Human

7837

150

TTG...

LAS...

TTG

sORF

BJ, Blood, Brain, HAP1, HEK293, HEK293T, HeLa, hES, HFF, LCL, loayza_puch_2016, MDA-MB-231, MM1S, Monocyte, RPE-1, THP-1, U2OS

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2145	ENST00000367196	CTSD-202	protein_coding	Ribo-seq		21	sORFs_org_Human

3199897

AGC...

SWW...

AGC

sORF

HEK293

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2145	ENST00000367196	CTSD-202	protein_coding	Ribo-seq	moderate	1	sORFs_org_Human

3199899

CAG...

QPS...

CAG

sORF

HEK293

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2145	ENST00000367196	CTSD-202	protein_coding	Ribo-seq	weak	1	sORFs_org_Human

915745

114

TCT...

SST...

TCT

sORF

Jurkat

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2145	ENST00000367196	CTSD-202	protein_coding	Ribo-seq		1	sORFs_org_Human

903335

108

GAA...

EDP...

GAA

sORF

HEK293, Jurkat

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
2125	ENST00000429746	CTSD-203	protein_coding	Ribo-seq	moderate	2	sORFs_org_Human

633503

ATG...

MSP...

ATG

sORF

BJ, Blood, Brain, HeLa, hES, loayza_puch_2016, MDA-MB-231, MM1S, RPE-1, THP-1

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
282247	ENST00000438213	CTSD-205	protein_coding	Ribo-seq		11	sORFs_org_Human
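
The transcript rows in this listing carry eight tab-separated fields (MetamORF transcript ID, transcript ID, transcript name, biotype, identification, Kozak context, experiment count, data sources), and the Kozak context field is sometimes empty. As a minimal sketch, assuming the rows are copied as plain tab-separated text, one row could be parsed as follows. The names `TranscriptRow` and `parse_transcript_row` are hypothetical helpers for illustration and are not part of MetamORF.

```python
from typing import NamedTuple, Optional


class TranscriptRow(NamedTuple):
    """One ORF-to-transcript association row (hypothetical container)."""
    metamorf_transcript_id: int
    transcript_id: str
    transcript_name: str
    biotype: str
    identification: str
    kozak_context: Optional[str]  # None when the column is empty
    exp_count: int
    data_sources: str


def parse_transcript_row(line: str) -> TranscriptRow:
    """Split a tab-separated row into named fields.

    Assumes exactly eight columns, in the order shown in the table header.
    """
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 8:
        raise ValueError(f"expected 8 tab-separated fields, got {len(fields)}")
    return TranscriptRow(
        metamorf_transcript_id=int(fields[0]),
        transcript_id=fields[1],
        transcript_name=fields[2],
        biotype=fields[3],
        identification=fields[4],
        kozak_context=fields[5] or None,  # empty string becomes None
        exp_count=int(fields[6]),
        data_sources=fields[7],
    )


# Usage with a row taken from the listing above (empty Kozak context).
row = parse_transcript_row(
    "2145\tENST00000367196\tCTSD-202\tprotein_coding\tRibo-seq\t\t12\tsORFs_org_Human"
)
print(row.transcript_name, row.kozak_context, row.exp_count)
```

Mapping the empty Kozak context to `None` keeps "not computed" distinct from the `weak`, `moderate` and `strong` values that appear in other rows.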
