Gene HGNC:20766 | MetamORF

Gene

ID CARD

HGNC:20766

Aliases

B-ALPHA-1
ENSG00000167552, FLJ25113, HGNC:20766
NCBI:7846, OFF:TUBA1A, TUBA1A
TUBA3

Chromosome

Transcripts

23 ORFs
6 known transcripts
17 ORF to known transcript associations

MetamORF transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

ORFs

Display all the ORFs related to the entry.

921458

ENST00000550254

TUBA1A-206

retained_intron

MetamORF ORF ID

The MetamORF ID of the ORF. This is a unique ID referring to the ORF.

ORF length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of the exons constituting the ORF (thus excluding any introns) and includes both the start and the stop codons.

Nucleic sequence

The nucleic sequence of the ORF.

Amino acid sequence

The amino acid sequence of the ORF.

Identification

The method used to identify the ORF. MetamORF currently integrates data from three main types of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advanced documentation for more information.

Start codon

The start codon sequence of the ORF.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identified the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

ORF annotations

A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of the cell types in which the ORF has already been identified on the transcript.

MetamORF ORF ID	ORF length	Nucleic sequence	Amino acid sequence	Identification	Start codon	Kozak context	Exp. count	Data sources	ORF annotations	Cell types
943416	51	GTG ...	VLD ...	Ribo-seq	GTG	weak	2	sORFs_org_Human	Intronic sORF	Brain, guo_2014
943414	48	TTG ...	LDR ...	Ribo-seq	TTG	strong	2	sORFs_org_Human	Intronic sORF	Brain, guo_2014
921457	123	ACA ...	TGK ...	Ribo-seq	ACA	strong	1	sORFs_org_Human	Intronic sORF	Jurkat
943412	30	CTG ...	LVC ...	Ribo-seq	CTG	strong	1	sORFs_org_Human	Intronic sORF	Brain
921465	108	CGA ...	RSS ...	Ribo-seq	CGA		2	sORFs_org_Human	Intronic sORF	BJ, Jurkat
943418	45	GTG ...	VRR ...	Ribo-seq	GTG	weak	4	sORFs_org_Human	Intronic sORF	BJ, Brain, HEK293, HFF
921460	138	CCT ...	PEQ ...	Ribo-seq	CCT	moderate	1	sORFs_org_Human	Intronic sORF	Jurkat
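
The ORF length column is defined as the sum of the exon lengths of the ORF, introns excluded and start and stop codons included. A minimal sketch of that arithmetic follows. The function name, the 1-based inclusive coordinate convention and the example coordinates are all assumptions made for illustration; they are not taken from MetamORF.

```python
def orf_length(exons):
    """Sum the lengths of the exonic segments of an ORF.

    `exons` is a list of (start, end) genomic coordinates, assumed here to be
    1-based and inclusive. Introns never appear in the list, so they are
    excluded from the total by construction.
    """
    total = 0
    for start, end in exons:
        if end < start:
            raise ValueError(f"exon end {end} precedes start {start}")
        total += end - start + 1  # inclusive coordinates
    return total


# Hypothetical two-exon ORF: 21 bp + 30 bp, with the intron in between ignored.
example_exons = [(100, 120), (200, 229)]
print(orf_length(example_exons))  # 51
```

Because the start and stop codons lie inside the exonic segments, no extra term is needed for them.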

107382

ENST00000546918

TUBA1A-203

protein_coding

MetamORF ORF ID	ORF length	Nucleic sequence	Amino acid sequence	Identification	Start codon	Kozak context	Exp. count	Data sources	ORF annotations	Cell types
107384		ATG ...	MRP ...	Ribo-seq	ATG	moderate		sORFs_org_Human		Brain, HEK293, HeLa, hES, HFF, RPE-1
107381		ATG ...	MTS ...	Ribo-seq	ATG	weak		sORFs_org_Human		Brain, HAP1, HEK293, HeLa, hES, HFF, MDA-MB-231, RPE-1

3328004

UNKNOWN_TRANSCRIPT

MetamORF ORF ID	ORF length	Nucleic sequence	Amino acid sequence	Identification	Start codon	Kozak context	Exp. count	Data sources	ORF annotations	Cell types
3328021	27	ATC ...	IDR ...	Ribo-seq	ATC		1	Erhard2018	sORF	HFF
3328018	36	ATC ...	ISS ...	Ribo-seq	ATC		1	Erhard2018	sORF	HFF
3328015	79	ATG ...	MPS ...	Ribo-seq	ATG		1	Erhard2018	sORF	HFF
3328012	336	ATG ...	MPS ...	Ribo-seq	ATG		1	Erhard2018		HFF
3328009	696	ATG ...	MRE ...	Ribo-seq	ATG		1	Erhard2018		HFF
3328006	659	ATG ...	MPS ...	Ribo-seq	ATG		1	Erhard2018		HFF
3328003	1356	ATG ...	MRE ...	Ribo-seq	ATG		1	Erhard2018		HFF
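
Rows in these ORF tables are tab-separated, and some cells (Kozak context, ORF annotations) are empty. The sketch below shows one way to read such a row without losing the empty cells. The field names and the `parse_row` helper are hypothetical and are not part of any MetamORF tool.

```python
# Hypothetical field names, one per column of the ORF tables on this page.
COLUMNS = [
    "orf_id", "orf_length", "nucleic_sequence", "amino_acid_sequence",
    "identification", "start_codon", "kozak_context", "exp_count",
    "data_sources", "orf_annotations", "cell_types",
]


def parse_row(line):
    """Parse one tab-separated ORF row into a dict.

    Splitting on an explicit "\t" keeps empty cells in place, whereas a bare
    str.split() would collapse them and shift the later columns.
    """
    fields = line.rstrip("\n").split("\t")
    if len(fields) != len(COLUMNS):
        raise ValueError(f"expected {len(COLUMNS)} fields, got {len(fields)}")
    record = dict(zip(COLUMNS, fields))
    record["orf_length"] = int(record["orf_length"])
    record["exp_count"] = int(record["exp_count"])
    # An empty Kozak context cell is kept as None rather than as "".
    record["kozak_context"] = record["kozak_context"] or None
    # Cell types are a comma-separated list.
    record["cell_types"] = [
        c.strip() for c in record["cell_types"].split(",") if c.strip()
    ]
    return record


row = "3328021\t27\tATC ...\tIDR ...\tRibo-seq\tATC\t\t1\tErhard2018\tsORF\tHFF"
print(parse_row(row)["kozak_context"])  # None
```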

410846

ENST00000550811

TUBA1A-208

protein_coding

MetamORF ORF ID	ORF length	Nucleic sequence	Amino acid sequence	Identification	Start codon	Kozak context	Exp. count	Data sources	ORF annotations	Cell types
410845	42	AGA ...	RRG ...	Ribo-seq	AGA		2	sORFs_org_Human	sORF	HCT116, Jurkat
921463	54	CCT ...	PSS ...	Ribo-seq	CCT		1	sORFs_org_Human	sORF	Jurkat

943422

ENST00000301071

TUBA1A-202

protein_coding

MetamORF ORF ID	ORF length	Nucleic sequence	Amino acid sequence	Identification	Start codon	Kozak context	Exp. count	Data sources	ORF annotations	Cell types
1828300	42	ATG ...	MLS ...	Ribo-seq	ATG	weak	1	sORFs_org_Human	Alternative Downstream sORF	HeLa
1828298	60	CTG ...	LTY ...	Ribo-seq	CTG	weak	1	sORFs_org_Human	Alternative Downstream sORF	HeLa
943421	48	ATG ...	MCR ...	Ribo-seq	ATG	weak	7	sORFs_org_Human	Alternative sORF Upstream	BJ, Brain, HEK293, HeLa, hES, MDA-MB-231, RPE-1

10779

ENST00000550767

TUBA1A-207

protein_coding

MetamORF ORF ID	ORF length	Nucleic sequence	Amino acid sequence	Identification	Start codon	Kozak context	Exp. count	Data sources	ORF annotations	Cell types
10778		CTG ...	LVR ...	Ribo-seq	CTG	moderate		sORFs_org_Human		BJ, Brain, HEK293, hES, HFF, LCL, MDA-MB-231, RPE-1
107384		ATG ...	MRP ...	Ribo-seq	ATG	moderate		Johnstone2016		Brain_tumor, HEK293, HeLa, HFF

921470

ENST00000552924

TUBA1A-209

protein_coding

MetamORF ORF ID	ORF length	Nucleic sequence	Amino acid sequence	Identification	Start codon	Kozak context	Exp. count	Data sources	ORF annotations	Cell types
921469	120	GCT ...	ARQ ...	Ribo-seq	GCT		1	sORFs_org_Human	sORF	Jurkat
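
The summary counts given for this gene (23 ORFs, 6 known transcripts, 17 ORF to known transcript associations) can be checked against the per-transcript tables above. The sketch below transcribes the MetamORF transcript and ORF IDs from those tables into a dictionary and recomputes the three numbers. The `summarize` helper is hypothetical, and treating the UNKNOWN_TRANSCRIPT entry as the only non-known transcript is an assumption.

```python
# MetamORF transcript ID -> MetamORF ORF IDs, transcribed from the tables above.
ASSOCIATIONS = {
    "921458": ["943416", "943414", "921457", "943412", "921465", "943418", "921460"],
    "107382": ["107384", "107381"],
    "3328004": ["3328021", "3328018", "3328015", "3328012", "3328009", "3328006", "3328003"],
    "410846": ["410845", "921463"],
    "943422": ["1828300", "1828298", "943421"],
    "10779": ["10778", "107384"],
    "921470": ["921469"],
}

# Assumption: only the entry labelled UNKNOWN_TRANSCRIPT is excluded from "known".
UNKNOWN_TRANSCRIPTS = {"3328004"}


def summarize(associations, unknown):
    """Return (distinct ORFs, known transcripts, ORF-to-known-transcript pairs)."""
    distinct_orfs = {orf for orfs in associations.values() for orf in orfs}
    known = [t for t in associations if t not in unknown]
    n_pairs = sum(len(associations[t]) for t in known)
    return len(distinct_orfs), len(known), n_pairs


print(summarize(ASSOCIATIONS, UNKNOWN_TRANSCRIPTS))  # (23, 6, 17)
```

ORF 107384 appears on two transcripts (107382 and 10779), which is why there are 24 rows in total but only 23 distinct ORFs.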

ORFs

23 ORFs
6 known transcripts
17 ORF to known transcript associations

MetamORF ORF ID

The MetamORF ID of the ORF. This is a unique ID referring to the ORF.

ORF Length

Nucleic sequence

The nucleic sequence of the ORF.

Amino acid sequence

The amino acid sequence of the ORF.

Start codon

The start codon sequence of the ORF.

ORF annotations

A comma-separated list of all the annotations computed by our algorithm for the ORF. This list includes the annotations computed for the ORF for all transcripts. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of all the cell types in which the ORF has already been identified.

Transcripts

Display all the transcripts related to the entry.

943416

GTG...

VLD...

GTG

Intronic sORF

Brain, guo_2014

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
921458	ENST00000550254	TUBA1A-206	retained_intron	Ribo-seq	weak	2	sORFs_org_Human

943414

TTG...

LDR...

TTG

Intronic sORF

Brain, guo_2014

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
921458	ENST00000550254	TUBA1A-206	retained_intron	Ribo-seq	strong	2	sORFs_org_Human

921457

123

ACA...

TGK...

ACA

Intronic sORF

Jurkat

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
921458	ENST00000550254	TUBA1A-206	retained_intron	Ribo-seq	strong	1	sORFs_org_Human

107384

ATG...

MRP...

ATG

Downstream sORF Alternative InCDS Overlapping

Brain, HEK293, HeLa, hES, HFF, RPE-1, Brain_tumor

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
107382	ENST00000546918	TUBA1A-203	protein_coding	Ribo-seq	moderate	9	sORFs_org_Human
10779	ENST00000550767	TUBA1A-207	protein_coding	Ribo-seq	moderate	1	Johnstone2016

943412

CTG...

LVC...

CTG

Intronic sORF

Brain

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
921458	ENST00000550254	TUBA1A-206	retained_intron	Ribo-seq	strong	1	sORFs_org_Human

3328021

ATC...

IDR...

ATC

sORF

HFF

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
3328004	UNKNOWN_TRANSCRIPT			Ribo-seq		1	Erhard2018

3328018

ATC...

ISS...

ATC

sORF

HFF

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
3328004	UNKNOWN_TRANSCRIPT			Ribo-seq		1	Erhard2018

921465

108

CGA...

RSS...

CGA

Intronic sORF

BJ, Jurkat

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
921458	ENST00000550254	TUBA1A-206	retained_intron	Ribo-seq		2	sORFs_org_Human

410845

AGA...

RRG...

AGA

sORF

HCT116, Jurkat

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
410846	ENST00000550811	TUBA1A-208	protein_coding	Ribo-seq		2	sORFs_org_Human

1828300

ATG...

MLS...

ATG

Alternative Downstream sORF

HeLa

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
943422	ENST00000301071	TUBA1A-202	protein_coding	Ribo-seq	weak	1	sORFs_org_Human

943418

GTG...

VRR...

GTG

Intronic sORF

BJ, Brain, HEK293, HFF

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
921458	ENST00000550254	TUBA1A-206	retained_intron	Ribo-seq	weak	4	sORFs_org_Human

10778

CTG...

LVR...

CTG

Alternative InCDS Overlapping sORF

BJ, Brain, HEK293, hES, HFF, LCL, MDA-MB-231, RPE-1

MetamORF transcript ID	Transcript ID	Transcript name	Transcript biotype	Identification	Kozak context	Exp. count	Data sources
10779	ENST00000550767	TUBA1A-207	protein_coding	Ribo-seq	moderate	16	sORFs_org_Human

3328015

ATG...

MPS...

ATG

sORF

HFF

3328004	UNKNOWN_TRANSCRIPT			Ribo-seq		1	Erhard2018

3328012

336

ATG...

MPS...

ATG

HFF

3328004	UNKNOWN_TRANSCRIPT			Ribo-seq		1	Erhard2018

921469

120

GCT...

ARQ...

GCT

sORF

Jurkat

921470	ENST00000552924	TUBA1A-209	protein_coding	Ribo-seq		1	sORFs_org_Human

3328009

696

ATG...

MRE...

ATG

HFF

3328004	UNKNOWN_TRANSCRIPT			Ribo-seq		1	Erhard2018

1828298

CTG...

LTY...

CTG

Alternative
Downstream
sORF

HeLa

943422	ENST00000301071	TUBA1A-202	protein_coding	Ribo-seq	weak	1	sORFs_org_Human

107381

ATG...

MTS...

ATG

Downstream
sORF

Brain, HAP1, HEK293,
HeLa, hES, HFF,
MDA-MB-231, RPE-1

107382	ENST00000546918	TUBA1A-203	protein_coding	Ribo-seq	weak	14	sORFs_org_Human

921460

138

CCT...

PEQ...

CCT

Intronic
sORF

Jurkat

921458	ENST00000550254	TUBA1A-206	retained_intron	Ribo-seq	moderate	1	sORFs_org_Human

943421

ATG...

MCR...

ATG

Alternative
sORF
Upstream

BJ, Brain, HEK293,
HeLa, hES, MDA-MB-231,
RPE-1

943422	ENST00000301071	TUBA1A-202	protein_coding	Ribo-seq	weak	7	sORFs_org_Human

3328006

659

ATG...

MPS...

ATG

HFF

3328004	UNKNOWN_TRANSCRIPT			Ribo-seq		1	Erhard2018

921463

CCT...

PSS...

CCT

sORF

Jurkat

410846	ENST00000550811	TUBA1A-208	protein_coding	Ribo-seq		1	sORFs_org_Human

3328003

1356

ATG...

MRE...

ATG

HFF

3328004	UNKNOWN_TRANSCRIPT			Ribo-seq		1	Erhard2018
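
The ORF-to-transcript association rows above are tab-separated and follow the eight columns described in the header (MetamORF transcript ID, Transcript ID, Transcript name, Transcript biotype, Identification, Kozak context, Exp. count, Data sources). As a minimal sketch of how such a row could be read programmatically, assuming the rows are copied verbatim from this page, the snippet below splits one line into a record. The class name TranscriptAssociation and the function parse_association are hypothetical names chosen for illustration and are not part of MetamORF.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class TranscriptAssociation:
    """One ORF-to-transcript row (hypothetical record type, not a MetamORF API)."""
    metamorf_transcript_id: int
    transcript_id: str
    transcript_name: Optional[str]
    transcript_biotype: Optional[str]
    identification: str
    kozak_context: Optional[str]
    exp_count: int
    data_sources: str


def parse_association(line: str) -> TranscriptAssociation:
    """Split a tab-separated row into its eight columns.

    Empty columns (e.g. for UNKNOWN_TRANSCRIPT rows) are mapped to None.
    """
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 8:
        raise ValueError(f"expected 8 tab-separated fields, got {len(fields)}")

    def optional(value: str) -> Optional[str]:
        return value or None

    return TranscriptAssociation(
        metamorf_transcript_id=int(fields[0]),
        transcript_id=fields[1],
        transcript_name=optional(fields[2]),
        transcript_biotype=optional(fields[3]),
        identification=fields[4],
        kozak_context=optional(fields[5]),
        exp_count=int(fields[6]),
        data_sources=fields[7],
    )


# Two rows taken from the table above.
known = parse_association(
    "943422\tENST00000301071\tTUBA1A-202\tprotein_coding\tRibo-seq\tweak\t7\tsORFs_org_Human"
)
unknown = parse_association(
    "3328004\tUNKNOWN_TRANSCRIPT\t\t\tRibo-seq\t\t1\tErhard2018"
)
```

Mapping empty columns to None keeps rows for UNKNOWN_TRANSCRIPT, which carry no transcript name, biotype or Kozak context, distinguishable from rows where those values are present.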
