Gene NCBI:66077 | MetamORF

Gene

ID CARD

NCBI:66077

Aliases

0610033H09Rik
Aip, Akip, Aurkaip1
ENSMUSG00000065990, MGI:1913327, MRP-S38
NCBI:66077, OFF:Aurkaip1

Chromosome

Transcripts

37 ORFs
6 known transcripts
58 ORF to known transcript associations

MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

ORFs

Display all the transcripts related to the entry.

56078

ENSMUST00000084097

Aurkaip1-201

protein_coding

MetamORF ORF ID The MetamORF ID of the ORF. This is an unique ID referring to the ORF.	ORF length The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.	Nucleic sequence The nucleic sequence of the ORF.	Amino acid sequence The amino acid sequence of the ORF.	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Start codon The start codon sequence of the ORF.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.	ORF annotations A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.	Cell types A comma-separated list of the cell types in which the ORF has already been identified on the transcript.
56401	90	TTG ...	LSA ...	Ribo-seq	TTG		2	sORFs_org_Mouse	Alternative Overlapping sORF Upstream	Brain, Spleen_B_cell
513249	129	ATG ...	MLF ...	Ribo-seq	ATG	moderate	2	sORFs_org_Mouse	InCDS Overlapping sORF	B_cell, Spleen_B_cell
511983	141	ATG ...	MSG ...	Ribo-seq	ATG		1	sORFs_org_Mouse	Alternative sORF Upstream	Spleen_B_cell
1225532	39	AAG ...	KPL ...	Ribo-seq	AAG	moderate	1	sORFs_org_Mouse	InCDS Overlapping sORF	E14
56077	33	ATG ...	MPL ...	Ribo-seq	ATG	moderate	1	sORFs_org_Mouse	InCDS Overlapping sORF	Testis
60814	48	ATG ...	MMS ...	Ribo-seq	ATG		6	sORFs_org_Mouse	Alternative sORF Upstream	BMDC, Brain, Liver, Spleen_B_cell, Testis
60998	45	ATG ...	MSG ...	Ribo-seq	ATG		7	sORFs_org_Mouse	Alternative sORF Upstream	BMDC, Brain, E14, Liver, Spleen_B_cell, Testis
510340	90	CTG ...	LAD ...	Ribo-seq	CTG	strong	1	sORFs_org_Mouse	Alternative InCDS Overlapping sORF	Spleen_B_cell
509199	81	CTG ...	LPP ...	Ribo-seq	CTG	moderate	2	sORFs_org_Mouse	InCDS Overlapping sORF	B_cell, Spleen_B_cell
510068	144	ATG ...	MMS ...	Ribo-seq	ATG		1	sORFs_org_Mouse	Alternative sORF Upstream	Spleen_B_cell
823230	69	CTG ...	LVR ...	Ribo-seq	CTG	moderate	1	sORFs_org_Mouse	InCDS Overlapping sORF	B_cell
825497	99	CTG ...	LQA ...	Ribo-seq	CTG	moderate	1	sORFs_org_Mouse	InCDS Overlapping sORF	B_cell
513766	138	TTG ...	LAA ...	Ribo-seq	TTG	moderate	2	sORFs_org_Mouse	InCDS Overlapping sORF	B_cell, Spleen_B_cell
57072	96	CTG ...	LPW ...	Ribo-seq	CTG	weak	1	sORFs_org_Mouse	Alternative InCDS Overlapping sORF	Brain

544515

ENSMUST00000105592

Aurkaip1-203

protein_coding

MetamORF ORF ID The MetamORF ID of the ORF. This is an unique ID referring to the ORF.	ORF length The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.	Nucleic sequence The nucleic sequence of the ORF.	Amino acid sequence The amino acid sequence of the ORF.	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Start codon The start codon sequence of the ORF.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.	ORF annotations A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.	Cell types A comma-separated list of the cell types in which the ORF has already been identified on the transcript.
513249	129	ATG ...	MLF ...	Ribo-seq	ATG	moderate	1	sORFs_org_Mouse	InCDS Overlapping sORF	Liver
1651987	33	CTG ...	LRT ...	Ribo-seq	CTG	weak	1	sORFs_org_Mouse	InCDS Overlapping sORF	MEF
57072	96	CTG ...	LPW ...	Ribo-seq	CTG	weak	2	sORFs_org_Mouse	Alternative InCDS Overlapping sORF	Liver, MEF
1378457	45	CTG ...	LLN ...	Ribo-seq	CTG	weak	1	sORFs_org_Mouse	InCDS Overlapping sORF	E14
510340	90	CTG ...	LAD ...	Ribo-seq	CTG	strong	2	sORFs_org_Mouse	Alternative InCDS Overlapping sORF	E14, NSC
1178413	120	GTG ...	VTS ...	Ribo-seq	GTG		2	sORFs_org_Mouse	Alternative Overlapping sORF Upstream	Liver, MEF
544514	78	CTG ...	LGC ...	Ribo-seq	CTG	strong	2	sORFs_org_Mouse	Alternative InCDS Overlapping sORF	3T3, MEF
56401	90	TTG ...	LSA ...	Ribo-seq	TTG		6	sORFs_org_Mouse	Alternative Overlapping sORF Upstream	BMDC, E14, Liver, MEF

165832

ENSMUST00000139651

Aurkaip1-205

protein_coding

MetamORF ORF ID The MetamORF ID of the ORF. This is an unique ID referring to the ORF.	ORF length The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.	Nucleic sequence The nucleic sequence of the ORF.	Amino acid sequence The amino acid sequence of the ORF.	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Start codon The start codon sequence of the ORF.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.	ORF annotations A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.	Cell types A comma-separated list of the cell types in which the ORF has already been identified on the transcript.
513249	129	ATG ...	MLF ...	Ribo-seq	ATG	moderate	1	sORFs_org_Mouse	sORF	Brain
825497	99	CTG ...	LQA ...	Ribo-seq	CTG	moderate	1	sORFs_org_Mouse	sORF	Brain
513766	138	TTG ...	LAA ...	Ribo-seq	TTG	moderate	1	sORFs_org_Mouse	sORF	Brain
56401	90	TTG ...	LSA ...	Ribo-seq	TTG		1	sORFs_org_Mouse	sORF	MEF
1430966	279	TTG ...	LGE ...	Ribo-seq	TTG		1	sORFs_org_Mouse	sORF	Liver
1230172	138	AAG ...	KIR ...	Ribo-seq	AAG		1	sORFs_org_Mouse	sORF	E14
1554953	45	TTG ...	LRS ...	Ribo-seq	TTG	moderate	1	sORFs_org_Mouse	sORF	Brain
509199	81	CTG ...	LPP ...	Ribo-seq	CTG	moderate	1	sORFs_org_Mouse	sORF	Brain
1731601	57	GTG ...	VQK ...	Ribo-seq	GTG	moderate	1	sORFs_org_Mouse	sORF	MEF
1657950	57	TTG ...	LCF ...	Ribo-seq	TTG		2	sORFs_org_Mouse	sORF	MEF
823230	69	CTG ...	LVR ...	Ribo-seq	CTG	moderate	1	sORFs_org_Mouse	sORF	Brain
544514	78	CTG ...	LGC ...	Ribo-seq	CTG	strong	1	sORFs_org_Mouse	sORF	MEF
165831	228	GTG ...	VHA ...	Ribo-seq	GTG		5	sORFs_org_Mouse	sORF	Brain, E14, Liver, MEF
57072	96	CTG ...	LPW ...	Ribo-seq	CTG	weak	1	sORFs_org_Mouse	sORF	Brain

1017477

ENSMUST00000129711

Aurkaip1-204

processed_transcript

MetamORF ORF ID The MetamORF ID of the ORF. This is an unique ID referring to the ORF.	ORF length The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.	Nucleic sequence The nucleic sequence of the ORF.	Amino acid sequence The amino acid sequence of the ORF.	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Start codon The start codon sequence of the ORF.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.	ORF annotations A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.	Cell types A comma-separated list of the cell types in which the ORF has already been identified on the transcript.
1017476	57	ATG ...	MMS ...	Ribo-seq	ATG		1	sORFs_org_Mouse	ncRNA sORF	Brain
1178413	120	GTG ...	VTS ...	Ribo-seq	GTG		3	sORFs_org_Mouse	ncRNA sORF	E14, R1E, v6-5
1082184	204	ATG ...	MSP ...	Ribo-seq	ATG		2	sORFs_org_Mouse	ncRNA sORF	R1E, v6-5
56401	90	TTG ...	LSA ...	Ribo-seq	TTG		2	sORFs_org_Mouse	ncRNA sORF	R1E, v6-5
1083127	168	ATG ...	MGY ...	Ribo-seq	ATG		2	sORFs_org_Mouse	ncRNA sORF	R1E, v6-5
1081657	210	TTG ...	LKM ...	Ribo-seq	TTG		2	sORFs_org_Mouse	ncRNA sORF	R1E, v6-5
1082839	228	TTG ...	LRS ...	Ribo-seq	TTG		2	sORFs_org_Mouse	ncRNA sORF	R1E, v6-5
57072	96	CTG ...	LPW ...	Ribo-seq	CTG	weak	2	sORFs_org_Mouse	ncRNA sORF	E14, R1E
1081420	171	GTG ...	VMG ...	Ribo-seq	GTG		2	sORFs_org_Mouse	ncRNA sORF	R1E, v6-5
58119	126	CTG ...	LCC ...	Ribo-seq	CTG		2	sORFs_org_Mouse	ncRNA sORF	E14, R1E

56402

ENSMUST00000151222

Aurkaip1-206

retained_intron

MetamORF ORF ID The MetamORF ID of the ORF. This is an unique ID referring to the ORF.	ORF length The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.	Nucleic sequence The nucleic sequence of the ORF.	Amino acid sequence The amino acid sequence of the ORF.	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Start codon The start codon sequence of the ORF.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.	ORF annotations A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.	Cell types A comma-separated list of the cell types in which the ORF has already been identified on the transcript.
56401	90	TTG ...	LSA ...	Ribo-seq	TTG	moderate	1	sORFs_org_Mouse	Intronic sORF	Testis
57072	96	CTG ...	LPW ...	Ribo-seq	CTG	weak	1	sORFs_org_Mouse	Intronic sORF	Testis
58119	126	CTG ...	LCC ...	Ribo-seq	CTG	weak	1	sORFs_org_Mouse	Intronic sORF	Testis

159159

ENSMUST00000105591

Aurkaip1-202

protein_coding

MetamORF ORF ID The MetamORF ID of the ORF. This is an unique ID referring to the ORF.	ORF length The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.	Nucleic sequence The nucleic sequence of the ORF.	Amino acid sequence The amino acid sequence of the ORF.	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Start codon The start codon sequence of the ORF.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.	ORF annotations A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.	Cell types A comma-separated list of the cell types in which the ORF has already been identified on the transcript.
166557	267	ATG ...	MSG ...	Ribo-seq	ATG		3	sORFs_org_Mouse	Alternative Overlapping sORF Upstream	Brain, E14, MEF
1017735	228	GTG ...	VRA ...	Ribo-seq	GTG		3	sORFs_org_Mouse	Alternative Overlapping sORF Upstream	Brain, E14, Liver
1017949	204	CTG ...	LRS ...	Ribo-seq	CTG		3	sORFs_org_Mouse	Alternative Overlapping sORF Upstream	Brain, E14, Liver
56077	33	ATG ...	MPL ...	Ribo-seq	ATG	moderate	1	Johnstone2016	InCDS Overlapping sORF	Glioma, Liver, MEF, MESC
56401	90	TTG ...	LSA ...	Ribo-seq	TTG		1	sORFs_org_Mouse	Alternative Overlapping sORF Upstream	Brain
1017388	231	TTG ...	LVR ...	Ribo-seq	TTG		3	sORFs_org_Mouse	Alternative Overlapping sORF Upstream	Brain, E14, Liver
513249	129	ATG ...	MLF ...	Ribo-seq	ATG	moderate	2	sORFs_org_Mouse Johnstone2016	InCDS Overlapping sORF	Glioma, Liver, MEF, MESC
166559	270	ATG ...	MMS ...	Ribo-seq	ATG		4	sORFs_org_Mouse Johnstone2016	Alternative Overlapping sORF Upstream	Brain, E14, Glioma, Liver, MEF, MESC
161658	246	TTG ...	LAC ...	Ribo-seq	TTG		5	sORFs_org_Mouse	Alternative Overlapping sORF Upstream	Brain, E14, Liver, MEF

3180326

UNKNOWN_TRANSCRIPT

MetamORF ORF ID The MetamORF ID of the ORF. This is an unique ID referring to the ORF.	ORF length The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.	Nucleic sequence The nucleic sequence of the ORF.	Amino acid sequence The amino acid sequence of the ORF.	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Start codon The start codon sequence of the ORF.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.	ORF annotations A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.	Cell types A comma-separated list of the cell types in which the ORF has already been identified on the transcript.
166559	270	ATG ...	MMS ...	Predicted	ATG		2	Samandi2017	sORF
513249	129	ATG ...	MLF ...	Predicted	ATG		1	Samandi2017	sORF

ORFs

37 ORFs
6 known transcripts
58 ORF to known transcript associations

MetamORF
ORF ID

The MetamORF ID of the ORF. This is an unique ID referring to the ORF.

ORF Length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.

Nucleic
sequence

The nucleic sequence of the ORF.

Amino acid
sequence

The amino acid sequence of the ORF.

Start codon

The start codon sequence of the ORF.

ORF annotations

A comma-separated list of all the annotations computed by our algorithm for the ORF. This list includes the annotations computed for the ORF for all transcripts. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of all the cell types in which the ORF has already been identified.

Transcripts

Display all the transcripts related to the entry.

56401

TTG...

LSA...

TTG

Alternative
Overlapping
sORF
Upstream
Intronic
ncRNA

Brain, Spleen_B_cell, Testis,
MEF, R1E, v6-5,
BMDC, E14, Liver

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
56078	ENSMUST00000084097	Aurkaip1-201	protein_coding	Ribo-seq		2	sORFs_org_Mouse
56402	ENSMUST00000151222	Aurkaip1-206	retained_intron	Ribo-seq	moderate	1	sORFs_org_Mouse
165832	ENSMUST00000139651	Aurkaip1-205	protein_coding	Ribo-seq		1	sORFs_org_Mouse
1017477	ENSMUST00000129711	Aurkaip1-204	processed_transcript	Ribo-seq		2	sORFs_org_Mouse
159159	ENSMUST00000105591	Aurkaip1-202	protein_coding	Ribo-seq		1	sORFs_org_Mouse
544515	ENSMUST00000105592	Aurkaip1-203	protein_coding	Ribo-seq		6	sORFs_org_Mouse

513249

129

ATG...

MLF...

ATG

InCDS
Overlapping
sORF

Liver, Brain, B_cell,
Spleen_B_cell, Glioma,
MEF, MESC

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
544515	ENSMUST00000105592	Aurkaip1-203	protein_coding	Ribo-seq	moderate	1	sORFs_org_Mouse
165832	ENSMUST00000139651	Aurkaip1-205	protein_coding	Ribo-seq	moderate	1	sORFs_org_Mouse
56078	ENSMUST00000084097	Aurkaip1-201	protein_coding	Ribo-seq	moderate	2	sORFs_org_Mouse
3180326	UNKNOWN_TRANSCRIPT			Predicted		1	Samandi2017
159159	ENSMUST00000105591	Aurkaip1-202	protein_coding	Ribo-seq	moderate	2	sORFs_org_Mouse Johnstone2016

1651987

CTG...

LRT...

CTG

InCDS
Overlapping
sORF

MEF

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
544515	ENSMUST00000105592	Aurkaip1-203	protein_coding	Ribo-seq	weak	1	sORFs_org_Mouse

1017476

ATG...

MMS...

ATG

ncRNA
sORF

Brain

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
1017477	ENSMUST00000129711	Aurkaip1-204	processed_transcript	Ribo-seq		1	sORFs_org_Mouse

825497

CTG...

LQA...

CTG

sORF
InCDS
Overlapping

Brain, B_cell

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
165832	ENSMUST00000139651	Aurkaip1-205	protein_coding	Ribo-seq	moderate	1	sORFs_org_Mouse
56078	ENSMUST00000084097	Aurkaip1-201	protein_coding	Ribo-seq	moderate	1	sORFs_org_Mouse

511983

141

ATG...

MSG...

ATG

Alternative
sORF
Upstream

Spleen_B_cell

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
56078	ENSMUST00000084097	Aurkaip1-201	protein_coding	Ribo-seq		1	sORFs_org_Mouse

1178413

120

GTG...

VTS...

GTG

ncRNA
sORF
Alternative
Overlapping
Upstream

E14, R1E, v6-5,
Liver, MEF

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
1017477	ENSMUST00000129711	Aurkaip1-204	processed_transcript	Ribo-seq		3	sORFs_org_Mouse
544515	ENSMUST00000105592	Aurkaip1-203	protein_coding	Ribo-seq		2	sORFs_org_Mouse

1225532

AAG...

KPL...

AAG

InCDS
Overlapping
sORF

E14

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
56078	ENSMUST00000084097	Aurkaip1-201	protein_coding	Ribo-seq	moderate	1	sORFs_org_Mouse

57072

CTG...

LPW...

CTG

Alternative
InCDS
Overlapping
sORF
Intronic
ncRNA

Liver, MEF, Testis,
E14, R1E, Brain

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
544515	ENSMUST00000105592	Aurkaip1-203	protein_coding	Ribo-seq	weak	2	sORFs_org_Mouse
56402	ENSMUST00000151222	Aurkaip1-206	retained_intron	Ribo-seq	weak	1	sORFs_org_Mouse
1017477	ENSMUST00000129711	Aurkaip1-204	processed_transcript	Ribo-seq	weak	2	sORFs_org_Mouse
56078	ENSMUST00000084097	Aurkaip1-201	protein_coding	Ribo-seq	weak	1	sORFs_org_Mouse
165832	ENSMUST00000139651	Aurkaip1-205	protein_coding	Ribo-seq	weak	1	sORFs_org_Mouse

56077

ATG...

MPL...

ATG

InCDS
Overlapping
sORF

Testis, Glioma, Liver,
MEF, MESC

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
56078	ENSMUST00000084097	Aurkaip1-201	protein_coding	Ribo-seq	moderate	1	sORFs_org_Mouse
159159	ENSMUST00000105591	Aurkaip1-202	protein_coding	Ribo-seq	moderate	1	Johnstone2016

166557

267

ATG...

MSG...

ATG

Alternative
Overlapping
sORF
Upstream

Brain, E14, MEF

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
159159	ENSMUST00000105591	Aurkaip1-202	protein_coding	Ribo-seq		3	sORFs_org_Mouse

1082184

204

ATG...

MSP...

ATG

ncRNA
sORF

R1E, v6-5

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
1017477	ENSMUST00000129711	Aurkaip1-204	processed_transcript	Ribo-seq		2	sORFs_org_Mouse

513766

138

TTG...

LAA...

TTG

sORF
InCDS
Overlapping

Brain, B_cell, Spleen_B_cell

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
165832	ENSMUST00000139651	Aurkaip1-205	protein_coding	Ribo-seq	moderate	1	sORFs_org_Mouse
56078	ENSMUST00000084097	Aurkaip1-201	protein_coding	Ribo-seq	moderate	2	sORFs_org_Mouse

60814

ATG...

MMS...

ATG

Alternative
sORF
Upstream

BMDC, Brain, Liver,
Spleen_B_cell, Testis

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
56078	ENSMUST00000084097	Aurkaip1-201	protein_coding	Ribo-seq		6	sORFs_org_Mouse

1378457

CTG...

LLN...

CTG

InCDS
Overlapping
sORF

E14

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
544515	ENSMUST00000105592	Aurkaip1-203	protein_coding	Ribo-seq	weak	1	sORFs_org_Mouse

1017735

228

GTG...

VRA...

GTG

Alternative
Overlapping
sORF
Upstream

Brain, E14, Liver

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
159159	ENSMUST00000105591	Aurkaip1-202	protein_coding	Ribo-seq		3	sORFs_org_Mouse

1083127

168

ATG...

MGY...

ATG

ncRNA
sORF

R1E, v6-5

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
1017477	ENSMUST00000129711	Aurkaip1-204	processed_transcript	Ribo-seq		2	sORFs_org_Mouse

510340

CTG...

LAD...

CTG

Alternative
InCDS
Overlapping
sORF

E14, NSC, Spleen_B_cell

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
544515	ENSMUST00000105592	Aurkaip1-203	protein_coding	Ribo-seq	strong	2	sORFs_org_Mouse
56078	ENSMUST00000084097	Aurkaip1-201	protein_coding	Ribo-seq	strong	1	sORFs_org_Mouse

1430966

279

TTG...

LGE...

TTG

sORF

Liver

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
165832	ENSMUST00000139651	Aurkaip1-205	protein_coding	Ribo-seq		1	sORFs_org_Mouse

166559

270

ATG...

MMS...

ATG

sORF
Alternative
Overlapping
Upstream

Brain, E14,
Glioma, Liver, MEF,
MESC

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
3180326	UNKNOWN_TRANSCRIPT			Predicted		2	Samandi2017
159159	ENSMUST00000105591	Aurkaip1-202	protein_coding	Ribo-seq		4	sORFs_org_Mouse Johnstone2016

60998

ATG...

MSG...

ATG

Alternative
sORF
Upstream

BMDC, Brain, E14,
Liver, Spleen_B_cell, Testis

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
56078	ENSMUST00000084097	Aurkaip1-201	protein_coding	Ribo-seq		7	sORFs_org_Mouse

1017949

204

CTG...

LRS...

CTG

Alternative
Overlapping
sORF
Upstream

Brain, E14, Liver

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
159159	ENSMUST00000105591	Aurkaip1-202	protein_coding	Ribo-seq		3	sORFs_org_Mouse

58119

126

CTG...

LCC...

CTG

Intronic
sORF
ncRNA

Testis, E14, R1E

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
56402	ENSMUST00000151222	Aurkaip1-206	retained_intron	Ribo-seq	weak	1	sORFs_org_Mouse
1017477	ENSMUST00000129711	Aurkaip1-204	processed_transcript	Ribo-seq		2	sORFs_org_Mouse

1230172

138

AAG...

KIR...

AAG

sORF

E14

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
165832	ENSMUST00000139651	Aurkaip1-205	protein_coding	Ribo-seq		1	sORFs_org_Mouse

1554953

TTG...

LRS...

TTG

sORF

Brain

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
165832	ENSMUST00000139651	Aurkaip1-205	protein_coding	Ribo-seq	moderate	1	sORFs_org_Mouse

1081657

210

TTG...

LKM...

TTG

ncRNA
sORF

R1E, v6-5

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
1017477	ENSMUST00000129711	Aurkaip1-204	processed_transcript	Ribo-seq		2	sORFs_org_Mouse

509199

CTG...

LPP...

CTG

InCDS
Overlapping
sORF

B_cell, Spleen_B_cell, Brain

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
56078	ENSMUST00000084097	Aurkaip1-201	protein_coding	Ribo-seq	moderate	2	sORFs_org_Mouse
165832	ENSMUST00000139651	Aurkaip1-205	protein_coding	Ribo-seq	moderate	1	sORFs_org_Mouse

510068

144

ATG...

MMS...

ATG

Alternative
sORF
Upstream

Spleen_B_cell

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
56078	ENSMUST00000084097	Aurkaip1-201	protein_coding	Ribo-seq		1	sORFs_org_Mouse

544514

CTG...

LGC...

CTG

Alternative
InCDS
Overlapping
sORF

3T3, MEF

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
544515	ENSMUST00000105592	Aurkaip1-203	protein_coding	Ribo-seq	strong	2	sORFs_org_Mouse
165832	ENSMUST00000139651	Aurkaip1-205	protein_coding	Ribo-seq	strong	1	sORFs_org_Mouse

823230

CTG...

LVR...

CTG

InCDS
Overlapping
sORF

B_cell, Brain

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
56078	ENSMUST00000084097	Aurkaip1-201	protein_coding	Ribo-seq	moderate	1	sORFs_org_Mouse
165832	ENSMUST00000139651	Aurkaip1-205	protein_coding	Ribo-seq	moderate	1	sORFs_org_Mouse

1082839

228

TTG...

LRS...

TTG

ncRNA
sORF

R1E, v6-5

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
1017477	ENSMUST00000129711	Aurkaip1-204	processed_transcript	Ribo-seq		2	sORFs_org_Mouse

1017388

231

TTG...

LVR...

TTG

Alternative
Overlapping
sORF
Upstream

Brain, E14, Liver

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
159159	ENSMUST00000105591	Aurkaip1-202	protein_coding	Ribo-seq		3	sORFs_org_Mouse

1731601

GTG...

VQK...

GTG

sORF

MEF

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
165832	ENSMUST00000139651	Aurkaip1-205	protein_coding	Ribo-seq	moderate	1	sORFs_org_Mouse

1657950

TTG...

LCF...

TTG

sORF

MEF

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
165832	ENSMUST00000139651	Aurkaip1-205	protein_coding	Ribo-seq		2	sORFs_org_Mouse

165831

228

GTG...

VHA...

GTG

sORF

Brain, E14, Liver,
MEF

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
165832	ENSMUST00000139651	Aurkaip1-205	protein_coding	Ribo-seq		5	sORFs_org_Mouse

1081420

171

GTG...

VMG...

GTG

ncRNA
sORF

R1E, v6-5

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
1017477	ENSMUST00000129711	Aurkaip1-204	processed_transcript	Ribo-seq		2	sORFs_org_Mouse

161658

246

TTG...

LAC...

TTG

Alternative
Overlapping
sORF
Upstream

Brain, E14, Liver,
MEF

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
159159	ENSMUST00000105591	Aurkaip1-202	protein_coding	Ribo-seq		5	sORFs_org_Mouse

Export data