Gene NCBI:66291 | MetamORF

Gene

ID CARD

NCBI:66291

Aliases

1810030N24Rik
2810406B13Rik, ENSMUSG00000028295, MGI:1913541
NCBI:66291, OFF:Smim8, Smim8

Chromosome

Transcripts

21 ORFs
6 known transcripts
37 ORF to known transcript associations

MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

ORFs

Display all the transcripts related to the entry.

822311

ENSMUST00000108132

Smim8-203

protein_coding

MetamORF ORF ID The MetamORF ID of the ORF. This is an unique ID referring to the ORF.	ORF length The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.	Nucleic sequence The nucleic sequence of the ORF.	Amino acid sequence The amino acid sequence of the ORF.	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Start codon The start codon sequence of the ORF.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.	ORF annotations A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.	Cell types A comma-separated list of the cell types in which the ORF has already been identified on the transcript.
151903	90	GTG ...	VWL ...	Ribo-seq	GTG	weak	1	sORFs_org_Mouse	Alternative InCDS Overlapping sORF	Liver
149775	78	TTG ...	LVI ...	Ribo-seq	TTG	strong	1	sORFs_org_Mouse	Alternative InCDS Overlapping sORF	Liver
160109	66	ATG ...	MQL ...	Ribo-seq	ATG	weak	1	sORFs_org_Mouse	Alternative InCDS Overlapping sORF	B_cell
159226	60	TTG ...	LCG ...	Ribo-seq	TTG	moderate	1	sORFs_org_Mouse	Alternative InCDS Overlapping sORF	B_cell

60002

ENSMUST00000029972

Smim8-201

protein_coding

MetamORF ORF ID The MetamORF ID of the ORF. This is an unique ID referring to the ORF.	ORF length The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.	Nucleic sequence The nucleic sequence of the ORF.	Amino acid sequence The amino acid sequence of the ORF.	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Start codon The start codon sequence of the ORF.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.	ORF annotations A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.	Cell types A comma-separated list of the cell types in which the ORF has already been identified on the transcript.
149060	33	CTG ...	LIL ...	Ribo-seq	CTG	weak	1	sORFs_org_Mouse	InCDS Overlapping sORF	Brain
60992	36	CTG ...	LKL ...	Ribo-seq	CTG		1	sORFs_org_Mouse	Alternative sORF Upstream	Testis
60001	51	CTG ...	LAA ...	Ribo-seq	CTG		1	sORFs_org_Mouse	Alternative sORF Upstream	Testis
60683	57	CTG ...	LSL ...	Ribo-seq	CTG		1	sORFs_org_Mouse	Alternative sORF Upstream	Testis
899517	156	CTG ...	LKL ...	Ribo-seq	CTG		1	sORFs_org_Mouse	Alternative sORF Upstream	BMDC
149775	78	TTG ...	LVI ...	Ribo-seq	TTG	strong	2	sORFs_org_Mouse	Alternative InCDS Overlapping sORF	3T3, Brain
556394	48	TTG ...	LPK ...	Ribo-seq	TTG		4	sORFs_org_Mouse	Alternative Overlapping sORF Upstream	3T3, E14, NSC, v6-5
899997	171	CTG ...	LAA ...	Ribo-seq	CTG		1	sORFs_org_Mouse	Alternative sORF Upstream	BMDC
151903	90	GTG ...	VWL ...	Ribo-seq	GTG	weak	2	sORFs_org_Mouse	Alternative InCDS Overlapping sORF	3T3, Brain
147565	36	ATG ...	MKP ...	Ribo-seq	ATG	weak	2	sORFs_org_Mouse	Alternative InCDS Overlapping sORF	3T3, Brain
160109	66	ATG ...	MQL ...	Ribo-seq	ATG	weak	2	sORFs_org_Mouse	Alternative InCDS Overlapping sORF	3T3, Brain
159226	60	TTG ...	LCG ...	Ribo-seq	TTG	moderate	1	sORFs_org_Mouse	Alternative InCDS Overlapping sORF	Brain

1177974

ENSMUST00000108131

Smim8-202

protein_coding

MetamORF ORF ID The MetamORF ID of the ORF. This is an unique ID referring to the ORF.	ORF length The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.	Nucleic sequence The nucleic sequence of the ORF.	Amino acid sequence The amino acid sequence of the ORF.	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Start codon The start codon sequence of the ORF.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.	ORF annotations A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.	Cell types A comma-separated list of the cell types in which the ORF has already been identified on the transcript.
159226	60	TTG ...	LCG ...	Ribo-seq	TTG	moderate	3	sORFs_org_Mouse	Alternative InCDS Overlapping sORF	E14, MEF, v6-5
149060	33	CTG ...	LIL ...	Ribo-seq	CTG	weak	1	sORFs_org_Mouse	InCDS Overlapping sORF	MEF
160109	66	ATG ...	MQL ...	Ribo-seq	ATG	weak	2	sORFs_org_Mouse	Alternative InCDS Overlapping sORF	E14, MEF
151903	90	GTG ...	VWL ...	Ribo-seq	GTG	weak	1	sORFs_org_Mouse	Alternative InCDS Overlapping sORF	MEF
149775	78	TTG ...	LVI ...	Ribo-seq	TTG	strong	2	sORFs_org_Mouse	Alternative InCDS Overlapping sORF	E14, MEF

58193

ENSMUST00000139922

Smim8-207

processed_transcript

MetamORF ORF ID The MetamORF ID of the ORF. This is an unique ID referring to the ORF.	ORF length The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.	Nucleic sequence The nucleic sequence of the ORF.	Amino acid sequence The amino acid sequence of the ORF.	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Start codon The start codon sequence of the ORF.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.	ORF annotations A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.	Cell types A comma-separated list of the cell types in which the ORF has already been identified on the transcript.
61230	60	CTG ...	LAA ...	Ribo-seq	CTG		3	sORFs_org_Mouse	ncRNA sORF	E14, Testis, v6-5
58490	135	TTG ...	LVT ...	Ribo-seq	TTG	strong	1	sORFs_org_Mouse	ncRNA sORF	Testis
59169	114	GTG ...	VAY ...	Ribo-seq	GTG	moderate	1	sORFs_org_Mouse	ncRNA sORF	Testis
899997	171	CTG ...	LAA ...	Ribo-seq	CTG		1	sORFs_org_Mouse	ncRNA sORF	E14
60690	168	TTG ...	LPK ...	Ribo-seq	TTG		5	sORFs_org_Mouse	ncRNA sORF	Brain, Liver, MEF, Spleen_B_cell, Testis
61093	66	CTG ...	LSL ...	Ribo-seq	CTG		1	sORFs_org_Mouse	ncRNA sORF	Testis
58192	96	CTG ...	LHA ...	Ribo-seq	CTG	weak	1	sORFs_org_Mouse	ncRNA sORF	Testis
60184	45	CTG ...	LKL ...	Ribo-seq	CTG		4	sORFs_org_Mouse	ncRNA sORF	E14, Liver, Testis, v6-5
58630	150	GTG ...	VMA ...	Ribo-seq	GTG	weak	1	sORFs_org_Mouse	ncRNA sORF	Testis
899517	156	CTG ...	LKL ...	Ribo-seq	CTG		1	sORFs_org_Mouse	ncRNA sORF	E14

1085011

ENSMUST00000108134

Smim8-205

protein_coding

MetamORF
ORF ID

The MetamORF ID of the ORF. This is an unique ID referring to the ORF.

ORF
length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.

Nucleic
sequence

The nucleic sequence of the ORF.

Amino acid
sequence

The amino acid sequence of the ORF.

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Start codon

The start codon sequence of the ORF.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

ORF annotations

A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of the cell types in which the ORF has already been identified on the transcript.

1085010

TTG ...

LWF ...

Ribo-seq

TTG

sORFs_org_Mouse

v6-5

160109

ATG ...

MQL ...

Ribo-seq

ATG

weak

Johnstone2016

Glioma, Liver, MEF,
MESC

1885071

ENSMUST00000108133

Smim8-204

protein_coding

MetamORF ORF ID The MetamORF ID of the ORF. This is an unique ID referring to the ORF.	ORF length The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.	Nucleic sequence The nucleic sequence of the ORF.	Amino acid sequence The amino acid sequence of the ORF.	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Start codon The start codon sequence of the ORF.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.	ORF annotations A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.	Cell types A comma-separated list of the cell types in which the ORF has already been identified on the transcript.
159226	60	TTG ...	LCG ...	Ribo-seq	TTG	moderate	1	sORFs_org_Mouse	Alternative InCDS Overlapping sORF	R1E
149775	78	TTG ...	LVI ...	Ribo-seq	TTG	strong	1	sORFs_org_Mouse	Alternative InCDS Overlapping sORF	R1E
149060	33	CTG ...	LIL ...	Ribo-seq	CTG	weak	1	sORFs_org_Mouse	InCDS Overlapping sORF	R1E
160109	66	ATG ...	MQL ...	Ribo-seq	ATG	weak	1	sORFs_org_Mouse	Alternative InCDS Overlapping sORF	R1E

ORFs

21 ORFs
6 known transcripts
37 ORF to known transcript associations

MetamORF
ORF ID

The MetamORF ID of the ORF. This is an unique ID referring to the ORF.

ORF Length

Nucleic
sequence

The nucleic sequence of the ORF.

Amino acid
sequence

The amino acid sequence of the ORF.

Start codon

The start codon sequence of the ORF.

ORF annotations

A comma-separated list of all the annotations computed by our algorithm for the ORF. This list includes the annotations computed for the ORF for all transcripts. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of all the cell types in which the ORF has already been identified.

Transcripts

Display all the transcripts related to the entry.

151903

GTG...

VWL...

GTG

Alternative
InCDS
Overlapping
sORF

Liver, MEF, 3T3,
Brain

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
822311	ENSMUST00000108132	Smim8-203	protein_coding	Ribo-seq	weak	1	sORFs_org_Mouse
1177974	ENSMUST00000108131	Smim8-202	protein_coding	Ribo-seq	weak	1	sORFs_org_Mouse
60002	ENSMUST00000029972	Smim8-201	protein_coding	Ribo-seq	weak	2	sORFs_org_Mouse

149060

CTG...

LIL...

CTG

InCDS
Overlapping
sORF

Brain, R1E, MEF

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
60002	ENSMUST00000029972	Smim8-201	protein_coding	Ribo-seq	weak	1	sORFs_org_Mouse
1885071	ENSMUST00000108133	Smim8-204	protein_coding	Ribo-seq	weak	1	sORFs_org_Mouse
1177974	ENSMUST00000108131	Smim8-202	protein_coding	Ribo-seq	weak	1	sORFs_org_Mouse

159226

TTG...

LCG...

TTG

Alternative
InCDS
Overlapping
sORF

E14, MEF, v6-5,
R1E, B_cell, Brain

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
1177974	ENSMUST00000108131	Smim8-202	protein_coding	Ribo-seq	moderate	3	sORFs_org_Mouse
1885071	ENSMUST00000108133	Smim8-204	protein_coding	Ribo-seq	moderate	1	sORFs_org_Mouse
822311	ENSMUST00000108132	Smim8-203	protein_coding	Ribo-seq	moderate	1	sORFs_org_Mouse
60002	ENSMUST00000029972	Smim8-201	protein_coding	Ribo-seq	moderate	1	sORFs_org_Mouse

60992

CTG...

LKL...

CTG

Alternative
sORF
Upstream

Testis

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
60002	ENSMUST00000029972	Smim8-201	protein_coding	Ribo-seq		1	sORFs_org_Mouse

61230

CTG...

LAA...

CTG

ncRNA
sORF

E14, Testis, v6-5

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
58193	ENSMUST00000139922	Smim8-207	processed_transcript	Ribo-seq		3	sORFs_org_Mouse

149775

TTG...

LVI...

TTG

Alternative
InCDS
Overlapping
sORF

Liver, R1E, 3T3,
Brain, E14, MEF

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
822311	ENSMUST00000108132	Smim8-203	protein_coding	Ribo-seq	strong	1	sORFs_org_Mouse
1885071	ENSMUST00000108133	Smim8-204	protein_coding	Ribo-seq	strong	1	sORFs_org_Mouse
60002	ENSMUST00000029972	Smim8-201	protein_coding	Ribo-seq	strong	2	sORFs_org_Mouse
1177974	ENSMUST00000108131	Smim8-202	protein_coding	Ribo-seq	strong	2	sORFs_org_Mouse

160109

ATG...

MQL...

ATG

Alternative
InCDS
Overlapping
sORF

B_cell, Glioma, Liver,
MEF, MESC, R1E,
E14, 3T3, Brain

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
822311	ENSMUST00000108132	Smim8-203	protein_coding	Ribo-seq	weak	1	sORFs_org_Mouse
1085011	ENSMUST00000108134	Smim8-205	protein_coding	Ribo-seq	weak	1	Johnstone2016
1885071	ENSMUST00000108133	Smim8-204	protein_coding	Ribo-seq	weak	1	sORFs_org_Mouse
1177974	ENSMUST00000108131	Smim8-202	protein_coding	Ribo-seq	weak	2	sORFs_org_Mouse
60002	ENSMUST00000029972	Smim8-201	protein_coding	Ribo-seq	weak	2	sORFs_org_Mouse

1085010

TTG...

LWF...

TTG

Alternative
Overlapping
sORF
Upstream

v6-5

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
1085011	ENSMUST00000108134	Smim8-205	protein_coding	Ribo-seq		1	sORFs_org_Mouse

58490

135

TTG...

LVT...

TTG

ncRNA
sORF

Testis

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
58193	ENSMUST00000139922	Smim8-207	processed_transcript	Ribo-seq	strong	1	sORFs_org_Mouse

60001

CTG...

LAA...

CTG

Alternative
sORF
Upstream

Testis

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
60002	ENSMUST00000029972	Smim8-201	protein_coding	Ribo-seq		1	sORFs_org_Mouse

59169

114

GTG...

VAY...

GTG

ncRNA
sORF

Testis

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
58193	ENSMUST00000139922	Smim8-207	processed_transcript	Ribo-seq	moderate	1	sORFs_org_Mouse

60683

CTG...

LSL...

CTG

Alternative
sORF
Upstream

Testis

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
60002	ENSMUST00000029972	Smim8-201	protein_coding	Ribo-seq		1	sORFs_org_Mouse

899997

171

CTG...

LAA...

CTG

ncRNA
sORF
Alternative
Upstream

E14, BMDC

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
58193	ENSMUST00000139922	Smim8-207	processed_transcript	Ribo-seq		1	sORFs_org_Mouse
60002	ENSMUST00000029972	Smim8-201	protein_coding	Ribo-seq		1	sORFs_org_Mouse

899517

156

CTG...

LKL...

CTG

Alternative
sORF
Upstream
ncRNA

BMDC, E14

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
60002	ENSMUST00000029972	Smim8-201	protein_coding	Ribo-seq		1	sORFs_org_Mouse
58193	ENSMUST00000139922	Smim8-207	processed_transcript	Ribo-seq		1	sORFs_org_Mouse

60690

168

TTG...

LPK...

TTG

ncRNA
sORF

Brain, Liver, MEF,
Spleen_B_cell, Testis

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
58193	ENSMUST00000139922	Smim8-207	processed_transcript	Ribo-seq		5	sORFs_org_Mouse

61093

CTG...

LSL...

CTG

ncRNA
sORF

Testis

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
58193	ENSMUST00000139922	Smim8-207	processed_transcript	Ribo-seq		1	sORFs_org_Mouse

556394

TTG...

LPK...

TTG

Alternative
Overlapping
sORF
Upstream

3T3, E14, NSC,
v6-5

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
60002	ENSMUST00000029972	Smim8-201	protein_coding	Ribo-seq		4	sORFs_org_Mouse

58192

CTG...

LHA...

CTG

ncRNA
sORF

Testis

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
58193	ENSMUST00000139922	Smim8-207	processed_transcript	Ribo-seq	weak	1	sORFs_org_Mouse

147565

ATG...

MKP...

ATG

Alternative
InCDS
Overlapping
sORF

3T3, Brain

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
60002	ENSMUST00000029972	Smim8-201	protein_coding	Ribo-seq	weak	2	sORFs_org_Mouse

60184

CTG...

LKL...

CTG

ncRNA
sORF

E14, Liver, Testis,
v6-5

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
58193	ENSMUST00000139922	Smim8-207	processed_transcript	Ribo-seq		4	sORFs_org_Mouse

58630

150

GTG...

VMA...

GTG

ncRNA
sORF

Testis

MetamORF transcript ID The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.	Transcript ID The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).	Transcript name The transcript name (e.g. MDK-202).	Transcript biotype The biotype of the transcript (as defined by Ensembl).	Identification The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.	Kozak context The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.	Exp. count The number of original datasets that identifed the ORF on the transcript.	Data sources The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.
58193	ENSMUST00000139922	Smim8-207	processed_transcript	Ribo-seq	weak	1	sORFs_org_Mouse

Export data