MetamORF

Gene

ID CARD

ID

NCBI:66291

Aliases

1810030N24Rik, 2810406B13Rik, ENSMUSG00000028295, MGI:1913541, NCBI:66291, OFF:Smim8, Smim8

Chromosome

4

Transcripts

21 ORFs
6 known transcripts
37 ORF to known transcript associations

MetamORF transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

ORFs

Display all the ORFs related to the transcript.

822311 ENSMUST00000108132 Smim8-203 protein_coding
MetamORF ORF ID

The MetamORF ID of the ORF. This is a unique ID referring to the ORF.

ORF length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of the exons constituting the ORF (thus excluding any introns) and includes both the start and the stop codons.

Nucleic sequence

The nucleic sequence of the ORF.

Amino acid sequence

The amino acid sequence of the ORF.

Identification

The method used to identify the ORF. MetamORF currently integrates data from three main types of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advanced documentation for more information.

Start codon

The start codon sequence of the ORF.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identified the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the data sources and the original ORF IDs.

ORF annotations

A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of the cell types in which the ORF has already been identified on the transcript.
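
The ORF length definition above (sum of exon lengths, introns excluded, start and stop codons included) can be illustrated with a minimal sketch. The coordinate convention (1-based, inclusive), the function names and the exon coordinates are assumptions made for this example only; they are not taken from MetamORF.

```python
from typing import List, Tuple

# (start, end) of each exonic part of the ORF.
# Assumed convention for this sketch: 1-based, inclusive genomic coordinates.
Exon = Tuple[int, int]


def orf_length(exons: List[Exon]) -> int:
    """Sum the lengths of the exonic parts; introns between them are not counted."""
    return sum(end - start + 1 for start, end in exons)


def peptide_length(orf_len_bp: int) -> int:
    """Number of amino acids encoded by an ORF whose length includes the stop codon."""
    if orf_len_bp % 3 != 0:
        raise ValueError("ORF length is expected to be a multiple of 3")
    # The stop codon is counted in the length but is not translated.
    return orf_len_bp // 3 - 1


# Hypothetical two-exon ORF: 30 bp + 60 bp, separated by an intron that is ignored.
example_exons = [(100, 129), (200, 259)]
example_length = orf_length(example_exons)
print(example_length, peptide_length(example_length))
```

Under these assumptions, a 90 bp ORF spans 30 codons, the last of which is the stop codon, so it would encode 29 amino acids.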

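MetamORF ORF ID | ORF length (bp) | Nucleic sequence | Amino acid sequence | Identification | Start codon | Kozak context | Exp. count | Data sources | ORF annotations | Cell types
151903 | 90 | GTG ... | VWL ... | Ribo-seq | GTG | weak | 1 | sORFs_org_Mouse | Alternative, InCDS, Overlapping, sORF | Liver
149775 | 78 | TTG ... | LVI ... | Ribo-seq | TTG | strong | 1 | sORFs_org_Mouse | Alternative, InCDS, Overlapping, sORF | Liver
160109 | 66 | ATG ... | MQL ... | Ribo-seq | ATG | weak | 1 | sORFs_org_Mouse | Alternative, InCDS, Overlapping, sORF | B_cell
159226 | 60 | TTG ... | LCG ... | Ribo-seq | TTG | moderate | 1 | sORFs_org_Mouse | Alternative, InCDS, Overlapping, sORF | B_cell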
60002 ENSMUST00000029972 Smim8-201 protein_coding
MetamORF ORF ID | ORF length (bp) | Nucleic sequence | Amino acid sequence | Identification | Start codon | Kozak context | Exp. count | Data sources | ORF annotations | Cell types
149060 | 33 | CTG ... | LIL ... | Ribo-seq | CTG | weak | 1 | sORFs_org_Mouse | InCDS, Overlapping, sORF | Brain
60992 | 36 | CTG ... | LKL ... | Ribo-seq | CTG |  | 1 | sORFs_org_Mouse | Alternative, sORF, Upstream | Testis
60001 | 51 | CTG ... | LAA ... | Ribo-seq | CTG |  | 1 | sORFs_org_Mouse | Alternative, sORF, Upstream | Testis
60683 | 57 | CTG ... | LSL ... | Ribo-seq | CTG |  | 1 | sORFs_org_Mouse | Alternative, sORF, Upstream | Testis
899517 | 156 | CTG ... | LKL ... | Ribo-seq | CTG |  | 1 | sORFs_org_Mouse | Alternative, sORF, Upstream | BMDC
149775 | 78 | TTG ... | LVI ... | Ribo-seq | TTG | strong | 2 | sORFs_org_Mouse | Alternative, InCDS, Overlapping, sORF | 3T3, Brain
556394 | 48 | TTG ... | LPK ... | Ribo-seq | TTG |  | 4 | sORFs_org_Mouse | Alternative, Overlapping, sORF, Upstream | 3T3, E14, NSC, v6-5
899997 | 171 | CTG ... | LAA ... | Ribo-seq | CTG |  | 1 | sORFs_org_Mouse | Alternative, sORF, Upstream | BMDC
151903 | 90 | GTG ... | VWL ... | Ribo-seq | GTG | weak | 2 | sORFs_org_Mouse | Alternative, InCDS, Overlapping, sORF | 3T3, Brain
147565 | 36 | ATG ... | MKP ... | Ribo-seq | ATG | weak | 2 | sORFs_org_Mouse | Alternative, InCDS, Overlapping, sORF | 3T3, Brain
160109 | 66 | ATG ... | MQL ... | Ribo-seq | ATG | weak | 2 | sORFs_org_Mouse | Alternative, InCDS, Overlapping, sORF | 3T3, Brain
159226 | 60 | TTG ... | LCG ... | Ribo-seq | TTG | moderate | 1 | sORFs_org_Mouse | Alternative, InCDS, Overlapping, sORF | Brain
1177974 ENSMUST00000108131 Smim8-202 protein_coding
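MetamORF ORF ID | ORF length (bp) | Nucleic sequence | Amino acid sequence | Identification | Start codon | Kozak context | Exp. count | Data sources | ORF annotations | Cell types
159226 | 60 | TTG ... | LCG ... | Ribo-seq | TTG | moderate | 3 | sORFs_org_Mouse | Alternative, InCDS, Overlapping, sORF | E14, MEF, v6-5
149060 | 33 | CTG ... | LIL ... | Ribo-seq | CTG | weak | 1 | sORFs_org_Mouse | InCDS, Overlapping, sORF | MEF
160109 | 66 | ATG ... | MQL ... | Ribo-seq | ATG | weak | 2 | sORFs_org_Mouse | Alternative, InCDS, Overlapping, sORF | E14, MEF
151903 | 90 | GTG ... | VWL ... | Ribo-seq | GTG | weak | 1 | sORFs_org_Mouse | Alternative, InCDS, Overlapping, sORF | MEF
149775 | 78 | TTG ... | LVI ... | Ribo-seq | TTG | strong | 2 | sORFs_org_Mouse | Alternative, InCDS, Overlapping, sORF | E14, MEF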
58193 ENSMUST00000139922 Smim8-207 processed_transcript
MetamORF ORF ID | ORF length (bp) | Nucleic sequence | Amino acid sequence | Identification | Start codon | Kozak context | Exp. count | Data sources | ORF annotations | Cell types
61230 | 60 | CTG ... | LAA ... | Ribo-seq | CTG |  | 3 | sORFs_org_Mouse | ncRNA, sORF | E14, Testis, v6-5
58490 | 135 | TTG ... | LVT ... | Ribo-seq | TTG | strong | 1 | sORFs_org_Mouse | ncRNA, sORF | Testis
59169 | 114 | GTG ... | VAY ... | Ribo-seq | GTG | moderate | 1 | sORFs_org_Mouse | ncRNA, sORF | Testis
899997 | 171 | CTG ... | LAA ... | Ribo-seq | CTG |  | 1 | sORFs_org_Mouse | ncRNA, sORF | E14
60690 | 168 | TTG ... | LPK ... | Ribo-seq | TTG |  | 5 | sORFs_org_Mouse | ncRNA, sORF | Brain, Liver, MEF, Spleen_B_cell, Testis
61093 | 66 | CTG ... | LSL ... | Ribo-seq | CTG |  | 1 | sORFs_org_Mouse | ncRNA, sORF | Testis
58192 | 96 | CTG ... | LHA ... | Ribo-seq | CTG | weak | 1 | sORFs_org_Mouse | ncRNA, sORF | Testis
60184 | 45 | CTG ... | LKL ... | Ribo-seq | CTG |  | 4 | sORFs_org_Mouse | ncRNA, sORF | E14, Liver, Testis, v6-5
58630 | 150 | GTG ... | VMA ... | Ribo-seq | GTG | weak | 1 | sORFs_org_Mouse | ncRNA, sORF | Testis
899517 | 156 | CTG ... | LKL ... | Ribo-seq | CTG |  | 1 | sORFs_org_Mouse | ncRNA, sORF | E14
1085011 ENSMUST00000108134 Smim8-205 protein_coding
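MetamORF ORF ID | ORF length (bp) | Nucleic sequence | Amino acid sequence | Identification | Start codon | Kozak context | Exp. count | Data sources | ORF annotations | Cell types
1085010 | 63 | TTG ... | LWF ... | Ribo-seq | TTG |  | 1 | sORFs_org_Mouse | Alternative, Overlapping, sORF, Upstream | v6-5
160109 | 66 | ATG ... | MQL ... | Ribo-seq | ATG | weak | 1 | Johnstone2016 | Alternative, InCDS, Overlapping, sORF | Glioma, Liver, MEF, MESC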
1885071 ENSMUST00000108133 Smim8-204 protein_coding
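MetamORF ORF ID | ORF length (bp) | Nucleic sequence | Amino acid sequence | Identification | Start codon | Kozak context | Exp. count | Data sources | ORF annotations | Cell types
159226 | 60 | TTG ... | LCG ... | Ribo-seq | TTG | moderate | 1 | sORFs_org_Mouse | Alternative, InCDS, Overlapping, sORF | R1E
149775 | 78 | TTG ... | LVI ... | Ribo-seq | TTG | strong | 1 | sORFs_org_Mouse | Alternative, InCDS, Overlapping, sORF | R1E
149060 | 33 | CTG ... | LIL ... | Ribo-seq | CTG | weak | 1 | sORFs_org_Mouse | InCDS, Overlapping, sORF | R1E
160109 | 66 | ATG ... | MQL ... | Ribo-seq | ATG | weak | 1 | sORFs_org_Mouse | Alternative, InCDS, Overlapping, sORF | R1E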

ORFs

MetamORF ORF ID

The MetamORF ID of the ORF. This is a unique ID referring to the ORF.

ORF length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of the exons constituting the ORF (thus excluding any introns) and includes both the start and the stop codons.

Nucleic sequence

The nucleic sequence of the ORF.

Amino acid sequence

The amino acid sequence of the ORF.

Start codon

The start codon sequence of the ORF.

ORF annotations

A comma-separated list of all the annotations computed by our algorithm for the ORF. This list includes the annotations computed for the ORF on all of its transcripts. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of all the cell types in which the ORF has already been identified.

Transcripts

Display all the transcripts related to the entry.
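
The ORF-level annotation and cell type lists described above combine the values recorded for the ORF on each of its transcripts. A minimal sketch of that aggregation follows, using the values shown on this page for ORF 899517 on Smim8-201 and Smim8-207. The data structure, the function name and the case-insensitive alphabetical ordering are assumptions made for illustration; the page itself does not state how MetamORF orders or stores these lists.

```python
from typing import Dict, List

# Per-transcript values for ORF 899517, as listed on this page.
# The dictionary layout is a hypothetical structure for this sketch.
per_transcript: Dict[str, Dict[str, List[str]]] = {
    "Smim8-201": {
        "annotations": ["Alternative", "sORF", "Upstream"],
        "cell_types": ["BMDC"],
    },
    "Smim8-207": {
        "annotations": ["ncRNA", "sORF"],
        "cell_types": ["E14"],
    },
}


def merge_field(records: Dict[str, Dict[str, List[str]]], field: str) -> List[str]:
    """Union of one field over all transcripts, without duplicates."""
    merged = set()
    for record in records.values():
        merged.update(record[field])
    # Ordering is an arbitrary choice here: alphabetical, ignoring case.
    return sorted(merged, key=str.lower)


orf_annotations = merge_field(per_transcript, "annotations")
orf_cell_types = merge_field(per_transcript, "cell_types")
print(", ".join(orf_annotations))
print(", ".join(orf_cell_types))
```

Shared values such as sORF appear once in the merged list, while transcript-specific ones (Upstream on Smim8-201, ncRNA on Smim8-207) are all retained.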

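MetamORF ORF ID | ORF length (bp) | Nucleic sequence | Amino acid sequence | Start codon | ORF annotations | Cell types
151903 | 90 | GTG... | VWL... | GTG | Alternative, InCDS, Overlapping, sORF | Liver, MEF, 3T3, Brain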
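MetamORF transcript ID | Transcript ID | Transcript name | Transcript biotype | Identification | Kozak context | Exp. count | Data sources
822311 | ENSMUST00000108132 | Smim8-203 | protein_coding | Ribo-seq | weak | 1 | sORFs_org_Mouse
1177974 | ENSMUST00000108131 | Smim8-202 | protein_coding | Ribo-seq | weak | 1 | sORFs_org_Mouse
60002 | ENSMUST00000029972 | Smim8-201 | protein_coding | Ribo-seq | weak | 2 | sORFs_org_Mouse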
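MetamORF ORF ID | ORF length (bp) | Nucleic sequence | Amino acid sequence | Start codon | ORF annotations | Cell types
149060 | 33 | CTG... | LIL... | CTG | InCDS, Overlapping, sORF | Brain, R1E, MEF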
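MetamORF transcript ID | Transcript ID | Transcript name | Transcript biotype | Identification | Kozak context | Exp. count | Data sources
60002 | ENSMUST00000029972 | Smim8-201 | protein_coding | Ribo-seq | weak | 1 | sORFs_org_Mouse
1885071 | ENSMUST00000108133 | Smim8-204 | protein_coding | Ribo-seq | weak | 1 | sORFs_org_Mouse
1177974 | ENSMUST00000108131 | Smim8-202 | protein_coding | Ribo-seq | weak | 1 | sORFs_org_Mouse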
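MetamORF ORF ID | ORF length (bp) | Nucleic sequence | Amino acid sequence | Start codon | ORF annotations | Cell types
159226 | 60 | TTG... | LCG... | TTG | Alternative, InCDS, Overlapping, sORF | E14, MEF, v6-5, R1E, B_cell, Brain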
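MetamORF transcript ID | Transcript ID | Transcript name | Transcript biotype | Identification | Kozak context | Exp. count | Data sources
1177974 | ENSMUST00000108131 | Smim8-202 | protein_coding | Ribo-seq | moderate | 3 | sORFs_org_Mouse
1885071 | ENSMUST00000108133 | Smim8-204 | protein_coding | Ribo-seq | moderate | 1 | sORFs_org_Mouse
822311 | ENSMUST00000108132 | Smim8-203 | protein_coding | Ribo-seq | moderate | 1 | sORFs_org_Mouse
60002 | ENSMUST00000029972 | Smim8-201 | protein_coding | Ribo-seq | moderate | 1 | sORFs_org_Mouse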
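MetamORF ORF ID | ORF length (bp) | Nucleic sequence | Amino acid sequence | Start codon | ORF annotations | Cell types
60992 | 36 | CTG... | LKL... | CTG | Alternative, sORF, Upstream | Testis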
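MetamORF transcript ID | Transcript ID | Transcript name | Transcript biotype | Identification | Kozak context | Exp. count | Data sources
60002 | ENSMUST00000029972 | Smim8-201 | protein_coding | Ribo-seq |  | 1 | sORFs_org_Mouse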
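MetamORF ORF ID | ORF length (bp) | Nucleic sequence | Amino acid sequence | Start codon | ORF annotations | Cell types
61230 | 60 | CTG... | LAA... | CTG | ncRNA, sORF | E14, Testis, v6-5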
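MetamORF transcript ID | Transcript ID | Transcript name | Transcript biotype | Identification | Kozak context | Exp. count | Data sources
58193 | ENSMUST00000139922 | Smim8-207 | processed_transcript | Ribo-seq |  | 3 | sORFs_org_Mouse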
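MetamORF ORF ID | ORF length (bp) | Nucleic sequence | Amino acid sequence | Start codon | ORF annotations | Cell types
149775 | 78 | TTG... | LVI... | TTG | Alternative, InCDS, Overlapping, sORF | Liver, R1E, 3T3, Brain, E14, MEF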
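MetamORF transcript ID | Transcript ID | Transcript name | Transcript biotype | Identification | Kozak context | Exp. count | Data sources
822311 | ENSMUST00000108132 | Smim8-203 | protein_coding | Ribo-seq | strong | 1 | sORFs_org_Mouse
1885071 | ENSMUST00000108133 | Smim8-204 | protein_coding | Ribo-seq | strong | 1 | sORFs_org_Mouse
60002 | ENSMUST00000029972 | Smim8-201 | protein_coding | Ribo-seq | strong | 2 | sORFs_org_Mouse
1177974 | ENSMUST00000108131 | Smim8-202 | protein_coding | Ribo-seq | strong | 2 | sORFs_org_Mouse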
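MetamORF ORF ID | ORF length (bp) | Nucleic sequence | Amino acid sequence | Start codon | ORF annotations | Cell types
160109 | 66 | ATG... | MQL... | ATG | Alternative, InCDS, Overlapping, sORF | B_cell, Glioma, Liver, MEF, MESC, R1E, E14, 3T3, Brain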
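MetamORF transcript ID | Transcript ID | Transcript name | Transcript biotype | Identification | Kozak context | Exp. count | Data sources
822311 | ENSMUST00000108132 | Smim8-203 | protein_coding | Ribo-seq | weak | 1 | sORFs_org_Mouse
1085011 | ENSMUST00000108134 | Smim8-205 | protein_coding | Ribo-seq | weak | 1 | Johnstone2016
1885071 | ENSMUST00000108133 | Smim8-204 | protein_coding | Ribo-seq | weak | 1 | sORFs_org_Mouse
1177974 | ENSMUST00000108131 | Smim8-202 | protein_coding | Ribo-seq | weak | 2 | sORFs_org_Mouse
60002 | ENSMUST00000029972 | Smim8-201 | protein_coding | Ribo-seq | weak | 2 | sORFs_org_Mouse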
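MetamORF ORF ID | ORF length (bp) | Nucleic sequence | Amino acid sequence | Start codon | ORF annotations | Cell types
1085010 | 63 | TTG... | LWF... | TTG | Alternative, Overlapping, sORF, Upstream | v6-5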
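MetamORF transcript ID | Transcript ID | Transcript name | Transcript biotype | Identification | Kozak context | Exp. count | Data sources
1085011 | ENSMUST00000108134 | Smim8-205 | protein_coding | Ribo-seq |  | 1 | sORFs_org_Mouse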
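MetamORF ORF ID | ORF length (bp) | Nucleic sequence | Amino acid sequence | Start codon | ORF annotations | Cell types
58490 | 135 | TTG... | LVT... | TTG | ncRNA, sORF | Testis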
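MetamORF transcript ID | Transcript ID | Transcript name | Transcript biotype | Identification | Kozak context | Exp. count | Data sources
58193 | ENSMUST00000139922 | Smim8-207 | processed_transcript | Ribo-seq | strong | 1 | sORFs_org_Mouse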
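MetamORF ORF ID | ORF length (bp) | Nucleic sequence | Amino acid sequence | Start codon | ORF annotations | Cell types
60001 | 51 | CTG... | LAA... | CTG | Alternative, sORF, Upstream | Testis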

60002 ENSMUST00000029972 Smim8-201 protein_coding Ribo-seq
1 sORFs_org_Mouse
59169 114

GTG...

VAY...

GTG ncRNA
sORF
Testis

58193 ENSMUST00000139922 Smim8-207 processed_transcript Ribo-seq
moderate 1 sORFs_org_Mouse
60683 57

CTG...

LSL...

CTG Alternative
sORF
Upstream
Testis

60002 ENSMUST00000029972 Smim8-201 protein_coding Ribo-seq
1 sORFs_org_Mouse
899997 171

CTG...

LAA...

CTG ncRNA
sORF
Alternative
Upstream
E14, BMDC

58193 ENSMUST00000139922 Smim8-207 processed_transcript Ribo-seq
1 sORFs_org_Mouse
60002 ENSMUST00000029972 Smim8-201 protein_coding Ribo-seq
1 sORFs_org_Mouse
899517 156

CTG...

LKL...

CTG Alternative
sORF
Upstream
ncRNA
BMDC, E14

60002 ENSMUST00000029972 Smim8-201 protein_coding Ribo-seq
1 sORFs_org_Mouse
58193 ENSMUST00000139922 Smim8-207 processed_transcript Ribo-seq
1 sORFs_org_Mouse
60690 168

TTG...

LPK...

TTG ncRNA
sORF
Brain, Liver, MEF, Spleen_B_cell, Testis

58193 ENSMUST00000139922 Smim8-207 processed_transcript Ribo-seq
5 sORFs_org_Mouse
61093 66

CTG...

LSL...

CTG ncRNA
sORF
Testis

58193 ENSMUST00000139922 Smim8-207 processed_transcript Ribo-seq
1 sORFs_org_Mouse
556394 48

TTG...

LPK...

TTG Alternative
Overlapping
sORF
Upstream
3T3, E14, NSC, v6-5

60002 ENSMUST00000029972 Smim8-201 protein_coding Ribo-seq
4 sORFs_org_Mouse
58192 96

CTG...

LHA...

CTG ncRNA
sORF
Testis

58193 ENSMUST00000139922 Smim8-207 processed_transcript Ribo-seq
weak 1 sORFs_org_Mouse
147565 36

ATG...

MKP...

ATG Alternative
InCDS
Overlapping
sORF
3T3, Brain

60002 ENSMUST00000029972 Smim8-201 protein_coding Ribo-seq
weak 2 sORFs_org_Mouse
60184 45

CTG...

LKL...

CTG ncRNA
sORF
E14, Liver, Testis, v6-5

58193 ENSMUST00000139922 Smim8-207 processed_transcript Ribo-seq
4 sORFs_org_Mouse
58630 150

GTG...

VMA...

GTG ncRNA
sORF
Testis

58193 ENSMUST00000139922 Smim8-207 processed_transcript Ribo-seq
weak 1 sORFs_org_Mouse

Identification method

Predicted

Ribo-seq

MS

Start Codon

Kozak context

moderate

weak

strong

optimal

ORF length

Transcript biotype

antisense

antisense_RNA

bidirectional_promoter_lncRNA

IG_C_gene

lincRNA

non_stop_decay

nonsense_mediated_decay

polymorphic_pseudogene

processed_pseudogene

processed_transcript

protein_coding

pseudogene

retained_intron

sense_intronic

sense_overlapping

TEC

transcribed_processed_pseudogene

transcribed_unitary_pseudogene

transcribed_unprocessed_pseudogene

unprocessed_pseudogene

ORF Annotations

Reading frame

Alternative

Relative position

CDS

Downstream

InCDS

Intronic

NewCDS

Overlapping

Upstream

Biotype

Intergenic

ncRNA

NMD

NSD

Pseudogene

Length

sORF

Cell types

3T3

B_cell

BMDC

Brain

C2C12

E14

Glioma

Liver

MEF

MESC

Neutrophil

NSC

R1E

Skin_tumor

Spleen_B_cell

Testis

v6-5