Metam
ORF

Logo

Gene

Gene

ID CARD

ID

NCBI:66077

Aliases

0610033H09Rik
Aip, Akip, Aurkaip1
ENSMUSG00000065990, MGI:1913327, MRP-S38
NCBI:66077, OFF:Aurkaip1

Chromosome

4

Transcripts

37 ORFs
6 known transcripts
58 ORF to known transcript associations

MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

ORFs

Display all the transcripts related to the entry.

56078 ENSMUST00000084097 Aurkaip1-201 protein_coding
MetamORF
ORF ID

The MetamORF ID of the ORF. This is an unique ID referring to the ORF.

ORF
length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.

Nucleic
sequence

The nucleic sequence of the ORF.

Amino acid
sequence

The amino acid sequence of the ORF.

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Start codon

The start codon sequence of the ORF.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

ORF annotations

A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of the cell types in which the ORF has already been identified on the transcript.

56401 90

TTG ...

LSA ...

Ribo-seq
TTG

2 sORFs_org_Mouse
    Alternative
    Overlapping
    sORF
    Upstream
Brain, Spleen_B_cell
513249 129

ATG ...

MLF ...

Ribo-seq
ATG

moderate

2 sORFs_org_Mouse
    InCDS
    Overlapping
    sORF
B_cell, Spleen_B_cell
511983 141

ATG ...

MSG ...

Ribo-seq
ATG

1 sORFs_org_Mouse
    Alternative
    sORF
    Upstream
Spleen_B_cell
1225532 39

AAG ...

KPL ...

Ribo-seq
AAG

moderate

1 sORFs_org_Mouse
    InCDS
    Overlapping
    sORF
E14
56077 33

ATG ...

MPL ...

Ribo-seq
ATG

moderate

1 sORFs_org_Mouse
    InCDS
    Overlapping
    sORF
Testis
60814 48

ATG ...

MMS ...

Ribo-seq
ATG

6 sORFs_org_Mouse
    Alternative
    sORF
    Upstream
BMDC, Brain, Liver,
Spleen_B_cell, Testis
60998 45

ATG ...

MSG ...

Ribo-seq
ATG

7 sORFs_org_Mouse
    Alternative
    sORF
    Upstream
BMDC, Brain, E14,
Liver, Spleen_B_cell, Testis
510340 90

CTG ...

LAD ...

Ribo-seq
CTG

strong

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
Spleen_B_cell
509199 81

CTG ...

LPP ...

Ribo-seq
CTG

moderate

2 sORFs_org_Mouse
    InCDS
    Overlapping
    sORF
B_cell, Spleen_B_cell
510068 144

ATG ...

MMS ...

Ribo-seq
ATG

1 sORFs_org_Mouse
    Alternative
    sORF
    Upstream
Spleen_B_cell
823230 69

CTG ...

LVR ...

Ribo-seq
CTG

moderate

1 sORFs_org_Mouse
    InCDS
    Overlapping
    sORF
B_cell
825497 99

CTG ...

LQA ...

Ribo-seq
CTG

moderate

1 sORFs_org_Mouse
    InCDS
    Overlapping
    sORF
B_cell
513766 138

TTG ...

LAA ...

Ribo-seq
TTG

moderate

2 sORFs_org_Mouse
    InCDS
    Overlapping
    sORF
B_cell, Spleen_B_cell
57072 96

CTG ...

LPW ...

Ribo-seq
CTG

weak

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
Brain
544515 ENSMUST00000105592 Aurkaip1-203 protein_coding
MetamORF
ORF ID

The MetamORF ID of the ORF. This is an unique ID referring to the ORF.

ORF
length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.

Nucleic
sequence

The nucleic sequence of the ORF.

Amino acid
sequence

The amino acid sequence of the ORF.

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Start codon

The start codon sequence of the ORF.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

ORF annotations

A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of the cell types in which the ORF has already been identified on the transcript.

513249 129

ATG ...

MLF ...

Ribo-seq
ATG

moderate

1 sORFs_org_Mouse
    InCDS
    Overlapping
    sORF
Liver
1651987 33

CTG ...

LRT ...

Ribo-seq
CTG

weak

1 sORFs_org_Mouse
    InCDS
    Overlapping
    sORF
MEF
57072 96

CTG ...

LPW ...

Ribo-seq
CTG

weak

2 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
Liver, MEF
1378457 45

CTG ...

LLN ...

Ribo-seq
CTG

weak

1 sORFs_org_Mouse
    InCDS
    Overlapping
    sORF
E14
510340 90

CTG ...

LAD ...

Ribo-seq
CTG

strong

2 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
E14, NSC
1178413 120

GTG ...

VTS ...

Ribo-seq
GTG

2 sORFs_org_Mouse
    Alternative
    Overlapping
    sORF
    Upstream
Liver, MEF
544514 78

CTG ...

LGC ...

Ribo-seq
CTG

strong

2 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
3T3, MEF
56401 90

TTG ...

LSA ...

Ribo-seq
TTG

6 sORFs_org_Mouse
    Alternative
    Overlapping
    sORF
    Upstream
BMDC, E14, Liver,
MEF
165832 ENSMUST00000139651 Aurkaip1-205 protein_coding
MetamORF
ORF ID

The MetamORF ID of the ORF. This is an unique ID referring to the ORF.

ORF
length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.

Nucleic
sequence

The nucleic sequence of the ORF.

Amino acid
sequence

The amino acid sequence of the ORF.

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Start codon

The start codon sequence of the ORF.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

ORF annotations

A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of the cell types in which the ORF has already been identified on the transcript.

513249 129

ATG ...

MLF ...

Ribo-seq
ATG

moderate

1 sORFs_org_Mouse
    sORF
Brain
825497 99

CTG ...

LQA ...

Ribo-seq
CTG

moderate

1 sORFs_org_Mouse
    sORF
Brain
513766 138

TTG ...

LAA ...

Ribo-seq
TTG

moderate

1 sORFs_org_Mouse
    sORF
Brain
56401 90

TTG ...

LSA ...

Ribo-seq
TTG

1 sORFs_org_Mouse
    sORF
MEF
1430966 279

TTG ...

LGE ...

Ribo-seq
TTG

1 sORFs_org_Mouse
    sORF
Liver
1230172 138

AAG ...

KIR ...

Ribo-seq
AAG

1 sORFs_org_Mouse
    sORF
E14
1554953 45

TTG ...

LRS ...

Ribo-seq
TTG

moderate

1 sORFs_org_Mouse
    sORF
Brain
509199 81

CTG ...

LPP ...

Ribo-seq
CTG

moderate

1 sORFs_org_Mouse
    sORF
Brain
1731601 57

GTG ...

VQK ...

Ribo-seq
GTG

moderate

1 sORFs_org_Mouse
    sORF
MEF
1657950 57

TTG ...

LCF ...

Ribo-seq
TTG

2 sORFs_org_Mouse
    sORF
MEF
823230 69

CTG ...

LVR ...

Ribo-seq
CTG

moderate

1 sORFs_org_Mouse
    sORF
Brain
544514 78

CTG ...

LGC ...

Ribo-seq
CTG

strong

1 sORFs_org_Mouse
    sORF
MEF
165831 228

GTG ...

VHA ...

Ribo-seq
GTG

5 sORFs_org_Mouse
    sORF
Brain, E14, Liver,
MEF
57072 96

CTG ...

LPW ...

Ribo-seq
CTG

weak

1 sORFs_org_Mouse
    sORF
Brain
1017477 ENSMUST00000129711 Aurkaip1-204 processed_transcript
MetamORF
ORF ID

The MetamORF ID of the ORF. This is an unique ID referring to the ORF.

ORF
length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.

Nucleic
sequence

The nucleic sequence of the ORF.

Amino acid
sequence

The amino acid sequence of the ORF.

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Start codon

The start codon sequence of the ORF.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

ORF annotations

A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of the cell types in which the ORF has already been identified on the transcript.

1017476 57

ATG ...

MMS ...

Ribo-seq
ATG

1 sORFs_org_Mouse
    ncRNA
    sORF
Brain
1178413 120

GTG ...

VTS ...

Ribo-seq
GTG

3 sORFs_org_Mouse
    ncRNA
    sORF
E14, R1E, v6-5
1082184 204

ATG ...

MSP ...

Ribo-seq
ATG

2 sORFs_org_Mouse
    ncRNA
    sORF
R1E, v6-5
56401 90

TTG ...

LSA ...

Ribo-seq
TTG

2 sORFs_org_Mouse
    ncRNA
    sORF
R1E, v6-5
1083127 168

ATG ...

MGY ...

Ribo-seq
ATG

2 sORFs_org_Mouse
    ncRNA
    sORF
R1E, v6-5
1081657 210

TTG ...

LKM ...

Ribo-seq
TTG

2 sORFs_org_Mouse
    ncRNA
    sORF
R1E, v6-5
1082839 228

TTG ...

LRS ...

Ribo-seq
TTG

2 sORFs_org_Mouse
    ncRNA
    sORF
R1E, v6-5
57072 96

CTG ...

LPW ...

Ribo-seq
CTG

weak

2 sORFs_org_Mouse
    ncRNA
    sORF
E14, R1E
1081420 171

GTG ...

VMG ...

Ribo-seq
GTG

2 sORFs_org_Mouse
    ncRNA
    sORF
R1E, v6-5
58119 126

CTG ...

LCC ...

Ribo-seq
CTG

2 sORFs_org_Mouse
    ncRNA
    sORF
E14, R1E
56402 ENSMUST00000151222 Aurkaip1-206 retained_intron
MetamORF
ORF ID

The MetamORF ID of the ORF. This is an unique ID referring to the ORF.

ORF
length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.

Nucleic
sequence

The nucleic sequence of the ORF.

Amino acid
sequence

The amino acid sequence of the ORF.

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Start codon

The start codon sequence of the ORF.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

ORF annotations

A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of the cell types in which the ORF has already been identified on the transcript.

56401 90

TTG ...

LSA ...

Ribo-seq
TTG

moderate

1 sORFs_org_Mouse
    Intronic
    sORF
Testis
57072 96

CTG ...

LPW ...

Ribo-seq
CTG

weak

1 sORFs_org_Mouse
    Intronic
    sORF
Testis
58119 126

CTG ...

LCC ...

Ribo-seq
CTG

weak

1 sORFs_org_Mouse
    Intronic
    sORF
Testis
159159 ENSMUST00000105591 Aurkaip1-202 protein_coding
MetamORF
ORF ID

The MetamORF ID of the ORF. This is an unique ID referring to the ORF.

ORF
length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.

Nucleic
sequence

The nucleic sequence of the ORF.

Amino acid
sequence

The amino acid sequence of the ORF.

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Start codon

The start codon sequence of the ORF.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

ORF annotations

A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of the cell types in which the ORF has already been identified on the transcript.

166557 267

ATG ...

MSG ...

Ribo-seq
ATG

3 sORFs_org_Mouse
    Alternative
    Overlapping
    sORF
    Upstream
Brain, E14, MEF
1017735 228

GTG ...

VRA ...

Ribo-seq
GTG

3 sORFs_org_Mouse
    Alternative
    Overlapping
    sORF
    Upstream
Brain, E14, Liver
1017949 204

CTG ...

LRS ...

Ribo-seq
CTG

3 sORFs_org_Mouse
    Alternative
    Overlapping
    sORF
    Upstream
Brain, E14, Liver
56077 33

ATG ...

MPL ...

Ribo-seq
ATG

moderate

1 Johnstone2016
    InCDS
    Overlapping
    sORF
Glioma, Liver, MEF,
MESC
56401 90

TTG ...

LSA ...

Ribo-seq
TTG

1 sORFs_org_Mouse
    Alternative
    Overlapping
    sORF
    Upstream
Brain
1017388 231

TTG ...

LVR ...

Ribo-seq
TTG

3 sORFs_org_Mouse
    Alternative
    Overlapping
    sORF
    Upstream
Brain, E14, Liver
513249 129

ATG ...

MLF ...

Ribo-seq
ATG

moderate

2 sORFs_org_Mouse
Johnstone2016
    InCDS
    Overlapping
    sORF
Glioma, Liver, MEF,
MESC
166559 270

ATG ...

MMS ...

Ribo-seq
ATG

4 sORFs_org_Mouse
Johnstone2016
    Alternative
    Overlapping
    sORF
    Upstream
Brain, E14, Glioma,
Liver, MEF, MESC
161658 246

TTG ...

LAC ...

Ribo-seq
TTG

5 sORFs_org_Mouse
    Alternative
    Overlapping
    sORF
    Upstream
Brain, E14, Liver,
MEF
3180326 UNKNOWN_TRANSCRIPT
MetamORF
ORF ID

The MetamORF ID of the ORF. This is an unique ID referring to the ORF.

ORF
length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.

Nucleic
sequence

The nucleic sequence of the ORF.

Amino acid
sequence

The amino acid sequence of the ORF.

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Start codon

The start codon sequence of the ORF.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

ORF annotations

A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of the cell types in which the ORF has already been identified on the transcript.

166559 270

ATG ...

MMS ...

Predicted
ATG

2 Samandi2017
    sORF
513249 129

ATG ...

MLF ...

Predicted
ATG

1 Samandi2017
    sORF

ORFs

37 ORFs
6 known transcripts
58 ORF to known transcript associations

MetamORF
ORF ID

The MetamORF ID of the ORF. This is an unique ID referring to the ORF.

ORF Length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of each exon constituting the ORF (thus excluding its eventual introns) and includes both the start and the stop codons.

Nucleic
sequence

The nucleic sequence of the ORF.

Amino acid
sequence

The amino acid sequence of the ORF.

Start codon

The start codon sequence of the ORF.

ORF annotations

A comma-separated list of all the annotations computed by our algorithm for the ORF. This list includes the annotations computed for the ORF for all transcripts. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of all the cell types in which the ORF has already been identified.

Transcripts

Display all the transcripts related to the entry.

56401 90

TTG...

LSA...

TTG Alternative
Overlapping
sORF
Upstream
Intronic
ncRNA
Brain, Spleen_B_cell, Testis,
MEF, R1E, v6-5,
BMDC, E14, Liver
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

56078 ENSMUST00000084097 Aurkaip1-201 protein_coding Ribo-seq
2 sORFs_org_Mouse
56402 ENSMUST00000151222 Aurkaip1-206 retained_intron Ribo-seq
moderate 1 sORFs_org_Mouse
165832 ENSMUST00000139651 Aurkaip1-205 protein_coding Ribo-seq
1 sORFs_org_Mouse
1017477 ENSMUST00000129711 Aurkaip1-204 processed_transcript Ribo-seq
2 sORFs_org_Mouse
159159 ENSMUST00000105591 Aurkaip1-202 protein_coding Ribo-seq
1 sORFs_org_Mouse
544515 ENSMUST00000105592 Aurkaip1-203 protein_coding Ribo-seq
6 sORFs_org_Mouse
513249 129

ATG...

MLF...

ATG InCDS
Overlapping
sORF
Liver, Brain, B_cell,
Spleen_B_cell, Glioma,
MEF, MESC
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

544515 ENSMUST00000105592 Aurkaip1-203 protein_coding Ribo-seq
moderate 1 sORFs_org_Mouse
165832 ENSMUST00000139651 Aurkaip1-205 protein_coding Ribo-seq
moderate 1 sORFs_org_Mouse
56078 ENSMUST00000084097 Aurkaip1-201 protein_coding Ribo-seq
moderate 2 sORFs_org_Mouse
3180326 UNKNOWN_TRANSCRIPT Predicted
1 Samandi2017
159159 ENSMUST00000105591 Aurkaip1-202 protein_coding Ribo-seq
moderate 2 sORFs_org_Mouse
Johnstone2016
1651987 33

CTG...

LRT...

CTG InCDS
Overlapping
sORF
MEF
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

544515 ENSMUST00000105592 Aurkaip1-203 protein_coding Ribo-seq
weak 1 sORFs_org_Mouse
1017476 57

ATG...

MMS...

ATG ncRNA
sORF
Brain
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

1017477 ENSMUST00000129711 Aurkaip1-204 processed_transcript Ribo-seq
1 sORFs_org_Mouse
825497 99

CTG...

LQA...

CTG sORF
InCDS
Overlapping
Brain, B_cell
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

165832 ENSMUST00000139651 Aurkaip1-205 protein_coding Ribo-seq
moderate 1 sORFs_org_Mouse
56078 ENSMUST00000084097 Aurkaip1-201 protein_coding Ribo-seq
moderate 1 sORFs_org_Mouse
511983 141

ATG...

MSG...

ATG Alternative
sORF
Upstream
Spleen_B_cell
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

56078 ENSMUST00000084097 Aurkaip1-201 protein_coding Ribo-seq
1 sORFs_org_Mouse
1178413 120

GTG...

VTS...

GTG ncRNA
sORF
Alternative
Overlapping
Upstream
E14, R1E, v6-5,
Liver, MEF
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

1017477 ENSMUST00000129711 Aurkaip1-204 processed_transcript Ribo-seq
3 sORFs_org_Mouse
544515 ENSMUST00000105592 Aurkaip1-203 protein_coding Ribo-seq
2 sORFs_org_Mouse
1225532 39

AAG...

KPL...

AAG InCDS
Overlapping
sORF
E14
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

56078 ENSMUST00000084097 Aurkaip1-201 protein_coding Ribo-seq
moderate 1 sORFs_org_Mouse
57072 96

CTG...

LPW...

CTG Alternative
InCDS
Overlapping
sORF
Intronic
ncRNA
Liver, MEF, Testis,
E14, R1E, Brain
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

544515 ENSMUST00000105592 Aurkaip1-203 protein_coding Ribo-seq
weak 2 sORFs_org_Mouse
56402 ENSMUST00000151222 Aurkaip1-206 retained_intron Ribo-seq
weak 1 sORFs_org_Mouse
1017477 ENSMUST00000129711 Aurkaip1-204 processed_transcript Ribo-seq
weak 2 sORFs_org_Mouse
56078 ENSMUST00000084097 Aurkaip1-201 protein_coding Ribo-seq
weak 1 sORFs_org_Mouse
165832 ENSMUST00000139651 Aurkaip1-205 protein_coding Ribo-seq
weak 1 sORFs_org_Mouse
56077 33

ATG...

MPL...

ATG InCDS
Overlapping
sORF
Testis, Glioma, Liver,
MEF, MESC
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

56078 ENSMUST00000084097 Aurkaip1-201 protein_coding Ribo-seq
moderate 1 sORFs_org_Mouse
159159 ENSMUST00000105591 Aurkaip1-202 protein_coding Ribo-seq
moderate 1 Johnstone2016
166557 267

ATG...

MSG...

ATG Alternative
Overlapping
sORF
Upstream
Brain, E14, MEF
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

159159 ENSMUST00000105591 Aurkaip1-202 protein_coding Ribo-seq
3 sORFs_org_Mouse
1082184 204

ATG...

MSP...

ATG ncRNA
sORF
R1E, v6-5
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

1017477 ENSMUST00000129711 Aurkaip1-204 processed_transcript Ribo-seq
2 sORFs_org_Mouse
513766 138

TTG...

LAA...

TTG sORF
InCDS
Overlapping
Brain, B_cell, Spleen_B_cell
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

165832 ENSMUST00000139651 Aurkaip1-205 protein_coding Ribo-seq
moderate 1 sORFs_org_Mouse
56078 ENSMUST00000084097 Aurkaip1-201 protein_coding Ribo-seq
moderate 2 sORFs_org_Mouse
60814 48

ATG...

MMS...

ATG Alternative
sORF
Upstream
BMDC, Brain, Liver,
Spleen_B_cell, Testis
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

56078 ENSMUST00000084097 Aurkaip1-201 protein_coding Ribo-seq
6 sORFs_org_Mouse
1378457 45

CTG...

LLN...

CTG InCDS
Overlapping
sORF
E14
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

544515 ENSMUST00000105592 Aurkaip1-203 protein_coding Ribo-seq
weak 1 sORFs_org_Mouse
1017735 228

GTG...

VRA...

GTG Alternative
Overlapping
sORF
Upstream
Brain, E14, Liver
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

159159 ENSMUST00000105591 Aurkaip1-202 protein_coding Ribo-seq
3 sORFs_org_Mouse
1083127 168

ATG...

MGY...

ATG ncRNA
sORF
R1E, v6-5
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

1017477 ENSMUST00000129711 Aurkaip1-204 processed_transcript Ribo-seq
2 sORFs_org_Mouse
510340 90

CTG...

LAD...

CTG Alternative
InCDS
Overlapping
sORF
E14, NSC, Spleen_B_cell
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

544515 ENSMUST00000105592 Aurkaip1-203 protein_coding Ribo-seq
strong 2 sORFs_org_Mouse
56078 ENSMUST00000084097 Aurkaip1-201 protein_coding Ribo-seq
strong 1 sORFs_org_Mouse
1430966 279

TTG...

LGE...

TTG sORF
Liver
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

165832 ENSMUST00000139651 Aurkaip1-205 protein_coding Ribo-seq
1 sORFs_org_Mouse
166559 270

ATG...

MMS...

ATG sORF
Alternative
Overlapping
Upstream
Brain, E14,
Glioma, Liver, MEF,
MESC
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

3180326 UNKNOWN_TRANSCRIPT Predicted
2 Samandi2017
159159 ENSMUST00000105591 Aurkaip1-202 protein_coding Ribo-seq
4 sORFs_org_Mouse
Johnstone2016
60998 45

ATG...

MSG...

ATG Alternative
sORF
Upstream
BMDC, Brain, E14,
Liver, Spleen_B_cell, Testis
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

56078 ENSMUST00000084097 Aurkaip1-201 protein_coding Ribo-seq
7 sORFs_org_Mouse
1017949 204

CTG...

LRS...

CTG Alternative
Overlapping
sORF
Upstream
Brain, E14, Liver
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

159159 ENSMUST00000105591 Aurkaip1-202 protein_coding Ribo-seq
3 sORFs_org_Mouse
58119 126

CTG...

LCC...

CTG Intronic
sORF
ncRNA
Testis, E14, R1E
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

56402 ENSMUST00000151222 Aurkaip1-206 retained_intron Ribo-seq
weak 1 sORFs_org_Mouse
1017477 ENSMUST00000129711 Aurkaip1-204 processed_transcript Ribo-seq
2 sORFs_org_Mouse
1230172 138

AAG...

KIR...

AAG sORF
E14
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

165832 ENSMUST00000139651 Aurkaip1-205 protein_coding Ribo-seq
1 sORFs_org_Mouse
1554953 45

TTG...

LRS...

TTG sORF
Brain
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

165832 ENSMUST00000139651 Aurkaip1-205 protein_coding Ribo-seq
moderate 1 sORFs_org_Mouse
1081657 210

TTG...

LKM...

TTG ncRNA
sORF
R1E, v6-5
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

1017477 ENSMUST00000129711 Aurkaip1-204 processed_transcript Ribo-seq
2 sORFs_org_Mouse
509199 81

CTG...

LPP...

CTG InCDS
Overlapping
sORF
B_cell, Spleen_B_cell, Brain
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

56078 ENSMUST00000084097 Aurkaip1-201 protein_coding Ribo-seq
moderate 2 sORFs_org_Mouse
165832 ENSMUST00000139651 Aurkaip1-205 protein_coding Ribo-seq
moderate 1 sORFs_org_Mouse
510068 144

ATG...

MMS...

ATG Alternative
sORF
Upstream
Spleen_B_cell
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

56078 ENSMUST00000084097 Aurkaip1-201 protein_coding Ribo-seq
1 sORFs_org_Mouse
544514 78

CTG...

LGC...

CTG Alternative
InCDS
Overlapping
sORF
3T3, MEF
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

544515 ENSMUST00000105592 Aurkaip1-203 protein_coding Ribo-seq
strong 2 sORFs_org_Mouse
165832 ENSMUST00000139651 Aurkaip1-205 protein_coding Ribo-seq
strong 1 sORFs_org_Mouse
823230 69

CTG...

LVR...

CTG InCDS
Overlapping
sORF
B_cell, Brain
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

56078 ENSMUST00000084097 Aurkaip1-201 protein_coding Ribo-seq
moderate 1 sORFs_org_Mouse
165832 ENSMUST00000139651 Aurkaip1-205 protein_coding Ribo-seq
moderate 1 sORFs_org_Mouse
1082839 228

TTG...

LRS...

TTG ncRNA
sORF
R1E, v6-5
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

1017477 ENSMUST00000129711 Aurkaip1-204 processed_transcript Ribo-seq
2 sORFs_org_Mouse
1017388 231

TTG...

LVR...

TTG Alternative
Overlapping
sORF
Upstream
Brain, E14, Liver
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

159159 ENSMUST00000105591 Aurkaip1-202 protein_coding Ribo-seq
3 sORFs_org_Mouse
1731601 57

GTG...

VQK...

GTG sORF
MEF
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

165832 ENSMUST00000139651 Aurkaip1-205 protein_coding Ribo-seq
moderate 1 sORFs_org_Mouse
1657950 57

TTG...

LCF...

TTG sORF
MEF
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

165832 ENSMUST00000139651 Aurkaip1-205 protein_coding Ribo-seq
2 sORFs_org_Mouse
165831 228

GTG...

VHA...

GTG sORF
Brain, E14, Liver,
MEF
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

165832 ENSMUST00000139651 Aurkaip1-205 protein_coding Ribo-seq
5 sORFs_org_Mouse
1081420 171

GTG...

VMG...

GTG ncRNA
sORF
R1E, v6-5
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

1017477 ENSMUST00000129711 Aurkaip1-204 processed_transcript Ribo-seq
2 sORFs_org_Mouse
161658 246

TTG...

LAC...

TTG Alternative
Overlapping
sORF
Upstream
Brain, E14, Liver,
MEF
MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

159159 ENSMUST00000105591 Aurkaip1-202 protein_coding Ribo-seq
5 sORFs_org_Mouse

Export data

Identification method

Predicted

Ribo-seq

MS

Start Codon

Kozak context

moderate

weak

strong

optimal

ORF length

Transcript biotype

antisense

antisense_RNA

bidirectional_promoter_lncRNA

IG_C_gene

lincRNA

non_stop_decay

nonsense_mediated_decay

polymorphic_pseudogene

processed_pseudogene

processed_transcript

protein_coding

pseudogene

retained_intron

sense_intronic

sense_overlapping

TEC

transcribed_processed_pseudogene

transcribed_unitary_pseudogene

transcribed_unprocessed_pseudogene

unprocessed_pseudogene

ORF Annotations

Reading frame

Alternative

Relative position

CDS

Downstream

InCDS

Intronic

NewCDS

Overlapping

Upstream

Biotype

Intergenic

ncRNA

NMD

NSD

Pseudogene

Length

sORF

Cell types

3T3

B_cell

BMDC

Brain

C2C12

E14

Glioma

Liver

MEF

MESC

Neutrophil

NSC

R1E

Skin_tumor

Spleen_B_cell

Testis

v6-5