MetamORF

Gene

ID CARD

ID

NCBI:83602

Aliases

6330549H03Rik, AA536742, AA959775, AW060250, ENSMUSG00000020962, Gtf2a1, MGI:1933277, NCBI:83602, OFF:Gtf2a1, Tfiia1, TfIIAa/b

Chromosome

12

Transcripts

44 ORFs
3 known transcripts
61 ORF to known transcript associations

MetamORF transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

ORFs

Display all the transcripts related to the entry.

24291 ENSMUST00000021345 Gtf2a1-201 protein_coding
MetamORF ORF ID

The MetamORF ID of the ORF. This is a unique ID referring to the ORF.

ORF length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of the exons constituting the ORF (thus excluding any introns) and includes both the start and the stop codons.

Nucleic sequence

The nucleic sequence of the ORF.

Amino acid sequence

The amino acid sequence of the ORF.

Identification

The method used to identify the ORF. MetamORF currently integrates data from three main types of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advanced documentation for more information.

Start codon

The start codon sequence of the ORF.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identified the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

ORF annotations

A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of the cell types in which the ORF has already been identified on the transcript.
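
The ORF length definition above can be sketched as follows. This is a minimal illustration, not MetamORF code: the function name `orf_length`, the 1-based inclusive coordinate convention and the example exon coordinates are all assumptions made for the example.

```python
def orf_length(exons):
    """Return the ORF length in bp as the sum of its exon lengths.

    `exons` is a list of (start, end) genomic intervals, assumed here to be
    1-based and inclusive. Introns lying between exons are not counted.
    """
    total = 0
    for start, end in exons:
        if end < start:
            raise ValueError(f"invalid exon interval: ({start}, {end})")
        total += end - start + 1
    return total


# Hypothetical two-exon ORF (invented coordinates): 60 bp + 30 bp,
# separated by an intron that does not contribute to the length.
example_exons = [(1001, 1060), (2001, 2030)]
length = orf_length(example_exons)

# Because the length includes the start and stop codons, the encoded
# peptide has one residue fewer than the number of codons.
peptide_length = length // 3 - 1
```

Under these assumptions, a 90 bp ORF spans 30 codons, the last of which is the stop codon.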

938738 90

CTG ...

LEL ...

Ribo-seq
CTG

moderate

2 sORFs_org_Mouse
    Alternative
    sORF
    Upstream
BMDC, E14
323617 75

ATG ...

MTS ...

Ribo-seq
ATG

2 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
E14, MEF
939565 51

TTG ...

LCC ...

Ribo-seq
TTG

2 sORFs_org_Mouse
    Overlapping
    sORF
    Upstream
BMDC, E14
1299901 192

AGG ...

RKK ...

Ribo-seq
AGG

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
E14
25008 33

TTG ...

LFL ...

Ribo-seq
TTG

weak

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
Testis
940425 84

AAG ...

KMK ...

Ribo-seq
AAG

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
BMDC
323331 36

CTG ...

LRM ...

Ribo-seq
CTG

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
MEF
939851 39

GTG ...

VAN ...

Ribo-seq
GTG

strong

4 sORFs_org_Mouse
    Alternative
    sORF
    Upstream
BMDC, E14, MEF, v6-5
322376 42

TTG ...

LTL ...

Ribo-seq
TTG

2 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
MEF, v6-5
780639 60

ATG ...

MLL ...

Ribo-seq
ATG

moderate

2 sORFs_org_Mouse, Johnstone2016
    InCDS
    Overlapping
    sORF
Glioma, Liver, MEF, MESC
940670 222

AAG ...

KMK ...

Ribo-seq
AAG

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
BMDC
997355 30

ATG ...

MLS ...

Ribo-seq
ATG

3 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
Brain, MEF, v6-5
1300403 219

ATG ...

MKK ...

Ribo-seq
ATG

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
E14
1344401 72

TTG ...

LFY ...

Ribo-seq
TTG

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
E14
1297510 66

ATG ...

MMM ...

Ribo-seq
ATG

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
E14
1346339 39

CTG ...

LVE ...

Ribo-seq
CTG

2 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
E14, NSC
327868 66

GTG ...

VTR ...

Ribo-seq
GTG

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
MEF
1298070 156

ATT ...

IFS ...

Ribo-seq
ATT

weak

1 sORFs_org_Mouse
    Alternative
    sORF
    Upstream
E14
1297018 114

AGG ...

RFL ...

Ribo-seq
AGG

weak

1 sORFs_org_Mouse
    Alternative
    sORF
    Upstream
E14
24675 42

CTG ...

LLL ...

Ribo-seq
CTG

moderate

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
Testis
779752 66

TTG ...

LKM ...

Ribo-seq
TTG

weak

1 sORFs_org_Mouse
    InCDS
    Overlapping
    sORF
MEF
1299820 204

ATG ...

MMM ...

Ribo-seq
ATG

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
E14
939927 132

TTG ...

LAR ...

Ribo-seq
TTG

moderate

2 sORFs_org_Mouse
    Alternative
    sORF
    Upstream
BMDC, E14
939774 255

CTG ...

LRE ...

Ribo-seq
CTG

2 sORFs_org_Mouse
    Overlapping
    sORF
    Upstream
BMDC, E14
24290 39

CTG ...

LLQ ...

Ribo-seq
CTG

weak

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
Testis
1345398 36

CTG ...

LVE ...

Ribo-seq
CTG

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
E14
1346132 75

TTG ...

LFY ...

Ribo-seq
TTG

2 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
E14, NSC
938857 144

ATT ...

IFS ...

Ribo-seq
ATT

weak

1 sORFs_org_Mouse
    Alternative
    sORF
    Upstream
BMDC
940920 69

TTG ...

LCC ...

Ribo-seq
TTG

2 sORFs_org_Mouse
    Overlapping
    sORF
    Upstream
BMDC, E14
939020 63

GTG ...

VAR ...

Ribo-seq
GTG

strong

2 sORFs_org_Mouse
    Alternative
    sORF
    Upstream
BMDC, E14
25749 30

CTG ...

LLP ...

Ribo-seq
CTG

weak

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
Testis
780986 222

ATG ...

MDF ...

Ribo-seq
ATG

1 Johnstone2016
    Alternative
    InCDS
    Overlapping
    sORF
Glioma, Liver, MEF, MESC
1297607 54

AGG ...

RKK ...

Ribo-seq
AGG

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
E14
940539 273

CTG ...

LRE ...

Ribo-seq
CTG

2 sORFs_org_Mouse
    Overlapping
    sORF
    Upstream
BMDC, E14
938807 84

ATT ...

ILE ...

Ribo-seq
ATT

moderate

2 sORFs_org_Mouse
    Alternative
    sORF
    Upstream
BMDC, E14
1346262 36

GTG ...

VES ...

Ribo-seq
GTG

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
E14
940547 180

TTG ...

LAG ...

Ribo-seq
TTG

2 sORFs_org_Mouse
    Overlapping
    sORF
    Upstream
BMDC, E14
1298279 81

ATG ...

MKK ...

Ribo-seq
ATG

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
E14
939337 48

TTG ...

LVS ...

Ribo-seq
TTG

strong

1 sORFs_org_Mouse
    Alternative
    sORF
    Upstream
BMDC
939239 162

TTG ...

LAG ...

Ribo-seq
TTG

2 sORFs_org_Mouse
    Overlapping
    sORF
    Upstream
BMDC, E14
1345746 33

GTG ...

VES ...

Ribo-seq
GTG

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
E14
1300143 123

ATG ...

MTS ...

Ribo-seq
ATG

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
E14
328248 51

ATG ...

MTY ...

Ribo-seq
ATG

weak

1 sORFs_org_Mouse
    InCDS
    Overlapping
    sORF
Spleen_B_cell
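
In the rows above, the first residue shown in each amino acid sequence matches the standard genetic code translation of the listed start codon (for example CTG with L, GTG with V) rather than methionine. The sketch below checks that consistency for a few of the rows. The names `START_CODON_TO_RESIDUE`, `first_residue` and `is_consistent` are illustrative assumptions, and the table only covers the start codons that appear on this page.

```python
# Standard genetic code entries, restricted to the start codons seen on this page.
START_CODON_TO_RESIDUE = {
    "ATG": "M",
    "CTG": "L",
    "TTG": "L",
    "GTG": "V",
    "ATT": "I",
    "AGG": "R",
    "AAG": "K",
}


def first_residue(start_codon):
    """Translate a start codon with the standard genetic code (no Met substitution)."""
    codon = start_codon.upper()
    if codon not in START_CODON_TO_RESIDUE:
        raise KeyError(f"start codon not covered by this sketch: {codon}")
    return START_CODON_TO_RESIDUE[codon]


def is_consistent(start_codon, amino_acid_prefix):
    """Check that a displayed amino acid prefix begins with the translated start codon."""
    return amino_acid_prefix.upper().startswith(first_residue(start_codon))


# (start codon, amino acid prefix) pairs copied from rows listed above.
rows = [("CTG", "LEL"), ("GTG", "VAN"), ("ATT", "IFS"), ("AGG", "RKK"), ("AAG", "KMK")]
all_consistent = all(is_consistent(codon, prefix) for codon, prefix in rows)
```
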
320893 ENSMUST00000063314 Gtf2a1-202 protein_coding

780639 60

ATG ...

MLL ...

Ribo-seq
ATG

moderate

3 sORFs_org_Mouse
    sORF
    Upstream
B_cell, BMDC, MEF
24290 39

CTG ...

LLQ ...

Ribo-seq
CTG

weak

4 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
3T3, Brain, R1E
322376 42

TTG ...

LTL ...

Ribo-seq
TTG

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
Brain
1346262 36

GTG ...

VES ...

Ribo-seq
GTG

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
Brain
780986 222

ATG ...

MDF ...

Ribo-seq
ATG

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
B_cell
327715 45

GTG ...

VLL ...

Ribo-seq
GTG

weak

4 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
3T3, Brain, R1E
25008 33

TTG ...

LFL ...

Ribo-seq
TTG

weak

2 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
Brain, R1E
323331 36

CTG ...

LRM ...

Ribo-seq
CTG

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
Brain
328248 51

ATG ...

MTY ...

Ribo-seq
ATG

weak

5 sORFs_org_Mouse
    sORF
    Upstream
B_cell, BMDC, Brain, MEF
25749 30

CTG ...

LLP ...

Ribo-seq
CTG

weak

4 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
3T3, Brain, R1E
1345746 33

GTG ...

VES ...

Ribo-seq
GTG

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
Brain
24675 42

CTG ...

LLL ...

Ribo-seq
CTG

moderate

4 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
3T3, Brain, R1E
1346339 39

CTG ...

LVE ...

Ribo-seq
CTG

2 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
Brain, R1E
327868 66

GTG ...

VTR ...

Ribo-seq
GTG

2 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
Brain, MEF
323617 75

ATG ...

MTS ...

Ribo-seq
ATG

2 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
Brain, MEF
1345398 36

CTG ...

LVE ...

Ribo-seq
CTG

1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
Brain
779752 66

TTG ...

LKM ...

Ribo-seq
TTG

weak

3 sORFs_org_Mouse
    sORF
    Upstream
B_cell, BMDC, MEF
662756 ENSMUST00000163693 Gtf2a1-203 retained_intron

328248 51

ATG ...

MTY ...

Ribo-seq
ATG

weak

2 sORFs_org_Mouse
    Intronic
    sORF
3T3, R1E
3053164 UNKNOWN_TRANSCRIPT

780986 222

ATG ...

MDF ...

Predicted
ATG

1 Samandi2017
    sORF

ORFs

MetamORF ORF ID

The MetamORF ID of the ORF. This is a unique ID referring to the ORF.

ORF length

The genomic length of the ORF (in bp). This length is defined as the sum of the lengths of the exons constituting the ORF (thus excluding any introns) and includes both the start and the stop codons.

Nucleic sequence

The nucleic sequence of the ORF.

Amino acid sequence

The amino acid sequence of the ORF.

Start codon

The start codon sequence of the ORF.

ORF annotations

A comma-separated list of all the annotations computed by our algorithm for the ORF. This list includes the annotations computed for the ORF for all transcripts. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of all the cell types in which the ORF has already been identified.

Transcripts

Display all the transcripts related to the entry.
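
The ORF-level annotations and cell types described above combine the values computed for each transcript. A minimal sketch of that aggregation is given below, using the values listed on this page for ORF 780639 on transcripts 320893 and 24291. The helper `merge_unique`, the dictionary layout and the order-preserving merge are assumptions for illustration; the page does not state how MetamORF orders or stores these values.

```python
def merge_unique(per_transcript_values):
    """Merge several lists into one, dropping duplicates and keeping first-seen order."""
    seen = set()
    merged = []
    for values in per_transcript_values:
        for value in values:
            if value not in seen:
                seen.add(value)
                merged.append(value)
    return merged


# Per-transcript values for ORF 780639, copied from the transcript tables on this page.
# The dictionary structure itself is hypothetical.
associations = {
    "320893": {
        "annotations": ["sORF", "Upstream"],
        "cell_types": ["B_cell", "BMDC", "MEF"],
    },
    "24291": {
        "annotations": ["InCDS", "Overlapping", "sORF"],
        "cell_types": ["Glioma", "Liver", "MEF", "MESC"],
    },
}

orf_annotations = merge_unique(a["annotations"] for a in associations.values())
orf_cell_types = merge_unique(a["cell_types"] for a in associations.values())
```

With these inputs the merged lists contain the same values as the ORF-level row for 780639: four annotations and six cell types.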

938738 90

CTG...

LEL...

CTG Alternative
sORF
Upstream
BMDC, E14
MetamORF transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript name

The transcript name (e.g. MDK-202).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Identification

The method used to identify the ORF. MetamORF currently integrates data from three main types of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advanced documentation for more information.

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp. count

The number of original datasets that identified the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq
moderate 2 sORFs_org_Mouse
323617 75

ATG...

MTS...

ATG Alternative
InCDS
Overlapping
sORF
E14, MEF, Brain

24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq
2 sORFs_org_Mouse
320893 ENSMUST00000063314 Gtf2a1-202 protein_coding Ribo-seq
2 sORFs_org_Mouse
939565 51

TTG...

LCC...

TTG Overlapping
sORF
Upstream
BMDC, E14

24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq
2 sORFs_org_Mouse
1299901 192

AGG...

RKK...

AGG Alternative
InCDS
Overlapping
sORF
E14

24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq
1 sORFs_org_Mouse
25008 33

TTG...

LFL...

TTG Alternative
InCDS
Overlapping
sORF
Testis, Brain, R1E

24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq
weak 1 sORFs_org_Mouse
320893 ENSMUST00000063314 Gtf2a1-202 protein_coding Ribo-seq
weak 2 sORFs_org_Mouse
780639 60

ATG...

MLL...

ATG sORF
Upstream
InCDS
Overlapping
B_cell, BMDC, MEF, Glioma, Liver, MESC

320893 ENSMUST00000063314 Gtf2a1-202 protein_coding Ribo-seq
moderate 3 sORFs_org_Mouse
24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq
moderate 2 sORFs_org_Mouse, Johnstone2016
24290 39

CTG...

LLQ...

CTG Alternative
InCDS
Overlapping
sORF
3T3, Brain, R1E, Testis

320893 ENSMUST00000063314 Gtf2a1-202 protein_coding Ribo-seq
weak 4 sORFs_org_Mouse
24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq
weak 1 sORFs_org_Mouse
940425 84

AAG...

KMK...

AAG Alternative
InCDS
Overlapping
sORF
BMDC

24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq
1 sORFs_org_Mouse
323331 36

CTG...

LRM...

CTG Alternative
InCDS
Overlapping
sORF
MEF, Brain

24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq
1 sORFs_org_Mouse
320893 ENSMUST00000063314 Gtf2a1-202 protein_coding Ribo-seq
1 sORFs_org_Mouse
322376 42

TTG...

LTL...

TTG Alternative
InCDS
Overlapping
sORF
Brain, MEF, v6-5

320893 ENSMUST00000063314 Gtf2a1-202 protein_coding Ribo-seq 1 sORFs_org_Mouse
24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq 2 sORFs_org_Mouse

1346262 36

GTG...

VES...

GTG
Alternative, InCDS, Overlapping, sORF
Brain, E14

320893 ENSMUST00000063314 Gtf2a1-202 protein_coding Ribo-seq 1 sORFs_org_Mouse
24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq 1 sORFs_org_Mouse

780986 222

ATG...

MDF...

ATG
Alternative, InCDS, Overlapping, sORF
B_cell, Glioma, Liver, MEF, MESC

320893 ENSMUST00000063314 Gtf2a1-202 protein_coding Ribo-seq 1 sORFs_org_Mouse
3053164 UNKNOWN_TRANSCRIPT Predicted 1 Samandi2017
24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq 1 Johnstone2016

939851 39

GTG...

VAN...

GTG
Alternative, sORF, Upstream
BMDC, E14, MEF, v6-5

24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq strong 4 sORFs_org_Mouse

940670 222

AAG...

KMK...

AAG
Alternative, InCDS, Overlapping, sORF
BMDC

24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq 1 sORFs_org_Mouse

327715 45

GTG...

VLL...

GTG
Alternative, InCDS, Overlapping, sORF
3T3, Brain, R1E

320893 ENSMUST00000063314 Gtf2a1-202 protein_coding Ribo-seq weak 4 sORFs_org_Mouse

997355 30

ATG...

MLS...

ATG
Alternative, InCDS, Overlapping, sORF
Brain, MEF, v6-5

24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq 3 sORFs_org_Mouse

328248 51

ATG...

MTY...

ATG
Intronic, sORF, Upstream, InCDS, Overlapping
3T3, R1E, B_cell, BMDC, Brain, MEF, Spleen_B_cell

662756 ENSMUST00000163693 Gtf2a1-203 retained_intron Ribo-seq weak 2 sORFs_org_Mouse
320893 ENSMUST00000063314 Gtf2a1-202 protein_coding Ribo-seq weak 5 sORFs_org_Mouse
24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq weak 1 sORFs_org_Mouse

1300403 219

ATG...

MKK...

ATG
Alternative, InCDS, Overlapping, sORF
E14

24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq 1 sORFs_org_Mouse

1344401 72

TTG...

LFY...

TTG
Alternative, InCDS, Overlapping, sORF
E14

24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq 1 sORFs_org_Mouse

1297510 66

ATG...

MMM...

ATG
Alternative, InCDS, Overlapping, sORF
E14

24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq 1 sORFs_org_Mouse

1346339 39

CTG...

LVE...

CTG
Alternative, InCDS, Overlapping, sORF
E14, NSC, Brain, R1E

24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq 2 sORFs_org_Mouse
320893 ENSMUST00000063314 Gtf2a1-202 protein_coding Ribo-seq 2 sORFs_org_Mouse

327868 66

GTG...

VTR...

GTG
Alternative, InCDS, Overlapping, sORF
MEF, Brain

24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq 1 sORFs_org_Mouse
320893 ENSMUST00000063314 Gtf2a1-202 protein_coding Ribo-seq 2 sORFs_org_Mouse

1298070 156

ATT...

IFS...

ATT
Alternative, sORF, Upstream
E14

24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq weak 1 sORFs_org_Mouse

25749 30

CTG...

LLP...

CTG
Alternative, InCDS, Overlapping, sORF
3T3, Brain, R1E, Testis

320893 ENSMUST00000063314 Gtf2a1-202 protein_coding Ribo-seq weak 4 sORFs_org_Mouse
24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq weak 1 sORFs_org_Mouse

1297018 114

AGG...

RFL...

AGG
Alternative, sORF, Upstream
E14

24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq weak 1 sORFs_org_Mouse

24675 42

CTG...

LLL...

CTG
Alternative, InCDS, Overlapping, sORF
Testis, 3T3, Brain, R1E

24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq moderate 1 sORFs_org_Mouse
320893 ENSMUST00000063314 Gtf2a1-202 protein_coding Ribo-seq moderate 4 sORFs_org_Mouse

779752 66

TTG...

LKM...

TTG
InCDS, Overlapping, sORF, Upstream
MEF, B_cell, BMDC

24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq weak 1 sORFs_org_Mouse
320893 ENSMUST00000063314 Gtf2a1-202 protein_coding Ribo-seq weak 3 sORFs_org_Mouse

1299820 204

ATG...

MMM...

ATG
Alternative, InCDS, Overlapping, sORF
E14

24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq 1 sORFs_org_Mouse

939927 132

TTG...

LAR...

TTG
Alternative, sORF, Upstream
BMDC, E14

24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq moderate 2 sORFs_org_Mouse

939774 255

CTG...

LRE...

CTG
Overlapping, sORF, Upstream
BMDC, E14

24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq 2 sORFs_org_Mouse

1345398 36

CTG...

LVE...

CTG
Alternative, InCDS, Overlapping, sORF
E14, Brain

24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq
1 sORFs_org_Mouse
320893 ENSMUST00000063314 Gtf2a1-202 protein_coding Ribo-seq
1 sORFs_org_Mouse
1346132 75

TTG...

LFY...

TTG Alternative
InCDS
Overlapping
sORF
E14, NSC
24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq
2 sORFs_org_Mouse
938857 144

ATT...

IFS...

ATT Alternative
sORF
Upstream
BMDC
24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq
weak 1 sORFs_org_Mouse
1345746 33

GTG...

VES...

GTG Alternative
InCDS
Overlapping
sORF
Brain, E14
320893 ENSMUST00000063314 Gtf2a1-202 protein_coding Ribo-seq
1 sORFs_org_Mouse
24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq
1 sORFs_org_Mouse
940920 69

TTG...

LCC...

TTG Overlapping
sORF
Upstream
BMDC, E14
24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq
2 sORFs_org_Mouse
939020 63

GTG...

VAR...

GTG Alternative
sORF
Upstream
BMDC, E14
24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq
strong 2 sORFs_org_Mouse
1297607 54

AGG...

RKK...

AGG Alternative
InCDS
Overlapping
sORF
E14
24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq
1 sORFs_org_Mouse
940539 273

CTG...

LRE...

CTG Overlapping
sORF
Upstream
BMDC, E14
24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq
2 sORFs_org_Mouse
938807 84

ATT...

ILE...

ATT Alternative
sORF
Upstream
BMDC, E14
24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq
moderate 2 sORFs_org_Mouse
940547 180

TTG...

LAG...

TTG Overlapping
sORF
Upstream
BMDC, E14
24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq
2 sORFs_org_Mouse
1298279 81

ATG...

MKK...

ATG Alternative
InCDS
Overlapping
sORF
E14
24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq
1 sORFs_org_Mouse
939337 48

TTG...

LVS...

TTG Alternative
sORF
Upstream
BMDC
24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq
strong 1 sORFs_org_Mouse
939239 162

TTG...

LAG...

TTG Overlapping
sORF
Upstream
BMDC, E14
24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq
2 sORFs_org_Mouse
1300143 123

ATG...

MTS...

ATG Alternative
InCDS
Overlapping
sORF
E14
24291 ENSMUST00000021345 Gtf2a1-201 protein_coding Ribo-seq
1 sORFs_org_Mouse

Identification method

Predicted, Ribo-seq, MS

Start Codon

Kozak context

moderate, weak, strong, optimal

ORF length

Transcript biotype

antisense, antisense_RNA, bidirectional_promoter_lncRNA, IG_C_gene, lincRNA, non_stop_decay, nonsense_mediated_decay, polymorphic_pseudogene, processed_pseudogene, processed_transcript, protein_coding, pseudogene, retained_intron, sense_intronic, sense_overlapping, TEC, transcribed_processed_pseudogene, transcribed_unitary_pseudogene, transcribed_unprocessed_pseudogene, unprocessed_pseudogene

ORF Annotations

Reading frame

Alternative

Relative position

CDS, Downstream, InCDS, Intronic, NewCDS, Overlapping, Upstream

Biotype

Intergenic, ncRNA, NMD, NSD, Pseudogene

Length

sORF

Cell types

3T3, B_cell, BMDC, Brain, C2C12, E14, Glioma, Liver, MEF, MESC, Neutrophil, NSC, R1E, Skin_tumor, Spleen_B_cell, Testis, v6-5