Metam
ORF

Logo

Open Reading Frame (ORF)

ORF

ID CARD

MetamORF ID

679444

Chromosome

1

Strand

+

Start-stop positions

88216199-88216303

Nucleic length (bp)

105

Sequences

Nucleic sequence

ATG ...

Amino acid sequence

MQW ...

Spliced: No

Exons

1

Exons start-end

Transcripts

12 known transcripts

MetamORF
transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript
ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Gene ID

The gene ID. Usually the HGNC ID for H. sapiens genes and the NCBI ID for M. musculus genes.

Relative
positions

The relative positions of the start and stop codons of the ORF on the transcript.

Identification

The method of identification used to identify the ORF. MetamORF currently integrates data from three main type of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advances documentation for more information about this.

Start flanking
sequence

The sequence flanking the start codon of the ORF on the transcript. This sequence registered the nucleotides from -6 to +4 positions, where +1 corresponds to the first nucleotide of the ORF start codon.

Kozak
context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.

Exp.
count

The number of original datasets that identifed the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

ORF
annotations

A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of the cell types in which the ORF has already been identified on the transcript.

1038125 ENSMUST00000014263 protein_coding NCBI:94284 1236 - 1340 Ribo-seq
AGGAATATGC moderate 1 Johnstone2016
    Alternative
    InCDS
    Overlapping
    sORF
Glioma,
Liver, MEF, MESC
1394466 ENSMUST00000150634 protein_coding NCBI:394432 524 - 628 Ribo-seq
AGGAATATGC moderate 1 sORFs_org_Mouse
    sORF
Liver
3036738 UNKNOWN_TRANSCRIPT Ugt1a1_Ugt1a10_Ugt1a2_Ugt1a5_Ugt1a6a_Ugt1a6b_Ugt1a7c_Ugt1a9 - Predicted
1 Samandi2017
    sORF

21160 ENSMUST00000113139 protein_coding NCBI:613123 1191 - 1295 Ribo-seq
AGGAATATGC moderate 1 sORFs_org_Mouse
    Alternative
    InCDS
    Overlapping
    sORF
Liver
2895450 ENSMUST00000073772 protein_coding NCBI:394434 1161 - 1265 Ribo-seq
AGGAATATGC moderate 1 Johnstone2016
    Alternative
    InCDS
    Overlapping
    sORF
Glioma,
Liver, MEF, MESC
7950 ENSMUST00000126203 nonsense_mediated_decay NCBI:394432 384 - 488 Ribo-seq
AGGAATATGC moderate 1 Johnstone2016
    Alternative
    Downstream
    NMD
    Overlapping
    sORF
Glioma,
Liver, MEF, MESC
1397708 ENSMUST00000073049 protein_coding NCBI:394436 1195 - 1299 Ribo-seq
AGGAATATGC moderate 1 Johnstone2016
    InCDS
    Overlapping
    sORF
Glioma,
Liver, MEF, MESC
642270 ENSMUST00000138182 protein_coding NCBI:394430 559 - 663 Ribo-seq
AGGAATATGC moderate 1 sORFs_org_Mouse
    sORF
3T3
584836 ENSMUST00000113142 protein_coding NCBI:394430 1208 - 1312 Ribo-seq
AGGAATATGC moderate 1 Johnstone2016
    Alternative
    InCDS
    Overlapping
    sORF
Glioma,
Liver, MEF, MESC
2895508 ENSMUST00000113137 protein_coding NCBI:394435 1220 - 1324 Ribo-seq
AGGAATATGC moderate 1 Johnstone2016
    InCDS
    Overlapping
    sORF
Glioma,
Liver, MEF, MESC
202847 ENSMUST00000049289 protein_coding NCBI:22236 1182 - 1286 Ribo-seq
AGGAATATGC moderate 1 Johnstone2016
    Alternative
    InCDS
    Overlapping
    sORF
Glioma,
Liver, MEF, MESC
978362 ENSMUST00000097659 protein_coding NCBI:394433 1174 - 1278 Ribo-seq
AGGAATATGC moderate 2 sORFs_org_Mouse
Johnstone2016
    Alternative
    InCDS
    Overlapping
    sORF
Glioma,
Liver, MEF, MESC
2895478 ENSMUST00000113139 protein_coding ENSMUSG00000089675 1191 - 1295 Ribo-seq
AGGAATATGC moderate 1 Johnstone2016
    Alternative
    InCDS
    Overlapping
    sORF
Glioma,
Liver, MEF, MESC

Export data

Identification method

Predicted

Ribo-seq

MS

Kozak context

moderate

weak

strong

optimal

Transcript biotype

antisense

antisense_RNA

bidirectional_promoter_lncRNA

IG_C_gene

lincRNA

non_stop_decay

nonsense_mediated_decay

polymorphic_pseudogene

processed_pseudogene

processed_transcript

protein_coding

pseudogene

retained_intron

sense_intronic

sense_overlapping

TEC

transcribed_processed_pseudogene

transcribed_unitary_pseudogene

transcribed_unprocessed_pseudogene

unprocessed_pseudogene

ORF Annotations

Reading frame

Alternative

Relative position

CDS

Downstream

InCDS

Intronic

NewCDS

Overlapping

Upstream

Biotype

Intergenic

ncRNA

NMD

NSD

Pseudogene

Length

sORF

Cell types

3T3

B_cell

BMDC

Brain

C2C12

E14

Glioma

Liver

MEF

MESC

Neutrophil

NSC

R1E

Skin_tumor

Spleen_B_cell

Testis

v6-5