MetamORF

Open Reading Frame (ORF)

ID CARD

MetamORF ID

3305527

Chromosome

11

Strand

+

Start-stop positions

9384764-9445194

Nucleic length (bp)

3117

Sequences

Nucleic sequence

ATG ...

Amino acid sequence

MDP ...

Spliced: Yes

Exons

25

Exons start-end

9384764-9384847
9403290-9403371
9408486-9408639
9409928-9410086
9414255-9414411
9417059-9417148
9420411-9420505
9420614-9420698
9423006-9423140
9423777-9423876
9424914-9424990
9425146-9425262
9428540-9428629
9429031-9429196
9429674-9429834
9430875-9431003
9433570-9433636
9433721-9433846
9434934-9435031
9436271-9436366
9437754-9437974
9438080-9438285
9440455-9440661
9442081-9442197
9445097-9445194
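
As a quick consistency check on the record above, the exon coordinates can be summed and compared with the reported nucleic length. This is a minimal sketch that assumes the coordinates are 1-based and inclusive on both ends; the helper names `exon_length` and `spliced_length` are illustrative and not part of MetamORF.

```python
# Exon coordinates copied from the "Exons start-end" list of this record.
# Assumption: positions are 1-based and inclusive on both ends.
EXONS = """
9384764-9384847 9403290-9403371 9408486-9408639 9409928-9410086
9414255-9414411 9417059-9417148 9420411-9420505 9420614-9420698
9423006-9423140 9423777-9423876 9424914-9424990 9425146-9425262
9428540-9428629 9429031-9429196 9429674-9429834 9430875-9431003
9433570-9433636 9433721-9433846 9434934-9435031 9436271-9436366
9437754-9437974 9438080-9438285 9440455-9440661 9442081-9442197
9445097-9445194
""".split()


def exon_length(span: str) -> int:
    """Length in bp of one 'start-end' exon span (inclusive bounds)."""
    start, end = (int(x) for x in span.split("-"))
    return end - start + 1


def spliced_length(spans) -> int:
    """Total spliced length: the sum of all exon lengths."""
    return sum(exon_length(s) for s in spans)


total = spliced_length(EXONS)
print(len(EXONS), "exons,", total, "bp,", total // 3, "codons")
```

Under that assumption the 25 exons add up to 3117 bp, which matches the nucleic length shown in the ID card and is a multiple of three.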

Transcripts

1 known transcript

MetamORF transcript ID

The MetamORF ID of the transcript. This is an arbitrary ID that does not correspond to any official (Ensembl, NCBI...) transcript ID or external reference.

Transcript ID

The official transcript ID (usually an Ensembl ID, e.g. ENST00000395565).

Transcript biotype

The biotype of the transcript (as defined by Ensembl).

Gene ID

The gene ID. Usually the HGNC ID for H. sapiens genes and the NCBI ID for M. musculus genes.

Relative positions

The relative positions of the start and stop codons of the ORF on the transcript.

Identification

The method used to identify the ORF. MetamORF currently integrates data from three main types of identification methods: bioinformatic predictions, ribosome profiling experiments and mass spectrometry experiments (either proteomics or proteogenomics). See the data sources section of the advanced documentation for more information.

Start flanking sequence

The sequence flanking the start codon of the ORF on the transcript. It covers the nucleotides from position -6 to position +4, where +1 is the first nucleotide of the ORF start codon.
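
The -6 to +4 window described above can be sketched as a simple slice of the transcript sequence. This example assumes the usual convention in which there is no position 0 (so the window is 10 nucleotides long) and that the start codon position is given as a 0-based index; the function name `start_flanking` and the toy sequence are hypothetical, not taken from MetamORF.

```python
from typing import Optional


def start_flanking(transcript_seq: str, start_index: int) -> Optional[str]:
    """Return the nucleotides from -6 to +4 around a start codon.

    start_index is the 0-based index of the first nucleotide of the start
    codon (position +1). Assumes no position 0, so the window spans
    6 upstream nucleotides plus positions +1 to +4 (10 nt in total).
    Returns None when the transcript is too short to hold the full window.
    """
    lo = start_index - 6   # position -6
    hi = start_index + 4   # one past position +4
    if lo < 0 or hi > len(transcript_seq):
        return None
    return transcript_seq[lo:hi]


# Toy transcript (illustrative only): the ATG starts at index 8.
toy = "GGGCCACCATGGCGTAA"
print(start_flanking(toy, 8))
```

Returning `None` for truncated windows is one possible design choice for ORFs that start too close to a transcript end.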

Kozak context

The Kozak context computed by our algorithm for the ORF on the transcript. See the Kozak contexts section of the advanced documentation for more details regarding the nomenclature we use.
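
For orientation only, the sketch below applies a widely used rule of thumb for Kozak strength: a purine (A or G) at position -3 and a G at position +4. It is an assumption for illustration and not MetamORF's algorithm; in particular, it does not attempt to reproduce the "optimal" category, whose definition is given only in the advanced documentation. The function name `kozak_rule_of_thumb` is hypothetical.

```python
def kozak_rule_of_thumb(flank: str) -> str:
    """Classify a 10-nt start flanking sequence (positions -6 to +4).

    Rule of thumb (an assumption, NOT MetamORF's own nomenclature):
      strong   : purine at -3 and G at +4
      moderate : exactly one of the two
      weak     : neither
    """
    flank = flank.upper().replace("U", "T")
    if len(flank) != 10:
        raise ValueError("expected 10 nucleotides covering -6 to +4")
    purine_at_minus3 = flank[3] in "AG"   # indices 0..5 hold -6..-1
    g_at_plus4 = flank[9] == "G"          # indices 6..9 hold +1..+4
    score = int(purine_at_minus3) + int(g_at_plus4)
    return ("weak", "moderate", "strong")[score]


for seq in ("GCCACCATGG", "GCCTCCATGG", "GCCTCCATGT"):
    print(seq, kozak_rule_of_thumb(seq))
```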

Exp. count

The number of original datasets that identified the ORF on the transcript.

Data sources

The data sources in which the ORF has been identified. Click on the button to display all the original IDs in a pop-up. See the data sources section of the advanced documentation for more details regarding the information related to the data sources and the original ORF IDs.

ORF annotations

A comma-separated list of the annotations computed by our algorithm for the ORF on the transcript. See the section dedicated to ORF annotations in the advanced documentation for more details regarding the nomenclature we use.

Cell types

A comma-separated list of the cell types in which the ORF has already been identified on the transcript.

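MetamORF transcript ID: 3305516; Transcript ID: UNKNOWN_TRANSCRIPT; Gene ID: HGNC:9852; Relative positions: -; Identification: Ribo-seq; Exp. count: 1; Data sources: Erhard2018; Cell types: HFF

MetamORF transcript ID: 2536; Transcript ID: ENST00000379719; Transcript biotype: protein_coding; Gene ID: HGNC:9852; Relative positions: -; Identification: Ribo-seq; Exp. count: 1; Data sources: Johnstone2016; ORF annotations: CDS; Cell types: Brain_tumor, HEK293, HeLa, HFF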

Identification method

Predicted

Ribo-seq

MS

Kozak context

moderate

weak

strong

optimal

Transcript biotype

antisense

antisense_RNA

lincRNA

non_stop_decay

nonsense_mediated_decay

polymorphic_pseudogene

processed_pseudogene

processed_transcript

protein_coding

retained_intron

sense_intronic

sense_overlapping

TEC

transcribed_processed_pseudogene

transcribed_unitary_pseudogene

transcribed_unprocessed_pseudogene

unitary_pseudogene

unprocessed_pseudogene

ORF Annotations

Reading frame: Alternative

Relative position: CDS, Downstream, InCDS, Intronic, NewCDS, Overlapping, Upstream

Biotype: Intergenic, ncRNA, NMD, NSD, Pseudogene

Length: sORF

Cell types

B_cell

BJ

Blood

Brain

Brain_tumor

Breast

Flp-In_T-REx-293

guo_2014

HAP1

HCT116

HEK293

HEK293T

HeLa

hES

HFF

Jurkat

LCL

loayza_puch_2016

MDA-MB-231

MM1S

Monocyte

NCCIT

RPE-1

Skeletal_muscle

THP-1

U2OS