Gene : ensembl, release-109
Install mysqlclient: https://pypi.org/project/mysqlclient/
Human
Queries
Query for the basic gene annotations:
(111838, 5)
Copy to clipboard
stable_id
display_label
biotype
description
synonym
0
ENSG00000210049
MT-TF
Mt_tRNA
mitochondrially encoded tRNA-Phe (UUU/C) [Sour...
MTTF
1
ENSG00000210049
MT-TF
Mt_tRNA
mitochondrially encoded tRNA-Phe (UUU/C) [Sour...
trnF
2
ENSG00000211459
MT-RNR1
Mt_rRNA
mitochondrially encoded 12S rRNA [Source:HGNC ...
12S
3
ENSG00000211459
MT-RNR1
Mt_rRNA
mitochondrially encoded 12S rRNA [Source:HGNC ...
MOTS-c
4
ENSG00000211459
MT-RNR1
Mt_rRNA
mitochondrially encoded 12S rRNA [Source:HGNC ...
MTRNR1
display_label
biotype
description
synonym
stable_id
ENSG00000000003
TSPAN6
protein_coding
tetraspanin 6 [Source:HGNC Symbol;Acc:HGNC:11858]
TSPAN-6|T245|TM4SF6
ENSG00000000005
TNMD
protein_coding
tenomodulin [Source:HGNC Symbol;Acc:HGNC:17757]
tendin|ChM1L|TEM|myodulin|BRICD4
ENSG00000000419
DPM1
protein_coding
dolichyl-phosphate mannosyltransferase subunit...
CDGIE|MPDS
ENSG00000000457
SCYL3
protein_coding
SCY1 like pseudokinase 3 [Source:HGNC Symbol;A...
PACE1|PACE-1
ENSG00000000460
C1orf112
protein_coding
chromosome 1 open reading frame 112 [Source:HG...
FLJ10706
(69299, 4)
Copy to clipboard
Query for external ids:
(80857, 4)
Copy to clipboard
stable_id
xref_id
dbprimary_acc
db_name
0
ENSG00000210049
2898423
HGNC:7481
HGNC
1
ENSG00000211459
2898394
HGNC:7470
HGNC
HGNC
stable_id
hgnc_id
6193
ENSG00000277796
HGNC:10628
6195
ENSG00000277796
HGNC:30554
6366
ENSG00000277768
HGNC:10628
6368
ENSG00000277768
HGNC:30554
9921
ENSG00000277336
HGNC:10628
9923
ENSG00000277336
HGNC:30554
12519
ENSG00000288487
HGNC:16346
12520
ENSG00000288487
HGNC:6335
17707
ENSG00000230417
HGNC:31430
17708
ENSG00000230417
HGNC:45111
79713
ENSG00000276085
HGNC:10628
79715
ENSG00000276085
HGNC:30554
hgnc_id
stable_id
ENSG00000277768
HGNC:30554
ENSG00000277336
HGNC:30554
ENSG00000288487
HGNC:6335
Entrez
ncbi_gene_id
stable_id
ENSG00000278294
124907156
ENSG00000278294
124907485
ENSG00000278294
124908250
ENSG00000276779
3805
ENSG00000276779
124900568
...
...
ENSG00000273768
124905574
ENSG00000273768
124905808
ENSG00000273768
124905809
ENSG00000178104
9659
ENSG00000178104
124904395
6437 rows × 1 columns
Merge Ensembl with HGNC and Entrez
display_label
biotype
description
synonym
stable_id
ENSG00000000003
TSPAN6
protein_coding
tetraspanin 6 [Source:HGNC Symbol;Acc:HGNC:11858]
TSPAN-6|T245|TM4SF6
ENSG00000000005
TNMD
protein_coding
tenomodulin [Source:HGNC Symbol;Acc:HGNC:17757]
tendin|ChM1L|TEM|myodulin|BRICD4
ENSG00000000419
DPM1
protein_coding
dolichyl-phosphate mannosyltransferase subunit...
CDGIE|MPDS
ENSG00000000457
SCYL3
protein_coding
SCY1 like pseudokinase 3 [Source:HGNC Symbol;A...
PACE1|PACE-1
ENSG00000000460
C1orf112
protein_coding
chromosome 1 open reading frame 112 [Source:HG...
FLJ10706
...
...
...
...
...
ENSG00000291313
None
protein_coding
novel protein
ENSG00000291314
None
protein_coding
novel protein
ENSG00000291315
None
protein_coding
novel protein
ENSG00000291316
None
protein_coding
novel protein, LOC84773-CYHR1 readthrough
ENSG00000291317
TMEM276
protein_coding
transmembrane protein 276 [Source:HGNC Symbol;...
69299 rows × 4 columns
display_label
biotype
description
synonym
hgnc_id
stable_id
ENSG00000277336
CCL3L3
protein_coding
C-C motif chemokine ligand 3 like 3 [Source:HG...
MGC12815
HGNC:30554
ENSG00000277768
CCL3L3
protein_coding
C-C motif chemokine ligand 3 like 3 [Source:HG...
MGC12815
HGNC:30554
ENSG00000288487
KIR2DS3
protein_coding
killer cell immunoglobulin like receptor, two ...
nkat7
HGNC:6335
ensembl_gene_id
symbol
ncbi_gene_id
hgnc_id
biotype
description
synonyms
0
ENSG00000000003
TSPAN6
7105
HGNC:11858
protein_coding
tetraspanin 6 [Source:HGNC Symbol;Acc:HGNC:11858]
TSPAN-6|T245|TM4SF6
1
ENSG00000000005
TNMD
64102
HGNC:17757
protein_coding
tenomodulin [Source:HGNC Symbol;Acc:HGNC:17757]
tendin|ChM1L|TEM|myodulin|BRICD4
2
ENSG00000000419
DPM1
8813
HGNC:3005
protein_coding
dolichyl-phosphate mannosyltransferase subunit...
CDGIE|MPDS
3
ENSG00000000457
SCYL3
57147
HGNC:19285
protein_coding
SCY1 like pseudokinase 3 [Source:HGNC Symbol;A...
PACE1|PACE-1
4
ENSG00000000460
C1orf112
55732
HGNC:25565
protein_coding
chromosome 1 open reading frame 112 [Source:HG...
FLJ10706
(75124, 7)
Copy to clipboard
Uploaded to: s3://bionty-assets/human_ensembl_release-109_Gene_lookup.parquet
Mouse
Queries
Query for the basic gene annotations:
(84720, 5)
Copy to clipboard
stable_id
display_label
biotype
description
synonym
0
ENSMUSG00000064336
mt-Tf
Mt_tRNA
mitochondrially encoded tRNA phenylalanine [So...
tRNA
1
ENSMUSG00000064336
mt-Tf
Mt_tRNA
mitochondrially encoded tRNA phenylalanine [So...
tRNA-Phe
2
ENSMUSG00000064336
mt-Tf
Mt_tRNA
mitochondrially encoded tRNA phenylalanine [So...
TrnF tRNA
3
ENSMUSG00000064337
mt-Rnr1
Mt_rRNA
mitochondrially encoded 12S rRNA [Source:MGI S...
12S ribosomal RNA
4
ENSMUSG00000064337
mt-Rnr1
Mt_rRNA
mitochondrially encoded 12S rRNA [Source:MGI S...
12S rRNA
display_label
biotype
description
synonym
stable_id
ENSMUSG00000000001
Gnai3
protein_coding
guanine nucleotide binding protein (G protein)...
Galphai3
ENSMUSG00000000003
Pbsn
protein_coding
probasin [Source:MGI Symbol;Acc:MGI:1860484]
PB
ENSMUSG00000000028
Cdc45
protein_coding
cell division cycle 45 [Source:MGI Symbol;Acc:...
Cdc45l
ENSMUSG00000000031
H19
lncRNA
H19, imprinted maternally expressed transcript...
ENSMUSG00000000037
Scml2
protein_coding
Scm polycomb group protein like 2 [Source:MGI ...
4932420G07Rik
(57010, 4)
Copy to clipboard
Query for external ids:
(83066, 4)
Copy to clipboard
stable_id
xref_id
dbprimary_acc
db_name
0
ENSMUSG00000064336
1630742
MGI:102487
MGI
1
ENSMUSG00000064337
1630726
MGI:102493
MGI
MGI
mgi_id
stable_id
ENSMUSG00000115016
MGI:2145569
ENSMUSG00000115016
MGI:5593065
ENSMUSG00000119828
MGI:5455181
ENSMUSG00000119828
MGI:6721448
ENSMUSG00000082414
MGI:3705775
ENSMUSG00000082414
MGI:5434448
Entrez
ncbi_gene_id
stable_id
ENSMUSG00000094741
331195
ENSMUSG00000094741
100503733
ENSMUSG00000094383
108168683
ENSMUSG00000094383
108168684
ENSMUSG00000094383
108169098
...
...
ENSMUSG00000095545
102639505
ENSMUSG00000089756
667962
ENSMUSG00000089756
102639505
ENSMUSG00000078862
628147
ENSMUSG00000078862
665211
622 rows × 1 columns
Merge ensembl with MGI, Entrez
display_label
biotype
description
synonym
stable_id
ENSMUSG00000000001
Gnai3
protein_coding
guanine nucleotide binding protein (G protein)...
Galphai3
ENSMUSG00000000003
Pbsn
protein_coding
probasin [Source:MGI Symbol;Acc:MGI:1860484]
PB
ENSMUSG00000000028
Cdc45
protein_coding
cell division cycle 45 [Source:MGI Symbol;Acc:...
Cdc45l
ENSMUSG00000000031
H19
lncRNA
H19, imprinted maternally expressed transcript...
ENSMUSG00000000037
Scml2
protein_coding
Scm polycomb group protein like 2 [Source:MGI ...
4932420G07Rik
...
...
...
...
...
ENSMUSG00002076988
Gm56371
rRNA
predicted gene, 56371 [Source:MGI Symbol;Acc:M...
ENSMUSG00002076989
Gm23510
snRNA
predicted gene, 23510 [Source:MGI Symbol;Acc:M...
ENSMUSG00002076990
Gm22711
snoRNA
predicted gene, 22711 [Source:MGI Symbol;Acc:M...
ENSMUSG00002076991
Gm55627
misc_RNA
predicted gene, 55627 [Source:MGI Symbol;Acc:M...
ENSMUSG00002076992
Gm54807
misc_RNA
predicted gene, 54807 [Source:MGI Symbol;Acc:M...
57010 rows × 4 columns
display_label
biotype
description
synonym
mgi_id
stable_id
ENSMUSG00000082414
Gm13303
unprocessed_pseudogene
predicted gene 13303 [Source:MGI Symbol;Acc:MG...
MGI:5434448
ENSMUSG00000115016
Gm33906
lncRNA
predicted gene, 33906 [Source:MGI Symbol;Acc:M...
MGI:5593065
ENSMUSG00000119828
Gm25404
snRNA
predicted gene, 25404 [Source:MGI Symbol;Acc:M...
MGI:6721448
ensembl_gene_id
symbol
ncbi_gene_id
mgi_id
biotype
description
synonyms
0
ENSMUSG00000000001
Gnai3
14679
MGI:95773
protein_coding
guanine nucleotide binding protein (G protein)...
Galphai3
1
ENSMUSG00000000003
Pbsn
54192
MGI:1860484
protein_coding
probasin [Source:MGI Symbol;Acc:MGI:1860484]
PB
2
ENSMUSG00000000028
Cdc45
12544
MGI:1338073
protein_coding
cell division cycle 45 [Source:MGI Symbol;Acc:...
Cdc45l
3
ENSMUSG00000000031
H19
14955
MGI:95891
lncRNA
H19, imprinted maternally expressed transcript...
4
ENSMUSG00000000037
Scml2
107815
MGI:1340042
protein_coding
Scm polycomb group protein like 2 [Source:MGI ...
4932420G07Rik
(57388, 7)
Copy to clipboard