graphein
v1.7.7


文档|纸|教程|安装
蛋白质与互动图库
该软件包提供了产生蛋白质和RNA结构以及生物相互作用网络的几何表示的功能。我们提供与标准PYDATA格式的兼容性,以及旨在易于使用的图形对象,可与流行的深度学习库易用。
| 1.7.0 | foldcomp数据集 | |
| 1.7.0 | 从PDB创建数据集 | |
| 1.6.0 | 蛋白张量模块 | |
| 1.5.0 | alphafold2创建蛋白质图! | |
| 1.5.0 | Dotbracket符号的RNA图构造法 | |
| 1.4.0 | 构造分子图 | |
| 1.3.0 | Pytorch几何的即时数据加载器 | |
| 1.2.0 | 从蛋白质图中提取子图 | |
| 1.2.0 | 蛋白质图分析 | |
| 1.2.0 | Gruphein CLI | |
| 1.2.0 | 蛋白质图可视化! | |
| 1.1.0 | 蛋白质 - 蛋白质相互作用网络支持和结构相互作用组学(使用alphafold2!) | |
| 1.0.0 | 高级和低水平的API,以实现巨大的灵活性 - 创建自己的定制工作流! |
Chaphein提供了一个程序化API和用于构造图形的命令行界面。
Chaphein Configs可以指定为.yaml文件,以从命令行批处理过程图。
文档
graphein -c config.yaml -p path/to/pdbs -o path/to/output| 教程(残留级) | 教程(原子) | 文档 |
from graphein . protein . config import ProteinGraphConfig
from graphein . protein . graphs import construct_graph
config = ProteinGraphConfig ()
g = construct_graph ( config = config , pdb_code = "3eiy" )| 教程 | 文档 |
from graphein . protein . config import ProteinGraphConfig
from graphein . protein . graphs import construct_graph
from graphein . protein . utils import download_alphafold_structure
config = ProteinGraphConfig ()
fp = download_alphafold_structure ( "Q5VSL9" , aligned_score = False )
g = construct_graph ( config = config , path = fp )| 教程 | 文档 |
from graphein . protein . config import ProteinMeshConfig
from graphein . protein . meshes import create_mesh
verts , faces , aux = create_mesh ( pdb_code = "3eiy" , config = config )Chaphein可以从微笑字符串以及.sdf , .mol2和.pdb文件中创建分子图
| 教程 | 文档 |
from graphein . molecule . config import MoleculeGraphConfig
from graphein . molecule . graphs import construct_graph
g = create_graph ( smiles = "CC(=O)OC1=CC=CC=C1C(=O)O" , config = config )| 教程 | 文档 |
from graphein . rna . graphs import construct_rna_graph
# Build the graph from a dotbracket & optional sequence
rna = construct_rna_graph ( dotbracket = '..(((((..(((...)))..)))))...' ,
sequence = 'UUGGAGUACACAACCUGUACACUCUUUC' )| 教程 | 文档 |
from graphein . ppi . config import PPIGraphConfig
from graphein . ppi . graphs import compute_ppi_graph
from graphein . ppi . edges import add_string_edges , add_biogrid_edges
config = PPIGraphConfig ()
protein_list = [ "CDC42" , "CDK1" , "KIF23" , "PLK1" , "RAC2" , "RACGAP1" , "RHOA" , "RHOB" ]
g = compute_ppi_graph ( config = config ,
protein_list = protein_list ,
edge_construction_funcs = [ add_string_edges , add_biogrid_edges ]
)| 教程 | 文档 |
from graphein . grn . config import GRNGraphConfig
from graphein . grn . graphs import compute_grn_graph
from graphein . grn . edges import add_regnetwork_edges , add_trrust_edges
config = GRNGraphConfig ()
gene_list = [ "AATF" , "MYC" , "USF1" , "SP1" , "TP53" , "DUSP1" ]
g = compute_grn_graph (
gene_list = gene_list ,
edge_construction_funcs = [
partial ( add_trrust_edges , trrust_filtering_funcs = config . trrust_config . filtering_functions ),
partial ( add_regnetwork_edges , regnetwork_filtering_funcs = config . regnetwork_config . filtering_functions ),
],
)最简单的安装是通过PIP。 nb这不安装将转换为数据格式的ML/DL库以及与Pytorch 3D生成蛋白质结构网格所需的。更多细节
pip install graphein # For base install
pip install graphein[extras] # For additional featurisation dependencies
pip install graphein[dev] # For dev dependencies
pip install graphein[all] # To get the lot但是,有许多(可选的)实用程序(DSSP,Pymol,GetContacts)无法通过PYPI提供:
conda install -c salilab dssp # Required for computing secondary structural features
conda install -c schrodinger pymol # Required for PyMol visualisations & mesh generation
# GetContacts - used as an alternative way to compute intramolecular interactions
conda install -c conda-forge vmd-python
git clone https://github.com/getcontacts/getcontacts
# Add folder to PATH
echo "export PATH=$PATH:`pwd`/getcontacts" >> ~/.bashrc
source ~/.bashrc
To test the installation, run:
cd getcontacts/example/5xnd
get_dynamic_contacts.py --topology 5xnd_topology.pdb
--trajectory 5xnd_trajectory.dcd
--itypes hb
--output 5xnd_hbonds.tsv
开发环境包括集成到Chaphein中的每个深度学习库中的GPU构建(CUDA 11.1)。
git clone https://www.github.com/a-r-j/graphein
cd graphein
conda env create -f environment-dev.yml
pip install -e .可以通过以下方式执行较轻的安装:
git clone https://www.github.com/a-r-j/graphein
cd graphein
conda env create -f environment.yml
pip install -e .我们在本地提供两个CPU( docker-compose.cpu.yml )和GPU用法( docker-compose.yml )的docker-compose文件。对于GPU使用率,请确保安装了NVIDIA容器工具包。进入容器后,请确保您安装本地安装的音量( pip install -e . )。这也将在本地设置开发环境。
构建(GPU)运行:
docker-compose up -d --build # start the container
docker-compose down # stop the container
如果证明对您的工作有用,请考虑引用Chaphein。
@inproceedings { jamasb2022graphein ,
title = { Graphein - a Python Library for Geometric Deep Learning and Network Analysis on Biomolecular Structures and Interaction Networks } ,
author = { Arian Rokkum Jamasb and Ramon Vi{~n}as Torn{'e} and Eric J Ma and Yuanqi Du and Charles Harris and Kexin Huang and Dominic Hall and Pietro Lio and Tom Leon Blundell } ,
booktitle = { Advances in Neural Information Processing Systems } ,
editor = { Alice H. Oh and Alekh Agarwal and Danielle Belgrave and Kyunghyun Cho } ,
year = { 2022 } ,
url = { https://openreview.net/forum?id=9xRZlV6GfOX }
}