MM-PBSA/MM-GBSA Calculations
run_gbsa
StreaMD wraps gmx_MMPBSA tool to compute binding free energies from trajectories.
Usage
run_gbsa -h
run_gbsa -h
usage: run_gbsa [-h] [-i DIRNAME [DIRNAME ...]] [--topol topol.top] [--tpr md_out.tpr] [--xtc md_fit.xtc] [--index index.ndx] [-m mmpbsa.in] [-d WDIR]
[--out_files OUT_FILES [OUT_FILES ...]] [--hostfile FILENAME] [-c INTEGER] [--ligand_id UNL] [-a [STRING ...]] [--clean_previous]
Run MM-GBSA/MM-PBSA calculation using gmx_MMPBSA tool
options:
-h, --help show this help message and exit
-i DIRNAME [DIRNAME ...], --wdir_to_run DIRNAME [DIRNAME ...]
single or multiple directories for simulations. Should consist of: tpr, xtc, ndx files
--topol topol.top topol file from the the MD simulation. Will be ignored if --wdir_to_run is used
--tpr md_out.tpr tpr file from the the MD simulation. Will be ignored if --wdir_to_run is used
--xtc md_fit.xtc xtc file of the simulation. Trajectory should have no PBC and be fitted on the Protein_Ligand group. Will be ignored if --wdir_to_run is used
--index index.ndx Gromacs index file from the simulation. Will be ignored if --wdir_to_run is used
-m mmpbsa.in, --mmpbsa mmpbsa.in
MMPBSA input file. If not set up default template will be used.
-d WDIR, --wdir WDIR Working directory for program output. If not set the current directory will be used.
--out_files OUT_FILES [OUT_FILES ...]
gmxMMPBSA out files (FINAL*.dat) to parse. If set will be used over other variables.
--hostfile FILENAME text file with addresses of nodes of dask SSH cluster. The most typical, it can be passed as $PBS_NODEFILE variable from inside a PBS script.
The first line in this file will be the address of the scheduler running on the standard port 8786. If omitted, calculations will run on a
single machine as usual.
-c INTEGER, --ncpu INTEGER
number of CPU per server. Use all cpus by default.
--ligand_id UNL Ligand residue ID
-a [STRING ...], --append_protein_selection [STRING ...]
residue IDs whuch will be included in the protein system (cofactors).Example: ZN MG
--clean_previous Clean previous temporary gmxMMPBSA files
-o string, --out_suffix string
Unique suffix for output files. By default, start-time_unique-id.
Unique suffix is used to separate outputs from different runs.
Calculation parameters can be changed/added by customized mmpbsa.in (see https://valdes-tresanco-ms.github.io/gmx_MMPBSA/dev/input_file/). Control trajectory length via startframe, endframe, and interval. Required inputs (tpr/xtc/ndx) are produced by Running Molecular Dynamics.
Examples
Protein-ligand system
run_gbsa --wdir_to_run md_files/md_run/protein_H_HIS_ligand_1 md_files/md_run/protein_H_HIS_ligand_2 -c 128 -m mmpbsa.in
Protein-ligand-cofactor system
Residue names for ligands/cofactors are listed in md_files/md_run/protein-ligand/all_ligand_resid.txt.
# Include a cofactor in the protein selection
run_gbsa --wdir_to_run md_files/md_run/protein_H_HIS_ligand_* --append_protein_selection MG GTP
# Treat a cofactor as the ligand
run_gbsa --wdir_to_run md_files/md_run/protein_H_HIS_ligand_* --append_protein_selection MG --ligand_id GTP
Per-residue decomposition
Add a &decomp section to mmpbsa.in with the desired decomposition parameters. run_gbsa detects this and produces FINAL_DECOMP_MMPBSA_<suffix>.dat along with aggregated GBSA_decomp_avg_<suffix>.csv and PBSA_decomp_avg_<suffix>.csv when PBSA is used.
Outputs
Each run creates (in --wdir or current directory):
log_mmpbsa_*unique-suffix*.log- StreaMD logginglog_mmpbsa_bash_*unique-suffix*.log- gmx_MMPBSA stdoutGBSA_output_*unique-suffix*.csvandPBSA_output_*unique-suffix*.csv(when applicable) - summary CSVPer
--wdir_to_rundirectory:FINAL_RESULTS_MMPBSA_*unique-suffix*.csv(and decomposition files when enabled)
See Outputs and Files for folder layout and other analysis artifacts.