1 \documentclass[12pt]{article}
2 %\usepackage{latex2html}
14 \setcounter{secnumdepth}{5}
15 \setcounter{tocdepth}{5}
19 \title{UNRES - A PROGRAM FOR COARSE-GRAINED SIMULATIONS OF PROTEINS}
21 \author{Laboratory of Molecular Modeling\\ Faculty of Chemistry\\ University of Gdansk\\ Sobieskiego 18\\ 80-952 Gdansk, Poland\\
24 Scheraga Group\\ Baker Laboratory of Chemistry \\
25 and Chemical Biology\\ Cornell University\\ Ithaca, NY 14853-1301, USA}
37 %3. General information
39 % 3.2. Functions of the program
40 % 3.3. Companion programs
41 % 3.4. Programming language
44 %5. Customizing your batch and C-shell script
45 %6. Command line and files
48 % 8.1. Main input data file
50 % 8.1.2. Control data (data list format; READ_CONTROL subroutine)
51 % 8.1.2.1 Keywords to chose calculation type
52 % 8.1.2.2 Specification of protein and structure output in non-MD applications
53 % 8.1.2.3. Miscellaneous
54 % 8.1.3. Minimizer options (data list, subroutine READ_MINIM)
55 % 8.1.4. CSA control parameters
56 % 8.1.5. MCM data (data list, subroutine MCMREAD)
57 % 8.1.6. MD data (subroutine READ_MDPAR)
58 % 8.1.7. REMD/MREMD data (subroutine READ_REMDPAR)
59 % 8.1.8. Energy-term weights (data list; subroutine MOLREAD)
60 % 8.1.9. Input and/or reference PDB file name (text format; subroutine MOLREAD)
61 % 8.1.10. Amino-acid sequence (free and text format)
62 % 8.1.11. Disulfide-bridge information (free format; subroutine READ_BRIDGE)
63 % 8.1.12. Dihedral-angle restraint data (free format; subroutine MOLREAD)
64 % 8.1.13. Distance restraints (subroutine READ_DIST_CONSTR)
65 % 8.1.14. Internal coordinates of the reference structure (free format; subroutine READ_ANGLES)
66 % 8.1.15. Internal coordinates of the initial conformation (free format; subroutine READ_ANGLES)
67 % 8.1.15.1. File name with internal coordinates of the conformations to be processed
68 % 8.1.16 Control data for energy map construction (data lists; subroutine MAP_READ)
69 % 8.2. Input coordinate files
70 % 8.3. Other input files
72 % 9.1. Coordinate files
73 % 9.1.1. The internal coordinate (INT) files
74 % 9.1.2. The plain Cartesian coordinate (X) files
75 % 9.1.3. The compressed Cartesian coordinate (CX) files
76 % 9.1.4. The Brookhaven Protein Data Bank format (PDB) files
77 % 9.1.5. The SYBYLL (MOL2) files
78 % 9.2. The summary (STAT) file
80 % 9.2.2. MD and MREMD runs
81 % 9.3. CSA-specific output files
82 %10. Technical support contact information
87 \section{LICENSE TERMS}
93 This software is provided free of charge to academic users, subject to the condition that no part of it be sold or used otherwise for commercial purposes, including, but not limited to its incorporation into commercial software packages, without written consent from the authors. For permission contact Prof. H. A. Scheraga, Cornell University.
96 This software package is provided on an ``as is'' basis. We in no way warrant either this software or results it may produce.
99 Reports or publications using this software package must contain an acknowledgment to the authors and the NIH Resource in the form commonly used in academic research.
108 The current and former developers of UNRES are listed in this section in alphabetic
109 order together with their current or former affiliations.
112 Maurizio Chinchio (formerly Cornell Univ., USA)
113 Cezary Czaplewski (Univ. of Gdansk, Poland)
114 Carlo Guardiani (Georgia State Univ., USA)
115 Yi He (Cornell Univ., USA)
116 Justyna Iwaszkiewicz (Swiss Institute of Bioinformatics, Switzerland)
117 Dawid Jagiela (Univ. of Gdansk, Poland)
118 Stanislaw Jaworski (deceased)
119 Sebastian Kalinowski (Univ. of Gdansk, Poland)
120 Urszula Kozlowska (deceased)
121 Pawel Krupa (Univ. of Gdansk, Poland)
122 Rajmund Kazmierkiewicz (Univ. of Gdansk, Poland)
123 Jooyoung Lee (Korea Institute for Advanced Studies, Korea)
124 Adam Liwo (Univ. of Gdansk, Poland)
125 Mariusz Makowski (Univ. of Gdansk, Poland)
126 Magdalena Mozolewska (Univ. of Gdansk, Poland)
127 Marian Nanias (formerly Cornell Univ., USA)
128 Stanislaw Oldziej (Univ. of Gdansk, Poland)
129 Jaroslaw Pillardy (Cornell Univ., USA)
130 Shelly Rackovsky (Mout Sinai School of Medicine, USA)
131 Daniel Ripoll (formerly Cornell Univ., USA)
132 Jeff Saunders (Schrodinger Inc., USA)
133 Harold A. Scheraga (Cornell Univ., USA)
134 Hujun Shen (Dalian Institute of Chemical Physics, P.R. China)
135 Adam Sieradzan (Univ. of Gdansk, Poland)
136 Ryszard Wawak (formerly Cornell Univ., USA)
137 Tomasz Wirecki (Univ. of Gdansk, Poland)
138 Marta Wisniewska (Univ. of Gdansk, Poland)
139 Yanping Yin (Cornell Univ., USA)
140 Bartlomiej Zaborowski (Univ. of Gdansk, Poland)
145 \section{GENERAL INFORMATION}
149 \label{sect:geninfo:purpose}
151 Run coarse-grained calculations of polypeptide chains with the UNRES force field.
152 There are two versions of the package which should be kept separate because of
153 non-overlapping functions: version which runs global optimization (Conformational
154 Space Annealing, CSA) and version that runs coarse-grained molecular dynamics and
155 its extension. Because the installation, input file preparation and running CSA
156 and MD versions are similar, a common manual is provided. Items specific
157 for the CSA and MD version are marked ``CSA'' and ``MD'', respectively.
159 MD version can be used to run multiple-chain proteins (however, that version of
160 the code is a new release and might fail if yet un-checked functions are used).
161 The multi-chain CSA version for this purpose is another package (written largely in
164 \subsection{Functions of the program}
165 \label{sect:geninfo:functions}
170 Perform energy evaluation of a single or multiple conformations (serial and parallel) (CSA and MD).
173 Run canonical mesoscopic molecular dynamics (serial and parallel) (MD).
176 Run replica exchange (REMD) and multiplexing replica exchange (MREMD) dynamics (parallel only) (MD).
179 Run multicanonical molecular dynamics (parallel only) (MD).
182 Run energy minimization (serial and parallel) (CSA and MD).
185 Run conformational space annealing (CSA search) (parallel only) (CSA).
188 Run Monte Carlo plus Minimization (MCM) (parallel only) (CSA).
191 Run conformational family Monte Carlo (CFMC) calculations (CSA).
194 Thread the sequence against a database from the PDB and minimize energy of each structure (CSA).
198 Energy and force evaluation is parallelized in MD version.
201 \subsection{Companion programs}
202 \label{sect:geninfo:companion}
204 The structures produced by UNRES can be used as inputs to the following programs provided
205 with this package or separately:
209 \item{xdrf2pdb} -- converts the compressed coordinate files from MD (but not MREMD)runs into
212 \item{xdrf2pdb-m} -- same for MREMD runs (multiple trajectory capacity).
214 \item{xdrf2x} -- converts the plain Cartesian coordinate files into PDB format.
216 \item{WHAM} -- processes the coordinate files from MREMD runs and computes temperature profiles
217 of ensemble averages and computes the probabilities of conformations at selected
218 temperatures; also prepares data for CLUSTER and ZSCORE.
220 \item{CLUSTER} -- does the cluster analysis of the conformations; for MREMD runs takes the
221 coordinate files from WHAM which contain information to compute probabilities
222 of conformations at any temperature.
224 \item{PHOENIX} -- conversion of UNRES conformations to all-atom conformations.
226 \item{ZSCORE} -- force field optimization (for developers).
230 Please consult the manuals of the corresponding packages for details. Note that not
231 all of these packages are released yet; they will be released depending on their
232 readiness for distribution. Contact Adam Liwo, Cezary Czaplewski or Stanislaw Oldziej
233 for developmental versions of these programs.
235 \subsection{Programming language}
236 \label{sect:geninfo:language}
238 This version of UNRES is written almost exclusively in Fortran 77; some subroutines
239 for data management are in ansi-C. The package was parallelized with MPI.
243 \subsection{References}
244 \label{sect:geninfo:references}
246 Citing the following references in your work that makes use of UNRES is gratefully
250 \renewcommand{\section}[2]{}%
251 \begin{thebibliography}{10}
254 A. Liwo, S. Oldziej, M.R. Pincus, R.J. Wawak, S. Rackovsky, H.A. Scheraga.
255 A united-residue force field for off-lattice protein-structure simulations.
256 I: Functional forms and parameters of long-range side-chain interaction potentials
257 from protein crystal data. {\it J. Comput. Chem.}, {\bf 1997}, 18, 849-873.
259 \bibitem{liwo_1997_02}
260 A. Liwo, M.R. Pincus, R.J. Wawak, S. Rackovsky, S. Oldziej, H.A. Scheraga.
261 A united-residue force field for off-lattice protein-structure simulations.
262 II: Parameterization of local interactions and determination
263 of the weights of energy terms by Z-score optimization.
264 {\it J. Comput. Chem.}, {\bf 1997}, 18, 874-887.
266 \bibitem{liwo_1997_03}
267 A. Liwo, S. O{\l}dziej, R. Ka\'zmierkiewicz, M. Groth, C. Czaplewski.
268 Design of a knowledge-based force field for off-lattice simulations of protein
270 {\it Acta Biochim. Pol.}, {\bf 1997}, 44, 527-548.
274 A. Liwo, R. Kazmierkiewicz, C. Czaplewski, M. Groth, S. Oldziej, R.J. Wawak,
275 S. Rackovsky, M.R. Pincus, H.A. Scheraga.
276 United-residue force field for off-lattice protein-structure simulations.
277 III. Origin of backbone hydrogen-bonding cooperativity in united-residue potentials.
278 {\it J. Comput. Chem.}, {\bf 1998}, 19, 259-276.
281 A. Liwo, C. Czaplewski, J. Pillardy, H.A. Scheraga.
282 Cumulant-based expressions for the multibody terms for the correlation between
283 local and electrostatic interactions in the united-residue force field.
284 {\it J. Chem. Phys.}, {\bf 2001}, 115, 2323-2347.
287 J. Lee, D.R. Ripoll, C. Czaplewski, J. Pillardy, W.J. Wedemeyer, H.A. Scheraga,
288 Optimization of parameters in macromolecular potential energy functions by
289 conformational space annealing. {\it J. Phys. Chem. B}, {\bf 2001}, 105, 7291-7298
291 \bibitem{pillardy_2001}
292 J. Pillardy, C. Czaplewski, A. Liwo, W.J. Wedemeyer, J. Lee, D.R. Ripoll,
293 P. Arlukowicz, S. Oldziej, Y.A. Arnautova, H.A. Scheraga,
294 Development of physics-based energy functions that predict medium-resolution
295 structures for proteins of the $\alpha, \beta$, and $\alpha/\beta$ structural classes.
296 {\it J. Phys. Chem. B}, {\bf 2001}, 105, 7299-7311
299 A. Liwo, P. Arlukowicz, C. Czaplewski, S. Oldziej, J. Pillardy, H.A. Scheraga.
300 A method for optimizing potential-energy functions by a hierarchical design
301 of the potential-energy landscape: Application to the UNRES force field.
302 {\it Proc. Natl. Acad. Sci. U.S.A.}, {\bf 2002}, 99, 1937-1942.
304 \bibitem{saunders_2003}
305 J. A. Saunders and H.A. Scheraga.
306 Ab initio structure prediction of two $\alpha$-helical oligomers
307 with a multiple-chain united-residue force field and global search.
308 {\it Biopolymers}, {\bf 2003}, 68, 300-317.
310 \bibitem{saunders_2003_02}
311 J.A. Saunders and H.A. Scheraga.
312 Challenges in structure prediction of oligomeric proteins at the united-residue
313 level: searching the multiple-chain energy landscape with CSA and CFMC procedures.
314 {\it Biopolymers}, {\bf 2003}, 68, 318-332.
316 \bibitem{oldziej_2003}
317 S. Oldziej, U. Kozlowska, A. Liwo, H.A. Scheraga.
318 Determination of the potentials of mean force for rotation about C$^\alpha$-C$^\alpha$
319 virtual bonds in polypeptides from the ab initio energy surfaces of terminally
320 blocked glycine, alanine, and proline. {\it J. Phys. Chem. A}, {\bf 2003}, 107, 8035-8046.
323 A. Liwo, S. Oldziej, C. Czaplewski, U. Kozlowska, H.A. Scheraga.
324 Parameterization of backbone-electrostatic and multibody contributions
325 to the UNRES force field for protein-structure prediction from ab initio
326 energy surfaces of model systems. {\it J. Phys. A}, {\bf 2004}, 108, 9421-9438.
328 \bibitem{oldziej_2004}
329 S. Oldziej, A. Liwo, C. Czaplewski, J. Pillardy, H.A. Scheraga.
330 Optimization of the UNRES force field by hierarchical design of the
331 potential-energy landscape. 2. Off-lattice tests of the method with single
332 proteins. {\it J. Phys. Chem. B.}, {\bf 2004}, 108, 16934-16949.
334 \bibitem{oldziej_2004_02}
335 S. Oldziej, J. Lagiewka, A. Liwo, C. Czaplewski, M. Chinchio,
336 M. Nanias, H.A. Scheraga.
337 Optimization of the UNRES force field by hierarchical design of the
338 potential-energy landscape. 3. Use of many proteins in optimization.
339 {\it J. Phys. Chem. B.}, {\bf 2004}, 108, 16950-16959.
341 \bibitem{oldziej_2004_03}
342 M. Khalili, A. Liwo, F. Rakowski, P. Grochowski, H.A. Scheraga.
343 Molecular dynamics with the united-residue model of polypeptide chains.
344 I. Lagrange equations of motion and tests of numerical stability in the
345 microcanonical mode, {\it J. Phys. Chem. B}, {\bf 2005}, 109, 13785-13797.
347 \bibitem{khalili_2005}
348 M. Khalili, A. Liwo, A. Jagielska, H.A. Scheraga.
349 Molecular dynamics with the united-residue model of polypeptide chains.
350 II. Langevin and Berendsen-bath dynamics and tests on model $\alpha$-helical
351 systems. {\it J. Phys. Chem. B}, {\bf 2005}, 109, 13798-13810.
353 \bibitem{khalili_2005_02}
354 A. Liwo, M. Khalili, H.A. Scheraga.
355 Ab initio simulations of protein-folding pathways by molecular dynamics with
356 the united-residue model of polypeptide chains.
357 {\it Proc. Natl. Acad. Sci. U.S.A.}, {\bf 2005}, 102, 2362-2367.
359 \bibitem{rakowski_2006}
360 F. Rakowski, P. Grochowski, B. Lesyng, A. Liwo, H. A. Scheraga.
361 Implementation of a symplectic multiple-time-step molecular dynamics algorithm,
362 based on the united-residue mesoscopic potential energy function.
363 {\it J. Chem. Phys.}, {\bf 2006}, 125, 204107.
365 \bibitem{nanias_2006}
366 M. Nanias, C. Czaplewski, H.A. Scheraga.
367 Replica exchange and multicanonical algorithms with the coarse-grained
368 united-residue (UNRES) force field.
369 {\it J. Chem. Theory and Comput.}, {\bf 2006}, 2, 513-528.
372 A. Liwo, M. Khalili, C. Czaplewski, S. Kalinowski, S. Oldziej, K. Wachucik, H.A. Scheraga.
373 Modification and optimization of the united-residue (UNRES) potential energy
374 function for canonical simulations. I. Temperature dependence of the effective
375 energy function and tests of the optimization method with single training
377 {\it J. Phys. Chem. B}, {\bf 2007}, 111, 260-285.
379 \bibitem{kozlowska_2007}
380 U. Kozlowska, A. Liwo, H.A. Scheraga.
381 Determination of virtual-bond-angle potentials of mean force for coarse-grained
382 simulations of protein structure and folding from ab initio energy surfaces of
383 terminally-blocked glycine, alanine, and proline.
384 {\it J. Phys.: Condens. Matter}, {\bf 2007}, 19, 285203.
386 \bibitem{chichio_2007}
387 M. Chinchio, C. Czaplewski, A. Liwo, S. Oldziej, H.A. Scheraga.
388 Dynamic formation and breaking of disulfide bonds in molecular dynamics
389 simulations with the UNRES force field.
390 {\it J. Chem. Theory Comput.}, {\bf 2007}, 3, 1236-1248.
393 A.V. Rojas, A. Liwo, H.A. Scheraga.
394 Molecular dynamics with the united-residue force field: Ab Initio folding
395 simulations of multichain proteins.
396 {\it J. Phys. Chem. B}, {\bf 2007}, 111, 293-309.
399 A. Liwo, C. Czaplewski, S. Oldziej, A.V. Rojas, R. Kazmierkiewicz,
400 M. Makowski, R.K. Murarka, H.A. Scheraga.
401 Simulation of protein structure and dynamics with the coarse-grained UNRES
402 force field. In: Coarse-Graining of Condensed Phase and Biomolecular
403 Systems., ed. G. Voth, Taylor \& Francis, 2008, Chapter 8, pp. 107-122.
405 \bibitem{czaplewski_2009}
406 C. Czaplewski, S. Kalinowski, A. Liwo, H.A. Scheraga.
407 Application of multiplexed replica exchange molecular dynamics
408 to the UNRES force field: tests with $\alpha$ and $\alpha+\beta$ proteins.
409 {\it J. Chem. Theory Comput.}, {\bf 2009}, 5, 627-640.
412 Y. He, Y. Xiao, A. Liwo, H.A. Scheraga.
413 Exploring the parameter space of the coarse-grained UNRES force field by random
414 search: selecting a transferable medium-resolution force field.
415 {\it J. Comput. Chem.}, {\bf 2009}, 30, 2127-2135.
417 \bibitem{kozlowska_2010}
418 U. Kozlowska, A. Liwo. H.A. Scheraga.
419 Determination of side-chain-rotamer and side-chain and backbone
420 virtual-bond-stretching potentials of mean force from AM1 energy surfaces of
421 terminally-blocked amino-acid residues, for coarse-grained simulations of
422 protein structure and folding. 1. The Method.
423 {\it J. Comput. Chem.}, {\bf 2010}, 31, 1143-1153.
425 \bibitem{kozlowska_2010_02}
426 U. Kozlowska, G.G. Maisuradze, A. Liwo, H.A. Scheraga.
427 Determination of side-chain-rotamer and side-chain and backbone
428 virtual-bond-stretching potentials of mean force from AM1 energy surfaces of
429 terminally-blocked amino-acid residues, for coarse-grained simulations of
430 protein structure and folding. 2. Results, comparison with statistical
431 potentials, and implementation in the UNRES force field.
432 {\it J. Comput. Chem.}, {\bf 2010}, 31, 1154-1167.
435 A. Liwo, S. Oldziej, C. Czaplewski, D.S. Kleinerman, P. Blood, H.A. Scheraga.
436 Implementation of molecular dynamics and its extensions with the coarse-grained
437 UNRES force field on massively parallel systems; towards millisecond-scale
438 simulations of protein structure, dynamics, and thermodynamics.
439 {\it J. Chem. Theory Comput.}, {\bf 2010}, 6, 890-909.
441 \bibitem{sieradzan_2012}
442 A.K. Sieradzan, U.H.E. Hansmann, H.A. Scheraga, A. Liwo.
443 Extension of UNRES force field to treat polypeptide chains with D-amino-acid residues.
444 {\it J. Chem. Theory Comput.}, {\bf 2012}, 8, 4746-4757.
447 P. Krupa, A.K. Sieradzan, S. Rackovsky, M. Baranowski, S. O{\l}dziej,
448 H.A. Scheraga, A. Liwo, C. Czaplewski.
449 Improvement of the treatment of loop structures in the UNRES
450 force field by inclusion of coupling between backbone- and
451 side-chain-local conformational states
452 {\it J. Chem. Theory Comput.}, {\bf 2013}, 4620-4632.
454 \bibitem{sieradzan_2014}
455 A.K. Sieradzan, A. Niadzvedtski, H.A. Scheraga, A. Liwo.
456 Revised backbone-virtual-bond-angle potentials to treat the L- and D-amino
457 acid residues in the coarse-grained united residue (UNRES) force field.
458 {\it J. Chem. Theory Comput.}, {\bf 2014}, 10, 2194-2203.
460 \end{thebibliography}
465 \section{INSTALLATION}
468 Please follow the instructions in the installation guide to download and put the package on your
469 system. In what follows, \$UNRESROOT is the location of the UNRES package in your system.
471 It is recommended to install all components of the package using the Cmake utility.
472 Please follow the instructions in the installation guide.
474 This section describes the installation of only the UNRES component of the package,
475 using make program. Sample Makefiles are present in the respective source directories.
477 To produce the executable do the following:
479 \begin{enumerate}[(a)]
482 To build parallel version, make sure that MPI is installed in your system.
483 Note that the package will have limited functions when compiled in a single-CPU mode.
484 On linux cluster the command source \$HOME/.env should be added to .tcshrc
485 or equivalent file to use parallel version of the program, the
486 alternative is to use queuing system like PBS.
487 In some cases the FORTRAN library subroutine GETENV does not work properly
488 with MPI, if the script is run interactively. In such a case try to
489 add the source mygentenv.F and turn on the -DMYGETENV preprocessor flag.
492 Change directory to the respective source directory.
495 Select the appropriate Makefile\_xxxx or copy the most matching Makefile\_xxx
496 to another name (e.g., Makefile\_MySystem) and edit it to customize to your
497 system. Note that the CSA version works only with MPI.
499 Makefile\_pgf90 - Linux, the pgf90 compiler,
500 Makefile\_intel - Linux, Intel Fortran compiler,
501 Makefile\_gfortran - Gnu Fortran compiler,
502 Makefile\_bluegene - BlueGene/Q (AIX Fortan).
505 Please note that Makefile must be a symbolic link to the Makefile\_xxx of choice. Make sure
506 that the file cinfo.f is present; if not, execute:
513 Other systems should not cause problems; all you have to do is to change
514 the compiler, compiler options, and preprocessor options.
516 By default, the executables will be placed in \$UNRESROOT/bin/unres/CSA
517 \$UNRESROOT/bin/unres/MD and UNRES/bin/unres/MINIM, respectively.
519 The following architectures are defined in the .F source files:
523 \item{AIX} -- AIX systems (put -DAIX as one of the preprocessor options, if
524 this is your system).
526 \item{LINUX} -- Linux (put -DLINUX).
528 \item{G77} -- Gnu-Fortran compilers (might require sum moderate source code editing)
529 (put -DG77). The recommended compiler is gfortran and not g77.
531 \item{PGI} -- PGI compilers.
533 \item{WINPGI} -- additional setting for PGI compilers for MS Windows.
535 \item{SGI} -- all SGI platforms; should also be good for SUN platforms (put -DSGI).
537 \item{CRAY} -- handles some Cray-specific I/Os and other instructions.
539 \item{WIN} -- MS Windows with Digital Fortran compiler (put -DWIN)
543 For other platforms, the only problems might appear in connection with
544 machine-specific I/O instructions. Many files are opened in the append
545 mode, whose specification in the OPEN statement is quite machine-dependent.
546 In this case you might need to modify the source code accordingly.
547 The other platform dependent routines are the timing routines contained
548 in timing.F. In addition to the platforms specified above, ES9000, SUN,
549 KSR, and CRAY are defined there.
551 For parallel build -DMP and -DMPI must be set (these are set in Makefile).
553 IMPORTANT! Apart from this, two define flags: -DCRYST\_TOR and -DMOMENT
554 define earlier versions of the force field. The MUST NOT be entered, if
555 the CASP5 and later versions of the force field are used.
559 Build the unres executables by typing at your UNIX prompt:
562 make # will build unres
563 make clean # will remove the object files
566 The bin directory contains pre-built binaries for Red Hat Linux. These
567 executables are specified in the csh scripts listed in section 4.
571 Customize the C-shell scripts unres.unres (to run the parallel version on
572 set of workstation). See the next section of this manual for guidance.
574 After the executables are build and C-shell scripts customized, you can run the
575 test examples contained in UNRES/examples.
581 \section{CUSTOMIZING YOUR C-SHELL SCRIPT}
584 IMPORTANT NOTE -- The unres.csh script is for Linux and should also be easily
585 adaptable to other systems running MPICH. This script is for interactive
586 parallel jobs. Examples of scripts compatible with PBS (pbs.sub) and LoadLever
587 (sp2.sub) queuing systems are also provided.
589 Edit the following lines in your unres.csh script:
592 set DD = your_database_directory
595 e.g., if you installed the package on the directory /usr/local, this line
599 set DD = /usr/local/UNRES/PARAM
600 set BIN = your_binaries_directory
601 set FGPROCS = number_of_processors_per_energy/force_evaluation (MD)
604 e.g., if the root directory is as above:
607 set BIN = /usr/local/UNRES/bin
610 \section{COMMAND LINE AND FILES}
613 To run UNRES interactively enter the following command at your Unix prompt
614 or put it in the batch script:
617 unres.csh POTENTIAL INPUT N_PROCS
622 POTENTIAL specifies the side-chain interaction potential type and must be
623 one of the following:
627 \item{LJ} -- 6-12 radial Lennard-Jones.
629 \item{LJK} -- 6-12 radial Lennard-Jones-Kihara (shifted Lennard Jones).
631 \item{BP} -- 6-12 anisotropic Berne-Pechukas based on Gaussian overlap (dilated
634 \item{GB} -- 6-12 anisotropic Gay-Berne (shifted Lennard-Jones).
636 \item{GBV} -- 6-12 anisotropic Gay-Berne-Vorobjev (shifted Lennard-Jones).
638 See section \ref{sect:forcefields} (Force Fields) for explanation and usage.
640 At present, only the LJ and GB potentials are applied. The LJ potential
641 is used in the ``CASP3'' version of the UNRES force field that is able
642 to predict only $\alpha$-helical structures. All further version of the
643 UNRES force field use the GB potential. For the description of all above-mentioned
644 potentials see ref. \cite{liwo_1997_02}.
646 \item{INPUT} is the prefix for input and output files (see below)
648 \item{N\_PROCS} is the number of processors; for a CSA or REMD/MREMD run it MUST be at least 2.
652 Note! The script takes one more variable, FGPROCS, as the fourth argument,
653 which is the number of fine-grain processors to parallelize energy
654 evaluations. The corresponding code is in UNRES/CSA, but it was written
655 using MPL instead of MPI and therefore is never used in the present version.
656 At present we have no plans to rewrite fine-grain parallelization using MPI,
657 because we found that the scalability for up to 200 residue polypeptide
658 chains was very poor, due to a small number of interactions and,
659 correspondingly, unfavorable ratio of the overhead to the computation time.
663 \item{INPUT.inp} contains the main input data and the control parameters of the CSA
666 \item{INPUT.out\_POTENTIAL\_xxx} is the main output files from different processors; xxx
667 denotes the number of the processor
669 \item{INPUT\_POTENTIALxxx.stat} is the summary files with the energies, energy components,
670 and RMS deviations of the conformations produced by each of the processors;
671 not used in CSA runs; also it outputs different quantity in MD/MREMD runs.
673 CSA version specific files:
675 \item{INPUT\_POTENTIALxxx.int} is the internal coordinates; in the CSA run
677 \item{INPUT\_POTENTIAL\_000.int} contains the coordinates of the conformations,
678 and the other files are empty
680 \item{INPUT.CSA.history} is the history file from a CSA run. This is an I/O file, because
681 it can be used to restart an interrupted CSA run.
683 \item{INPUT.CSA.seed} stores the random seed generated in a CSA run; written for
686 \item{INPUT.CSA.bank} is the current bank of conformations obtained in CSA calculations
687 (expressed as internal coordinates). This information is also stored in
688 INPUT\_POTENTIAL000.int
690 \item{INPUT.CSA.rbank} -- as above, but contains random-generated conformations.
694 MD version specific files:
698 \item{INPUT\_MDyyy.pdb} is the Cartesian coordinates of the conformations in PDB format.
700 \item{INPUT\_MDyyy.x} is the Cartesian coordinates of the conformations in ASCII format.
702 \item{INPUT\_MDyyy.cx} is the Cartesian coordinates of the conformations in compressed format
703 (need xdr2pdb to convert to PDB format).
706 The program currently produces some more files, but they are not used
707 for any purposes and most of them are scratched after a run is completed.
709 The run script also contains definitions of the parameter files through the
710 following environmental variables:
714 \item{SIDEPAR} -- parameters of the SC-SC interaction potentials ($U_{SC SC}$);
716 \item{SCPPAR} -- parameters of the SC-p interaction potential ($U_{SCp}$); this file can
717 be ignored by specifying the -DOLDSCP preprocessor flag, which means that the
718 built-in parameters are used; at present they are the same as the parameters
719 in the file specified by SCPPAR;
721 \item{ELEPAR} -- parameters of the p-p interaction potentials ($U_{pp}$);
723 \item{FOURIER} -- parameters of the multibody potentials of the coupling between the
724 backbone-local and backbone-electrostatic interactions ($U_{corr}$);
726 \item{THETPAR} -- parameters of the virtual-bond-angle bending potentials ($U_b$);
728 \item{ROTPAR} -- parameters of the side-chain rotamer potentials ($U_{rot}$);
730 \item{TORPAR} -- parameters of the torsional potentials ($U_{rot}$);
732 \item{TORDPAR} -- parameters of the double-torsional potentials.
734 \item{SCCORPAR} -- parameters of the torsional potentials that account for the
735 coupling between the local backbone and local sidechain states (implemented recently).
741 \section{FORCE FIELDS}
742 \label{sect:forcefields}
744 UNRES is being developed since 1997 and several versions of the force field
745 were produced. The settings and references to these force fields are
748 Force fields for CSA version (can be used in MD but haven't been parameterized for this
752 \hspace{-2cm}\begin{longtable}{|l|l|l|l|l|l|l|}\hline
754 %---------------------------------------------------------------------------------------
755 & Additional & SC-SC & Example script & Structural &\\
756 Force field & compiler flags& potential& and executables & classes covered& References\\
757 & & & (Linux; PGF90 &&\\
758 & & & and IFC) &&\\ \hline
759 %---------------------------------------------------------------------------------------
760 CASP3 & -DCRYST\_TOR & LJ & unres\_CASP3.csh &only $\alpha$ &\cite{liwo_1997,liwo_1997_02,liwo_1998}\\
761 & -DCRYST\_BOND & &unres\_pgf90\_cryst\_tor.exe&&\\
762 & -DCRYST\_THETA & &unres\_ifc6\_cryst\_tor.exe &&\\
766 ALPHA & -DMOMENT & GB & unres\_CASP4.csh &only $\alpha$ &\cite{liwo_2001,lee_2001,pillardy_2001}\\
767 & -DCRYST\_BOND & &unres\_pgf90\_moment.exe &&\\
768 & -DCRYST\_THETA & &unres\_ifc6\_moment.exe &&\\
771 BETA & -DMOMENT & GB & unres\_CASP4.csh &only $\beta$ &\cite{liwo_2001,lee_2001,pillardy_2001}\\
772 & -DCRYST\_BOND & &unres\_pgf90\_moment.exe &&\\
773 & -DCRYST\_THETA & &unres\_ifc6\_moment.exe &&\\
776 ALPHABETA & -DMOMENT & GB & unres\_CASP4.csh & all &\cite{liwo_2001,lee_2001,pillardy_2001}\\
777 & -DCRYST\_BOND & &unres\_pgf90\_moment.exe &&\\
778 & -DCRYST\_THETA & &unres\_ifc6\_moment.exe &&\\
781 CASP5 & -DCRYST\_BOND & GB & unres\_CASP5.csh & all &\cite{liwo_2002,saunders_2003,saunders_2003_02,liwo_2004}\\
782 & -DCRYST\_THETA & & unres\_pgf90.exe &&\\
783 & -DCRYST\_SC & & unres\_ifc6.exe &&\\
785 3P & -DCRYST\_BOND & GB & unres\_3P.csh & all &\cite{oldziej_2004,oldziej_2004_02}\\
786 & -DCRYST\_THETA & & unres\_pgf90.exe &&\\
787 & -DCRYST\_SC & & unres\_ifc6.exe &&\\
789 4P & -DCRYST\_BOND & GB & unees\_4P.csh & all &\cite{oldziej_2004,oldziej_2004_02}\\
790 & -DCRYST\_THETA & & unres\_pgf90.exe&&\\
791 & -DCRYST\_SC & & unres\_ifc6.exe&&\\ \hline
792 %---------------------------------------------------------------------------------------
798 Force fields for MD version \cite{khalili_2005,khalili_2005_02}.
801 \begin{longtable}{|l|l|l|l|l|l|l|}\hline
802 %---------------------------------------------------------------------------------------
803 & Additional & SC-SC & Example script & Structural &\\
804 Force field & compiler flags& potential& and executables & classes covered& References\\
805 & & & (Linux; PGF90&&\\
806 & & & and IFC)&&\\ \hline
807 %---------------------------------------------------------------------------------------
808 GAB & -DCRYST\_BOND & GB & unres\_GAB.csh & mostly $\alpha$ & \cite{liwo_2007}\\
809 & -DCRYST\_THETA &&&&\\
813 E0G & -DCRYST\_BOND & GB & unres\_E0G.csh & mostly $\alpha$ & \cite{liwo_2007}\\
814 & -DCRYST\_THET &&&&\\
818 E0LL2Y &-DPROCOR & GB & unres\_ab.csh & all & \cite{liwo_2007,kozlowska_2007,he_2009,kozlowska_2010,kozlowska_2010_02}\\ \hline
819 %---------------------------------------------------------------------------------------
823 The example scripts (the *.csh filed) contain all appropriate parameter files, while
824 the energy-term weights are provided in the example input files listed in EXAMPLES.TXT
825 (*.inp; see section \ref{sect:input}. for description of the input files). However, it is user's
826 responsibility to specify appropriate compiler flags. Note that a version WILL NOT work,
827 if the force-field specific compiler flags are not set. The parameter files specified
828 in the run script also must strictly correspond to the energy-term weights specified in
829 the input file. The parameter files for specific force fields are also specified below
830 and the energy-term weights are specified in section \ref{sect:input}.
832 The parameter files are as follows (the environment variables from section \ref{sect:command} are
833 used to identify the parameters):
837 \begin{longtable}{ll}
838 BONDPAR &bond.parm \\
839 THETPAR &thetaml.5parm\\
840 ROTPAR &scgauss.parm\\
841 TORPAR &torsion\_cryst.parm\\
842 TORDPAR &torsion\_double\_631Gdp.parm (not used)\\
843 SIDEPAR &scinter\_LJ.parm\\
844 ELEPAR &electr.parm\\
846 FOURIER &fourier\_GAP.parm (not used)\\
847 SCCORPAR&sccor\_am1\_pawel.dat (not used)\\
850 ALPHA, BETA, ALPHABETA (CASP4):
852 \begin{longtable}{ll}
853 BONDPAR &bond.parm \\
854 THETPAR &thetaml.5parm\\
855 ROTPAR &scgauss.parm\\
856 TORPAR &torsion\_ecepp.parm\\
857 TORDPAR &torsion\_double\_631Gdp.parm (not used)\\
858 SIDEPAR &scinter\_GB.parm\\
859 ELEPAR &electr.parm\\
861 FOURIER &fourier\_GAP.parm\\
862 SCCORPAR&sccor\_am1\_pawel.dat (not used)\\
867 \begin{longtable}{ll}
869 THETPAR &thetaml.5parm\\
870 ROTPAR &scgauss.parm\\
871 TORPAR &torsion\_631Gdp.parm\\
872 TORDPAR &torsion\_double\_631Gdp.parm\\
873 SIDEPAR &scinter\_GB.parm\\
874 ELEPAR &electr\_631Gdp.parm\\
876 FOURIER &fourier\_opt.parm.1igd\_iter7n\_c\\
877 SCCORPAR&sccor\_am1\_pawel.dat (not used)\\
882 \begin{longtable}{ll}
884 THETPAR &thetaml.5parm\\
885 ROTPAR &scgauss.parm\\
886 TORPAR &torsion\_631Gdp.parm\\
887 TORDPAR &torsion\_double\_631Gdp.parm\\
888 SIDEPAR &sc\_GB\_opt.3P7\_iter81\_1r\\
889 ELEPAR &electr\_631Gdp.parm\\
891 FOURIER &fourier\_opt.parm.1igd\_hc\_iter3\_3\\
892 SCCORPAR&sccor\_am1\_pawel.dat (not used)\\
897 \begin{longtable}{ll}
899 THETPAR &thetaml.5parm\\
900 ROTPAR &scgauss.parm\\
901 TORPAR &torsion\_631Gdp.parm\\
902 TORDPAR &torsion\_double\_631Gdp.parm\\
903 SIDEPAR &sc\_GB\_opt.4P5\_iter33\_3r\\
904 ELEPAR &electr\_631Gdp.parm\\
906 FOURIER &fourier\_opt.parm.1igd\_hc\_iter3\_3\\
907 SCCORPAR&sccor\_am1\_pawel.dat (not used)\\
912 \begin{longtable}{ll}
914 THETPAR &thetaml.5parm\\
915 ROTPAR &scgauss.parm\\
916 TORPAR &torsion\_631Gdp.parm\\
917 TORDPAR &torsion\_double\_631Gdp.parm\\
918 SIDEPAR &sc\_GB\_opt.1gab\_3S\_qclass5no310-shan2-sc-16-10-8k\\
919 ELEPAR &electr\_631Gdp.parm\\
921 FOURIER &fourier\_opt.parm.1igd\_hc\_iter3\_3\\
922 SCCORPAR&sccor\_pdb\_shelly.dat\\
927 \begin{longtable}{ll}
929 THETPAR &thetaml.5parm\\
930 ROTPAR &scgauss.parm\\
931 TORPAR &torsion\_631Gdp.parm\\
932 TORDPAR &torsion\_double\_631Gdp.parm\\
933 SIDEPAR &sc\_GB\_opt.1e0g-52-17k-2k-newclass-shan1e9\_gap8g-sc\\
934 ELEPAR &electr\_631Gdp.parm\\
936 FOURIER &fourier\_opt.parm.1igd\_hc\_iter3\_3\\
937 SCCORPAR&sccor\_pdb\_shelly.dat\\
942 \begin{longtable}{ll}
943 BONDPAR &bond\_AM1.parm\\
944 THETPAR &theta\_abinitio.parm\\
945 ROTPAR &rotamers\_AM1\_aura.10022007.parm\\
946 TORPAR &torsion\_631Gdp.parm\\
947 TORDPAR &torsion\_double\_631Gdp.parm\\
948 SIDEPAR &scinter\_\${POT}.parm\\
949 ELEPAR &electr\_631Gdp.parm\\
951 FOURIER &fourier\_opt.parm.1igd\_hc\_iter3\_3\\
952 SCCORPAR&sccor\_am1\_pawel.dat\\
955 Additionally, for E0LL2Y, the following environment variables and files are required
956 to generate random conformations:
958 THETPARPDB thetaml.5parm\\
959 ROTPARPDB scgauss.parm
961 For CSA, the best force field is 4P. For MD, the 1L2Y\_1LE1 force field is best for
962 ab initio prediction but provides medium resolution (5 A for 60-residue proteins) and
963 overemphasizes $\beta$-structures and has to be run with secondary-structure-prediction
964 information. For prediction of the structure of mostly $\alpha$-protein, and for running
965 dynamics of large proteins, the best is the GAB force field. All these force fields
966 were trained by using our procedure of hierarchical optimization \cite{oldziej_2004,oldziej_2004_02}.
967 The 4P and 1L2Y\_1LE1 force fields have considerable power independent of structural class.
968 The ALPHA, BETA, and ALPHABETA force fields (for CSA) were used in the CASP4 exercises
969 and the CASP5 force field was used in the CASP5 exercise with some success; ALPHA
970 predicts reasonably the structure of $\alpha$-helical proteins and is still not obsolete,
971 while for $\beta$- and $\alpha+\beta$-structure prediction
972 3P or 4P should be used, because they are cheaper and more reliable than BETA and
973 ALPHABETA. The early CASP3 force field is included for historical reasons only.
977 \section{INPUT FILES}
980 \subsection{Main input data file}
981 \label{sect:input:main}
983 Most of the data are organized as data lists, where the data can be put
984 in any order, using a series of statements of the form:
988 for simple non-logical variables
994 to indicate that the corresponding option is turned on. For array variables
995 the assignment statement is:
997 KEYWORD=value1,value2,...
999 However, the data lists are unnamed and that must be placed EXACTLY in the
1000 order indicated below. The presence of an \& in the 80th column of a line
1001 indicates that the next line will belong to the same data group. The parser
1002 subroutines that interpret the keywords are case insensitive.
1004 Each group of data organized as a data list is indicated as data list format
1007 \subsubsection{Title}
1008 \label{sect:input:main:title}
1010 Any string containing up to 80 characters. The first input line is always
1011 interpreted as title.
1013 \subsubsection{Control data}
1014 \label{sect:input:main:control}
1016 This data section is in data list format and is read in the READ\_CONTROL subroutine.
1018 \paragraph{Keywords to chose calculation type}
1022 \item{TIMLIM} -- time limit in minutes (960)
1024 %\item{OUT1FILE} -- only the master processor prints the output file in a parallel job
1026 \item{MINIMIZE} -- if present, energy minimization will be carried out.
1028 \item{REGULAR} -- regularize the read in conformation (usually a crystal or
1029 NMR structure) by doing a series of three constrained minimizations,
1030 to keep the structure as close as possible to the starting
1031 (experimental) structure. The constraints are the CA-CA distances
1032 of the initial structure. The constraints are gradually diminished
1033 and removed in the last minimization.
1035 \item{SOFTREG} -- regularize the read in conformation (usually a crystal or NMR
1036 structure) by doing a series of constrained minimizations, with
1037 additional use of soft potential and secondary structure
1038 freezing, to keep the structure as close as possible to the
1039 starting (experimental) structure.
1042 \item{CSA} -- if present, the run is a CSA run. At present, this is the only
1043 reliable mode of doing global conformational search with this
1044 package; it is NOT recommended to use MCM or THREAD for this
1047 \item{MCM} -- if present, this is a Monte Carlo Minimization (MCM) run.
1049 \item{MULTCONF} -- if present, conformations will be read from the INPUT.intin
1052 \item{MD} -- run canonical MD (single or multiple trajectories).
1054 \item{RE} -- run REMD or MREMD (parallel jobs only).
1056 \item{MUCA} -- run multicanonical MD calculations (parallel jobs only).
1058 \item{MAP=number} (integer) --
1059 Conformational map will be calculated in chosen angles.
1061 \item{THREAD=number} (integer) --
1062 Threading or threading-with-minimization run, using a database of structures
1063 contained in the \$DD/patterns.cart pattern data base (502 chains or chain
1064 fragments), using a total number patterns. It is recommended to use this with
1065 energy minimization; this implies regularization of each minimized pattern.
1066 See refs. \cite{liwo_1997_02} and \cite{liwo_1997_03}.
1068 \item{CHECKGRAD} -- compare numerical and analytical gradient; to be followed by:
1070 \item{CART} -- energy gradient in virtual-bond vectors (Cartesian coordinates)
1072 \item{INT} -- energy gradient in internal coordinates (default)
1074 \item{CARINT} -- derivatives of the internal coordinates in the virtual-bond vectors.
1078 \paragraph{Specification of protein and structure output in non-MD applications}
1082 \item{ONE\_LETTER} -- one-letter and not three-letter code of the amino-acid residues
1085 \item{SYM} (1) -- number of chains with same sequence (for oligomeric proteins only).
1087 \item{PDBSTART} -- the initial conformation is read in from a PDB file.
1089 \item{UNRES\_PDB} -- the starting conformation is in UNRES representation (C$^\alpha$
1090 and SC coordinates only). This keyword MUST appear in such a case
1091 or the program will generate erroneous and unrealistic side-chain
1094 \item{RAND\_CONF} -- start from a random conformation.
1096 \item{EXTCONF} -- start from an extended chain conformation.
1098 \item{PDBOUT} -- if present, conformations will be output in PDB format. Note that
1099 this keyword affects only the output from single energy evaluation,
1100 energy minimization and multiple-conformation data. To request
1101 conformations from MD/MREMD runs in PDB format, the MDPDB keyword
1102 must be placed on the MD input record.
1104 \item{MOL2OUT} -- if present, conformations will be output in SYBYL mol2 format.
1106 \item{REFSTR} -- if present, reference structure will be read (e.g., to monitor
1107 the RMS deviation from the crystal structure).
1109 \item{PDBREF} -- if present, a reference structure will be read in to compare
1110 the calculated conformations with it.
1112 \item{UNRES\_PBD} -- the starting/reference structure is read from an UNRES-generated
1117 Keywords: PDBOUT, MOL2OUT, PDBREF, and PDBSTART are ignored for a CSA run.
1118 Output mode for MD version is specified in MD input (see section \ref{sect:input:main:MD}).
1120 \paragraph{Miscellaneous}
1124 \item{CONSTR\_DIST=number}
1127 \item{0} -- no distance restraints,
1128 \item{$>0$} -- imposes harmonic restraints on selected distances; see section 5.12.
1129 In MD version, also restraints on the q variable \cite{liwo_2007} can be used.
1132 \item{WEIDIS=number} (real)
1133 the weight of the distance term; applies for REGULARIZE and THREAD, otherwise
1136 \item{USE\_SEC\_PRED} -- use secondary-structure prediction information.
1138 \item{SEED=number} (integer) (no default)
1139 Random seed (required, even if the run is not a CSA, MCM, MD or MREMD run).
1141 \item{PHI} -- only the virtual-bond dihedral angles $\gamma$ are considered as
1142 variables in energy minimization.
1144 \item{BACK} -- only the backbone virtual angles (virtual-bond angles theta and
1145 virtual-bond dihedral angles $\gamma$) are considered as variables
1146 in energy minimization.
1148 By default, all internal coordinates: $\theta$, $\gamma$, and the side-chain
1149 centroid polar angles $\alpha$ and $\beta$ are considered as variables in energy
1152 \item{RESCALE\_MODE=number} (real)
1153 Choice of the type of temperature dependence of the force field.
1155 \item{0} -- no temperature dependence
1156 \item{1} -- homographic dependence (not implemented yet with any force field)
1157 \item{2} -- hyperbolic tangent dependence \cite{liwo_2007}.
1160 \item{T\_BATH=number} (real)
1161 temperature (for MD runs and temperature-dependent force fields).
1164 The following keywords apply to MCM only:
1168 \item{MAXGEN=number} (integer) (10000)
1169 maximum number of conformations generated in a single MCM iteration
1171 \item{MAXOVERLAP=number} (integer) (1000)
1172 maximum number of conformations with ``bad'' overlaps allowed to appear in a
1173 row in a single MCM iteration.
1175 \item{DISTCHAINMAX} -- (multi-chain capacity only) maximum distance between the
1176 last residue of a given chain and the first residue of the
1177 next chain such that restraints will not be imposed; quartic
1178 restraints will be imposed for greater distances.
1180 \item{ENERGY\_DEC} -- detailed energies will be printed for each interacting pair
1181 or each virtual bond, virtual-bond angle and dihedral angle,
1182 side chain, etc. DO NOT use unless a single energy evaluation
1186 \subsubsection{Minimizer options}
1188 This data section is in data list format and is read in the READ\_MINIM subroutine.
1190 This data group is present, if MINIMIZE was specified on the control card.
1191 Otherwise, it must not appear.
1195 \item{CART} -- minimize in virtual-bond vectors instead of angles.
1197 \item{MAXMIN=number} (integer) (2000)
1198 maximum number of iterations of the SUMSL minimizer.
1200 \item{MAXFUN=number} (integer) (5000)
1201 maximum number of function evaluations in a single minimization.
1203 \item{TOLF=number} (real) (1.0e-2)
1204 Tolerance on function.
1206 \item{RTOLF=number} (real) (1.0d-4)
1207 Relative tolerance on function.
1209 \item{PRINT\_INI} -- turns on printing nondefault minimization parameters,
1210 initial variables, and gradients in the SUMSL procedures.
1212 \item{PRINT\_FINAL} -- turns on printing final variables and gradients in
1215 \item{PRINT\_STAT} -- turns on printing abbreviated minimization protocol.
1219 The SUMSL minimizer is used in UNRES/CSA. For detailed description of
1220 the control parameters see the source file cored.f and sumsld.f
1223 \subsubsection{CSA control parameters}
1224 \label{sect:input:main:CSA}
1226 This data group should be present only, if CSA was specified on the control
1227 card. It is recommended that the readers to read publications on CSA method
1228 for more complete description of the parameters. Brief description of
1233 \item{NCONF=number} (integer) (50)
1234 This corresponds to the size of the bank at the beginning of the
1235 CSA procedure. The size of the bank, nbank, is set to nconf.
1236 If necessary (at much later stages of the CSA: see icmax below),
1237 nbank increases by multiple of nconf.
1239 \item{JSTART=number} (integer) (1)
1241 \item{JEND}=number (integer) (1)
1242 This corresponds to the limit values of do loop, each of which
1243 corresponds to an separate CSA run. If jstart=1, and jstart=100,
1244 this routine will repeat 100 separate CSA runs (limited by CPU)
1245 each one with separate random number initialization.
1246 The only difference between two CSA runs (one with jstart=jend=1
1247 and another one with jstart=jend=2) would be different random
1248 number initializations if other parameters are identical.
1250 \item{NSTMAX=number} (integer) (500000)
1251 This is to set a limit the total number of local minimizations of CSA
1256 N1=number (integer) (6)\\
1257 N2=number (integer) (4)\\
1258 N3=number (integer) (0)\\
1259 N4=number (integer) (0)\\
1260 N5=number (integer) (0)\\
1261 N6=number (integer) (10)\\
1262 N7=number (integer) (0)\\
1263 N8=number (integer) (0)\\
1264 N9=number (integer) (0)\\
1265 IS1=number (integer) (1)\\
1266 IS2=number (integer) (8)\\
1268 These numbers are used to generate trial conformations for each seed.
1269 See the file newconf.f for more details.
1272 \item{n1:} the total number of trial conformations for each seed by substituting
1273 nran number of variable angles (see subroutine newconf1ab and
1274 subroutine newconf1ar),
1275 \item{n2:} the total number of trial conformations for each seed by substituting
1276 nran number of groups of variable angles (see subroutine newconf1bb and
1277 subroutine newconf1br),
1278 \item{n3:} the total number of trial conformations for each seed by substituting
1279 a window of residues which forms a $\beta$-hairpin, if there is no enough
1280 $\beta$-hairpins uses the same algorithm as n6,
1281 \item{n4:} the total number of trial conformations for each seed by shifting the
1282 turn in $\beta$-hairpin by +/- 1 or 2 residues, if there is no enough
1283 $\beta$-hairpins uses the same algorithm as n6,
1284 \item{n5:} not used,
1285 \item{n6:} the total number of trial conformations for each seed by substituting
1286 a window of residues [is1,is2] inclusive. The size of the window is
1287 determined in a random fashion (see subroutine newconf\_residue for
1288 generation of the trial conformations),
1289 \item{n7:} the total number of trial conformations for each seed by copying a
1290 remote strand pair forming nonlocal $\beta$-sheet contact,
1291 \item{n8:} the total number of trial conformations for each seed by copying an
1292 $\alpha$-helical segment,
1293 \item{n9:} the total number of trial conformations for each seed by shifting the
1294 $\alpha$-helical segment by +/- 1 or 2 residues.
1297 Typical values used for a 75-residue helical protein is
1298 (6 4 0 0 0 10 1 26) for (n1,n2,n3,n4,n5,n6,is1,is2), respectively.
1299 In this example, a total of 20 trial conformations are generated for a seed
1300 Usually is1=1 is used for all applications, and the value of is2 is set about
1301 to 1/3 of the total number of residues. n3, n4 and n7 are design to help in
1302 case of proteins with $\beta$-sheets
1304 NRAN0=number (integer) (4)\\
1305 NRAN1=number (integer) (2)\\
1306 IRR=number (integer) (1)\\
1308 These numbers are used to determine if the CSA stage is very early.
1309 One can use (4 2 1) for these values. For more details one should look into
1310 the file, newconf.f, for more details.
1312 NTOTAL=number (integer) (10000)\\
1313 CUT1=number (real) (2.0)\\
1314 CUT2=number (real) (5.0)\\
1316 Annealing schedule is set in following fashion.
1317 The value of D\_cut is reduced geometrically from 1/cut1 of D\_ave (at the
1318 beginning) to 1/cut2 of D\_ave (after ntotal number of minimizations) where
1319 D\_ave is the average distance between two conformations in the First\_bank.
1323 \item{ESTOP=number} (real) (-3000.0)
1324 The CSA procedure stops if a conformations with energy lower than estop is
1325 obtained. If the do-loop set by jstart and jend requires more than one loop,
1326 the program will go on until the do-loop is finished.
1328 \item{ICMAX=number} (integer) (3)
1329 The maximum value of cycle (see the original publications for details).
1330 If the number of cycle exceeds this value the program will add nconf
1331 more conformations to Bank and First\_bank to continue CSA procedure if
1332 the new size of the nbank is within the maximum set by nbankm (see above).
1333 If the size of nbank exceeds the maximum set by nbankm the CSA procedure
1334 for this run will stop and next CSA will begin depending on the do-loop
1335 set by jstart and jend.
1337 \item{IRESTART=number} (integer) (0)
1338 This tells you if the run is fresh start (irestart=0) or a restart (irestart=1)
1339 starting from an old results
1341 \item{NDIFF=number} (integer) (2)
1342 The number of variables use in comparison when structure is added to the
1343 bank,4 - all angels, 2 - only backbone angles $\gamma$ and $\theta$
1345 \item{NBANKTM=number} (integer) (0)
1346 The maximum number of structures saved in *.CSA.bankt as history of the run
1347 Do not use bankt on massively parallel computation as it kills scalability.
1349 \item{DELE=number} (real) (20.0)
1350 Energy cutoff for bankt.
1352 \item{DIFCUT=number} (real) (720.0)
1353 Angle cutoff for bankt.
1355 \item{IREF=number} (integer) (0)
1356 0 - normal run, 1 - local CSA which generates only structures close to the
1357 reference one read from *.CSA.native.int file.
1359 \item{RMSCUT=number} (real) (4.0)
1360 CA RMSD cut off used in local CSA
1362 \item{PNCCUT=number} (real) (0.5)
1363 Percentage of native contact used in local CSA
1365 \item{NCONF\_IN=number} (integer) (0)
1366 The number of conformation read for the first bank from the input file
1370 Optionally, the CSA parameters can be read from file INPUT.CSA.in, if
1371 this file exists. If so, they are read in free format in the following
1377 n1,n2,n3,n4,n5,n6,n7,n8,is1,is2\\
1383 ntbankm,dele,difcut\\
1384 iref,rmscut,pnccut\\
1388 \subsubsection{MCM data}
1389 \label{sect:input:main:MCM}
1391 (Data list format, subroutine MCMREAD.)
1393 This data group is present, if MCM was specified on the control card.
1394 Otherwise it must not appear.
1398 \item{MAXACC=number} (integer) (100)
1399 Maximum number of accepted conformations.
1401 \item{MAXTRIAL=number} (integer) (100)
1402 Maximum number of unsuccessful trials in a row.
1404 \item{MAXTRIAL\_ITER=number} (integer) (1000)
1405 Maximum number of unsuccessful trials in a single iteration.
1407 \item{MAXREPM=number} (integer) (200)
1408 Maximum number of repetitions of the same minimum.
1410 \item{RANFRACT=number} (real) (0.5d0)
1411 Fraction of chain-rebuild motions.
1413 \item{OVERLAP=number} (real) (1.0d3)
1414 Bad contact energy criterion.
1416 \item{NSTEPH=number} (integer) (0)
1417 Number of heating step in adaptive sampling.
1419 \item{NSTEPC=number} (integer) (0)
1420 Number of cooling step in adaptive sampling.
1422 \item{TMIN=number} (real) (298.0d0)
1423 Minimum temperature in adaptive-temperature sampling).
1425 \item{TMAX=number} (real) (298.0d0)
1426 Maximum temperature in adaptive-temperature sampling).
1428 The temperature is changed according to the formula:
1430 T = TMIN*EXP(ISTEPH*(TMAX-TMIN)/NSTEPH) when heating
1434 T = TMAX*EXP(-ISTEPC*(TMAX-TMIN)/NSTEPC) when cooling
1436 The default is to use a constant temperature.
1438 \item{NWINDOW=number} (integer) (0)
1439 Number of windows in which the variables will be perturbed; the windows are
1440 defined by the numbers of the respective amino-acid residues. If NWINDOW
1441 is nonzero, after specifying all MCM input the next lines must define the
1442 windows. Each line looks like this:
1444 winstart winend (free format)
1446 e.g. if NWINDOW=2, the input:
1451 will mean that only the variables of residues 4-10 and 15-20 will be perturbed.
1452 However, in general, all variables will be considered in minimization.
1454 \item{PRINT\_MC=number} (0)
1455 Printout level in MCM. 0 - no intermediate printing, 1 and 2 - moderate
1456 printing, 3 - extensive printing.
1458 \item{NO\_PRINT\_STAT} -- no output to INPUT\_POTENTIALxxx.stat.
1460 \item{NO\_PRINT\_INT} -- no internal-coordinate output to INPUT\_POTENTIALxxx.int.
1464 \subsubsection{MD data}
1465 \label{sect:input:main:MD}
1467 (Mixed format; subroutine READ\_MDPAR.)
1471 \item{NSTEP} (1000000) number of time steps per trajectory.
1473 \item{NTWE} (100) NTWX (1000) frequency of energy and coordinate output, respectively.
1474 The coordinates are dumped in the pdb or compressed Gromacs (cx) format,
1475 depending on the next keyword.
1476 NTWE=0 means no energy dump.
1478 \item{MDPDB} - dump coordinates in the PDB format (cx otherwise)
1480 \item{TRAJ1FILE} only the master processor outputs coordinates. This feature pertains
1481 only to REMD/MREMD jobs and overrides NTWX; coordinates are dumped at every
1484 \item{REST1FILE} only the master writes the restart file
1486 \item{DT} (real) (0.1) time step; the unit is ``molecular time unit'' (mtu); 1 mtu = 48.9 fs
1488 \item{DAMAX} (real) (1.0) maximum allowed change of acceleration during a single time step.
1489 The time step gets scaled down, if this is exceeded.
1491 \item{DVMAX} (real) (20.0) -- maximum allowed velocity (in A/mtu)
1493 \item{EDRIFTMAX} (real) (10.0) -- maximum allowed energy drift in a single MD step (10 kcal/mol)
1495 \item{REST} -- restart flag. The calculation is restarted if present.
1497 \item{LARGE} -- very detailed output. Don't use except for debugging.
1499 \item{PRINT\_COMPON} -- prints energy components.
1501 \item{RESET\_MOMENT} (1000) -- frequency of zeroing out the total angular momentum when
1502 running Berendsen mode calculations (for Langevin calculations meaningless).
1504 \item{RESET\_VEL}=number (integer) (1000) -- frequency of resetting velocities to values
1505 from Gaussian distribution.
1507 \item{RATTLE} -- use the RATTLE algorithm (constraint bonds); not yet implemented.
1509 \item{RESPA} -- use the Multiple Time Step (MTS) or Adaptive Multiple Time Step (A-MTS)
1510 algorithm \cite{rakowski_2006}. Without this flag the variable time step (VTS) \cite{khalili_2005} is run.
1512 \item{NTIME\_SPLIT=number} (integer) (1) -- initial number of time-split steps
1514 \item{MAXTIME\_SPLIT=number} (integer) (64) -- maximum number of time-split step
1516 If NTIME\_SPLIT==MAXTIME\_SPLIT, MTS is run.
1518 \item{R\_CUT=number} (real) (2.0) -- the cut-off distance in splitting the forces into short- and
1519 long-range in site-site VDW distance units.
1521 \item{LAMBDA} (real) (0.3) -- the transition length (in site-site VDW distance units) between
1522 short- and long-range forces.
1524 \item{XIRESP} -- flag to use MTS/A-MTS with Nos\'e-Hoover/Nos\'e-Poincar\'e thermostats.
1526 \item{LANG=number} (integer) (0) Langevin dynamics flag:
1529 \item{0} -- No explicit Langevin dynamics.
1530 \item{1} -- Langevin with direct integration of the equations of motion (recommended
1531 for Langevin calculations)
1532 \item{2} -- Langevin calculation with analytical pre-integration of the friction and
1533 stochastic part of the equations of motion using an algorithm adapted from TINKER.
1534 This is MUCH MORE time- and memory-consuming than 1 and requires compiling without
1535 the -DLANG0 flag and enormously increases memory requirements.
1536 \item{3} -- The stochastic integrator developed by Cicotti and coworkers.
1537 \item{4} -- for other stochastic integrators (not used at present).
1540 Note: With the enclosed code, the -DLANG0 compiler flag is included which disables
1543 \item{TBF} -- Berendsen thermostat.
1545 \item{TAU\_BATH} (1.0) (units are mtus; 1mtu=48.9 fs) -- constant of the coupling to the thermal bath
1546 used with the Berendsen thermostat.
1548 \item{NOSEPOINCARE99} -- the Nose-Poincare thermostat as of 1999 will be used.
1550 \item{NOSEPOINCARE01} -- the Nose-Poincare thermostat as of 2001 will be used.
1552 \item{NOSEHOOVER96} -- the Nose-Hoover thermostat will be used.
1554 \item{Q\_NP=number} (real) (0.1) -- the value of the mass of the fictitious particle in the calculations
1555 with the Nose-Poincare thermostat.
1557 \item{T\_BATH} (300.0) (in K) -- temperature of canonical simulation or temperature to generate
1560 \item{ETAWAT} (0.8904) -- viscosity of water (in centipoises).
1562 \item{RWAT} (1.4) -- radius of water molecule (in A)
1564 \item{SCAL\_FRIC=number} (real) (0.02) -- scaling factor of the friction coefficients.
1566 \item{SURFAREA} -- scale friction acting on atoms by atoms' solvent accessible area.
1568 \item{RESET\_FRICMAT=number} (integer) (1000) -- recalculate friction matrix every RESET\_FRICMAT MD steps.
1570 \item{USAMPL} -- restraints on q (see reference 5 for meaning) will be imposed (see section .
1571 In this case, the next records specify the restraints; these records are
1572 placed before the list of temperatures or numbers of trajectories.
1574 \item{EQ\_TIME=number} (real) (1.0e4) -- time (in mtus; 1 mtu=48.9 fs) after which restraints
1575 on q will start to be in force.
1579 If USAMPL has been specified, the following information must be supplied after the
1580 main MD input data record (subroutine READ\_FRAGMENTS):
1582 Line 1: nset, npair, nfrag\_back (number of sets of restraints, number of restrained
1583 fragments, number of restrained pairs, number of restrained backbone fragments
1584 (in terms of $\theta$ and $\gamma$ angles)
1586 For each set of restraints (1, 2,..., nset):
1590 \item{mset(iset)} -- how many times the set is multiplied.
1592 \item{wfrag(i,iset), ifrag(1,i,iset), ifrag2(2,i,iset),qfrag(i,iset)} --
1593 weight of the restraint, first and last residue of the fragment, target q value.
1594 This information is repeated through nfrag.
1596 \item{wpair(i,iset), ipair(1,i,iset), ipair(2,i,iset),qinpair(i,iset)} --
1597 weight of the restraint, first and second fragment of the pair (according to fragment
1598 list), target q value. This information is repeated through npair
1600 \item{wfrag\_back(1,i,iset), wfrag\_back(2,i,iset), wfrag\_back(3,i,iset),
1601 ifrag\_back(1,i,iset),ifrag\_back(2,i,iset)} --
1602 weight of the restraints on $\theta$ angles, weight on the restraints on $\gamma$ angles,
1603 weight of the restraints on side-chain rotamers, first residue of the fragment,
1604 last residue of the fragment. This information is repeated through nfrag\_back.
1608 \subsubsection{REMD/MREMD data}
1609 label{sect:input:main:MREMD}
1611 (Miced format; subroutine READ\_REMDPAR.)
1615 \item{NREP} (3) -- number of replicas in a REMD/MREMD run.
1617 \item{NSTEX} (1000) -- number of steps after which exchange is performed in REMD/MREMD
1620 The temperatures in replicas can be specified through
1622 \item{RETMIN} (10.0) -- minimum temperature in a REMD/MREMD run,
1624 \item{RETMAX} (1000.0) -- maximum temperature in a REMD/MREMD run.
1628 Then the range from retmin to retmax is divided into equal segments and
1629 temperature of the replicas assigned accordingly,
1635 \item{TLIST} means that the NREP temperature of the replicas will be input in the
1638 \item{MLIST} numbers of trajectories per each of the NREP temperatures will be
1639 specified in the record after the list of temperatures; this specifies
1644 Important! The number of processors must be exactly equal to the number of
1645 trajectories, i.e., NREP for a REMD run or $\sum_i mlist(i)$ for a MREMD run.
1649 \item{SYNC} -- all trajectories will be synchronized every NSTEX time steps
1650 (by default, they are not synchronized).
1652 \item{TRAJ1FILE} -- only the master processor outputs coordinates. This feature pertains
1653 only to REMD/MREMD jobs and overrides NTWX; coordinates are dumped at every
1656 \item{REST1FILE} -- only the master writes the restart file.
1658 \item{HREMD} -- Hamiltonian replica exchange flag; not only temperatures but also
1659 sets energy-term weights are exchanged between conformations.
1661 \item{TONLY} -- run a ``fake'' HREMD with many sets of energy-term weights in a
1662 single run but only temperature exchange.
1666 \subsubsection{Energy-term weights}
1667 \label{sect:input:main:weights}
1669 (Data list format; subroutine MOLREAD.)
1673 \item{WLONG=number} (real) (1.0d0) --
1674 common weight of the U(SC-SC) (side-chain side-chain interaction)
1675 and U(SC,p) (side-chain peptide-group) term.
1677 \item{WSCC=number} (real) (WLONG) --
1678 weight of the U(SC-SC) term.
1680 \item{WSCP=number} (real) (WLONG)
1681 weight of the U(SC-p) term.
1683 \item{WELEC=number} (real) (1.0d0)
1684 weight of the U(p-p) (peptide-group peptide-group interaction) term.
1686 \item{WEL\_LOC=number} (real) (1.0d0)
1687 weight of the $U_{el;loc}^3$ (local-electrostatic cooperativity, third-order) term.
1689 \item{WCORRH=number} (real) (1.0d0)
1690 weight of the U(corr) (cooperativity of hydrogen-bonding interactions, fourth-order) term.
1692 \item{WCORR5=number} (real) (0.0d0) --
1693 weight of the $U_{el;loc}^5$ (local-electrostatic cooperativity, 5th order
1696 \item{WCORR6=number} (real) (0.0d0) --
1697 weight of the $U_{el;loc}^6$ (local-electrostatic cooperativity, 6th order
1700 \item{WTURN3=number} (real) (1.0d0) --
1701 weight of the $U_{turn}^3$ (local-electrostatic cooperativity within 3 residue
1702 segment, 3rd order contribution).
1704 \item{WTURN4=number} (real) (1.0d0) --
1705 weight of the $U_{turn}^4$ (local-electrostatic cooperativity within 4 residue
1706 segment, 4rd order contributions).
1708 \item{WTURN6=number} (real) (1.0d0) --
1709 weight of the $U_{turn}^6$ (local-electrostatic cooperativity within 6 residue
1710 segment, 6rd order contributions).
1712 \item{WTOR=number} (real) (1.0d0) --
1713 weight of the torsional term, $U_{tor}$.
1715 \item{WANG=number} (real) (1.0d0) --
1716 weight of the virtual-bond angle bending term, $U_b$.
1718 \item{WSCLOC=number} (real) (1.0d0) --
1719 weight of the side-chain rotamer term, $U_{SC}$.
1721 \item{WSTRAIN=number} (real) (1.0d0) --
1722 scaling factor of the distance-constrain or disulfide-bond strain energy term.
1724 \item{SCALSCP=number} (real) (1.0d0) --
1725 scaling factor of $U_{SCp}$; this is an alternative to specifying WSCP; in
1726 this case WSCP will be calculated as WLONG*SCALSCP.
1728 \item{SCAL14=number} (real) (1.0d0) --
1729 scaling factor of the 1,4 SC-p interactions.
1731 \item{CUTOFF} (7.0) -- cut-off on backbone-electrostatic interactions to compute 4-
1732 and higher-order correlations.
1734 \item{DELT\_CORR} (0.5) - thickness of the distance range in which the energy is
1739 The defaults are NOT the recommended values. No ``working'' default values
1740 have been set, because the force field is still under development. The values
1741 corresponding to the force fields listed in section 4 are as follows:
1745 WELEC=1.5 WSTRAIN=1.0 WTOR=0.08617 WANG=0.10384 WSCLOC=0.10384 WCORR=1.5 &
1746 WTURN3=0 WTURN4=0 WTURN6=0 WEL_LOC=0 WCORR5=0 WCORR6=0 SCAL14=0.40 SCALSCP=1.0 &
1747 CUTOFF=7.00000 WSCCOR=0.0
1752 WSC=1.00000 WSCP=0.72364 WELEC=1.10890 WANG=0.68702 WSCLOC=1.79888 &
1753 WTOR=0.30562 WCORRH=1.09616 WCORR5=0.17452 WCORR6=0.36878 WEL_LOC=0.19508 &
1754 WTURN3=0.00000 WTURN4=0.55588 WTURN6=0.11539 CUTOFF=7.00000 WCORR4=0.0000 &
1755 WTORD=0.0 WSCCOR=0.0
1760 WSC=1.00000 WSCP=1.10684 WELEC=0.70000 WANG=0.80775 WSCLOC=1.91939 &
1761 WTOR=3.36070 WCORRH=2.50000 WCORR5=0.99949 WCORR6=0.46247 WEL_LOC=2.50000 &
1762 WTURN3=1.80121 WTURN4=4.35377 WTURN6=0.10000 CUTOFF=7.00000 WCORR4=0.00000 &
1768 WSC=1.00000 WSCP=1.43178 WELEC=0.41501 WANG=0.37790 WSCLOC=0.12880 &
1769 WTOR=1.98784 WCORRH=2.50526 WCORR5=0.23873 WCORR6=0.76327 WEL_LOC=2.97687 &
1770 WTURN3=0.09261 WTURN4=0.79171 WTURN6=0.01074 CUTOFF=7.00000 WCORR4=0.00000 &
1776 WSC=1.00000 WSCP=1.54864 WELEC=0.20016 WANG=1.00572 WSCLOC=0.06764 &
1777 WTOR=1.70537 WTORD=1.24442 WCORRH=0.91583 WCORR5=0.00607 WCORR6=0.02316 &
1778 WEL_LOC=1.51083 WTURN3=2.00764 WTURN4=0.05345 WTURN6=0.05282 WSCCOR=0.0 &
1779 CUTOFF=7.00000 WCORR4=0.00000 WSCCOR=0.0
1784 WSC=1.00000 WSCP=2.85111 WELEC=0.36281 WANG=3.95152 WSCLOC=0.15244 &
1785 WTOR=3.00008 WTORD=2.89863 WCORRH=1.91423 WCORR5=0.00000 WCORR6=0.00000 &
1786 WEL_LOC=1.72128 WTURN3=2.99827 WTURN4=0.59174 WTURN6=0.00000 &
1787 CUTOFF=7.00000 WCORR4=0.00000 WSCCOR=0.0
1792 WSC=1.00000 WSCP=2.73684 WELEC=0.06833 WANG=4.15526 WSCLOC=0.16761 &
1793 WTOR=2.99546 WTORD=2.89720 WCORRH=1.98989 WCORR5=0.00000 WCORR6=0.00000 &
1794 WEL_LOC=1.60072 WTURN3=2.36351 WTURN4=1.34051 WTURN6=0.00000 &
1795 CUTOFF=7.00000 WCORR4=0.00000 WSCCOR=0.0
1800 WLONG=1.35279 WSCP=1.59304 WELEC=0.71534 WBOND=1.00000 WANG=1.13873 &
1801 WSCLOC=0.16258 WTOR=1.98599 WTORD=1.57069 WCORRH=0.42887 WCORR5=0.00000 &
1802 WCORR6=0.00000 WEL_LOC=0.16036 WTURN3=1.68722 WTURN4=0.66230 WTURN6=0.00000 &
1803 WVDWPP=0.11371 WHPB=1.00000 &
1804 CUTOFF=7.00000 WCORR4=0.00000
1809 WLONG=1.70905 WSCP=2.18310 WELEC=1.06684 WBOND=1.00000 WANG=1.17536 &
1810 WSCLOC=0.22070 WTOR=2.65798 WTORD=2.00646 WCORRH=0.23541 WCORR5=0.00000 &
1811 WCORR6=0.00000 WEL_LOC=0.42789 WTURN3=1.68126 WTURN4=0.75080 WTURN6=0.00000 &
1812 WVDWPP=0.27044 WHPB=1.00000 WSCP14=0.00000 &
1813 CUTOFF=7.00000 WCORR4=0.00000
1818 WLONG=1.00000 WSCP=1.23315 WELEC=0.84476 WBOND=1.00000 WANG=0.62954 &
1819 WSCLOC=0.10554 WTOR=1.84316 WTORD=1.26571 WCORRH=0.19212 WCORR5=0.00000 &
1820 WCORR6=0.00000 WEL_LOC=0.37357 WTURN3=1.40323 WTURN4=0.64673 WTURN6=0.00000 &
1821 WVDWPP=0.23173 WHPB=1.00000 WSCCOR=0.0 &
1822 CUTOFF=7.00000 WCORR4=0.00000
1825 \subsubsection{Input and/or reference PDB file name}
1826 \label{sect:input:main:PDB}
1828 (Text format; subroutine MOLREAD.)
1830 If PDBSTART or PDBREF was specified in the control card, this line contains
1831 the PDB file name. Trailing slashes to specify the full path are permitted.
1832 The file name can contain up to 64 characters.
1834 \subsubsection{Amino-acid sequence}
1835 \label{sect:input:main:sequence}
1839 This data appears, if PDBSTART was not specified, otherwise must not be present
1840 because the sequence would be taken from the PDB file. The first line contains
1841 the number of amino-acid residues, including the end groups (free format),
1842 the next lines contain the sequence in 20(1X,A3) format for the three-letter
1843 or 80A1 format for the one-letter code. There are two types of end-groups:
1844 Gly (three-letter code) or G (one-letter code), if an end group contains a full
1845 peptide bond (e.g., the acetyl N-terminal group or the carboxyamide C-terminal
1846 group) and D (in the three-letter code) or X (in the one-letter code), if the
1847 end group does not contain a peptide group (e.g., the NH2 N-terminal end group
1848 or the COOH C-terminal end group). (Note the Gly or G also denotes the regular
1849 glycine residue, if found in the middle of a chain).
1850 In the second case the end group is considered as a ``dummy'' group and serves
1851 only to define the first (last) virtual-bond dihedral angle $\gamma$ for the
1852 first (last) full amino-acid residue.
1854 Consider, for example, the Ac-Ala(19)-NHMe polypeptide. The three-letter code
1855 input will look like this:
1859 Gly Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala
1863 And the one-letter code input will be:
1867 GAAAAAAAAAAAAAAAAAAAG
1870 If the sequence is changed to NH3(+)-Ala(19)-COO(-), the inputs will look
1875 D Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala
1883 XAAAAAAAAAAAAAAAAAAAX
1886 The sequence input is case-insensitive, because the present version of UNRES
1887 considers each amino-acid residue as an L-residue (there are no torsional
1888 parameters for the combinations of the D- and L-residues yet). Furthermore,
1889 each peptide group is considered as a trans group.
1891 If the version of UNRES has multi-chain capacity, placing a dummy residue
1892 inside the sequence indicates start of a new chain. For example, a system
1893 composed of two Ala(10) chains can be specified as follows (3-letter code):
1897 D Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala D Ala Ala Ala Ala Ala Ala Ala Ala
1905 XAAAAAAAAAAXAAAAAAAAAAX
1908 \subsubsection{Disulfide-bridge information}
1909 \label{sect:input:main:disulphide}
1911 (Free format; subroutine READ\_BRIDGE.)
1919 \item{NS} -- the number of half-cystines (required even if no half-cystines are present).
1921 \item{ISS(i)} -- the position of ith half-cystine in the sequence (starting from the
1922 N-terminal end group)
1926 Next line(s) (present only, if $ns>0$ and must not appear otherwise):
1928 NSS,(IHPB(i),JHPB(i),i=1,NSS)
1932 \item{NSS} -- the number of disulfide bridges; must not be greater than NS/2.
1934 \item{IHPB(i),JHPB(i)} -- the cystine residue forming the ith bridge.
1938 The program will check, whether the residues specified in the ISS list
1939 are cystines and terminate with error, if any of them is not. The program
1940 also checks, if the numbers from the IHPB and the JHPB lists have appeared
1943 \subsubsection{Dihedral-angle restraint data}
1944 \label{sect:input:main:dihedral-restraints}
1946 (Free format; subroutine MOLREAD.)
1948 This set of data specifies the harmonic constraints (if any) imposed on selected
1949 virtual-bond dihedral angles $\gamma$.
1955 \item{NDIH\_CONSTR} -- the number of restrained $\gamma$ angles (required even if no
1956 restrains are applied).
1960 2nd line (present only, if NDIH\_CONSTR$>$0; must not appear otherwise):
1961 FTORS - the force constant expressed in kcal/(mol*rad**2)
1963 next NDIH\_CONSTR lines (present only, if NDIH\_CONSTR$>$0):
1965 IDIH\_CONSTR(i),PHI0(i),DRANGE(i)
1969 \item{IDIH\_CONSTR(i)} -- the number of ith restrained $\gamma$ angle. The angles are
1970 numbered after the LAST $\alpha$-carbons. Thus, the first ``real'' angle has number
1971 4 and it corresponds to the rotation about the CA(2)-CA(3) virtual-bond axis
1972 and the last angle has the number NRES and corresponds to the rotation about
1973 the CA(NRES-2)-CA(NRES-1) virtual-bond axis.
1975 \item{PHI0(i)} -- the ``center'' of the restraint (expressed in degrees).
1977 \item{DRANGE(i)} -- the ``flat well'' range of the restraint (in degrees).
1981 The restraint energy for the ith restrained angle is expressed as:
1984 E_{dih} = \begin{cases}
1985 \rm FTORS\times(\gamma_{IDIH\_CONSTR(i)}-PHI0(i)+DRANGE(i))^2&\mbox{if}\ \ \rm \gamma_{IDIH\_CONSTR(i)}\\
1986 &<PHI0(i)+DRANGE(i)\\
1988 0 &\rm if\ \ PHI0(i)-DRANGE(i) \\
1989 &\le \gamma_{IDIH\_CONSTR(i)} \\
1990 &\le PHI0(i)+DRANGE(i)\\
1992 \rm FTORS\times(\gamma_{IDIH\_CONSTR(i)}-PHI0(i)+DRANGE(i))^2&\mbox{if}\ \ \rm \gamma_{IDIH\_CONSTR(i)}\\
1997 Applying dihedral-angle constraints also implies that for ith constrained
1998 $\gamma$ angle the sampling be carried out from the
1999 [PHI0(i)-DRANGE(i)..PHI0(i)+DRANGE(i)] interval and not from the $[-\pi..\pi]$
2000 interval, if random conformations are generated. If only this and not
2001 restrained minimization is required, just set FTORS to 0.
2003 \subsubsection{Distance restraints}
2004 \label{sect:input:main:disance-restraints}
2006 (Mixed format; subroutine READ\_DIST\_CONSTR.)
2008 Restraints are imposed on C$^\alpha\cdots$C$^\alpha$ SC$\cdots$SC distances (C$^\beta\cdots$C$^\beta$.
2012 \item{NDIST=number} (integer) (0) -- number of restraints on specific distances.
2014 \item{NFRAG=number} (integer) (0) -- number of distance-restrained protein segments.
2016 \item{NPAIR=number} (integer) (0) -- number of distance-restrained pairs of segments.
2017 Specifying NPAIR requires specification of segments.
2019 \item{IFRAG=start(1),end(1),start(2),end(2)...start(NFRAG),end(NFRAG)} (integers) --
2020 First and last residues of the distance restrained segments.
2022 \item{WFRAG=w(1),w(2),...,w(NFRAG) (reals)} -- force constants or bases for force
2023 constant calculation corresponding to fragment restraints.
2025 \item{IPAIR=start(1),end(1),start(2),end(2),...,start(NPAIR),end(NPAIR)} (integers)
2026 -- numbers of segments (consecutive numbers of start or end pairs in IFRAG
2027 specification), the distances between which will be restrained.
2029 \item{WPAIR=w(1),w(2),...,w(NFRAG)} (reals) -- force constants or bases for force
2030 constant calculation corresponding to pair restraints.
2032 \item{DIST\_CUT=number} (real) (5.0) -- the cut-off distance in angstroms for force-
2033 constant calculations.
2035 The force constants within fragments/between pairs of fragments are calculated
2036 depending on the value of DIST\_CONSTR described in section 5.1:
2040 \item{1} -- all force constants are equal to the respective entries of WFRAG/WPAIR
2042 \item{2} -- the force constants are equal to the respective entries of WFRAG/WPAIR
2043 when the distance between the C$^\alpha$ atoms in the reference structure
2044 $\le$D\_CUT, 0 otherwise.
2046 \item{3} -- the force constants are calculated from the formula:
2050 \item{$k(C^\alpha_j,C^\alpha_k)=W\times\exp{-[d(C^\alpha_j,C^\alpha_k)/DIST\_CUT)]^2/2}$}
2052 where $k(C^\alpha_j,C^\alpha_k)$ is the force constant between the respective C$^\alpha$ atoms,
2053 $d(C^\alpha_j,C^\alpha_k)$ is the distance between these C$^\alpha$ atoms in the reference
2054 structure, and W is the basis for force-constant calculation (see above).
2058 The above restraints are harmonic resatraints of the form
2061 E_{dis} = \sum_i k_i \left(d_i - d_i^{ref}\right)^2
2064 where $d_i$ is the distance in the calculated structure and $d_i^{ref}$ is the respective
2065 distance in the reference (PDB) structure. The reference structure is required.
2067 If NDIST$>$0, the restraints on specific distance are input explicitly (no reference structure is requires).
2068 The restraints are quartic restraints of a similar form as that in section
2069 \ref{sect:input:main:dihedral-restraints} but with angles replaced with distances.
2071 ihpb(i), jhpb(i), dhpb(i), dhpb1(i), ibecarb(i), forcon(i), i=1,NDIST
2075 \item{ihpb(i)} and jhpb(i) are the numbers of the residues the distance
2076 between the C$^\alpha$ atoms of which will be distance restrained,
2078 \item{dhpb(i)} and dhpb1(i) are the lower and upper distance-restraint,
2080 \item{ibecarc(i)} is the restraint-type flag;
2081 ibecarb(i)==0 indicates that the restraints are imposed on the
2082 C$^\alpha\cdots$C$^\alpha$ distances; otherwise restraints on the
2083 SC$\cdots$SC distances are imposed,
2086 is the respective force constant.
2090 \subsubsection{Internal coordinates of the reference structure}
2091 \label{sect:input:main:internalref}
2093 (Free format; subroutine READ\_ANGLES.)
2095 This part of the data is present, if REFSTR, but not PDBREF was specified,
2096 otherwise must not appear. It contains the following group of variables:
2099 \item{(THETA(i),i=3,NRES)} -- the virtual-bond valence angles THETA.
2100 \item{(PHI(i),i=4,NRES)} -- the virtual-bond dihedral angles GAMMA.
2101 \item{(ALPH(i),i=2,NRES-1)} -- the ALPHA polar angles of consecutive side chains.
2102 \item{(OMEG(i),i=2,NRES-1)} -- the BETA polar angles of consecutive side chains.
2105 ALPHA(i) and OMEG(i) correspond to the side chain attached to CA(i). THETA(i)
2106 is the CA(i-2)-CA(i-1)-CA(i) virtual-bond angle and PHI(i) is the
2107 CA(i-3)-CA(i-2)-CA(i-1)-CA(i) virtual-bond dihedral angle $\gamma$.
2109 \subsubsection{Internal coordinates of the initial conformation}
2110 \label{sect:input:main:intcoord}
2112 (Free format; subroutine READ\_ANGLES.)
2114 This part of the data is present, if RAND\_CONF, MULTCONF, THREAD, or PDBSTART
2115 were not specified, otherwise must not appear. This input is as in section \ref{sect:support}.
2117 \paragraph{File name with internal coordinates of the conformations to be processed}
2118 \label{sect:input:main:intcord:files}
2120 (Text format; subroutine MOLREAD.)
2122 This data is present only, if MULTCONF was specified. It contains the name of
2123 the file with the internal coordinates. Up to 64 characters are allowed.
2124 The structure of the file is that of the *.int file produced by UNRES/CSA.
2125 See section ``The structure of the INT files'' for details.
2127 \subsubsection{Control data for energy map construction}
2128 \label{sect:input:main:map}
2130 (Data list format; subroutine MAP\_READ.)
2132 These data lists appear, if NMAP=n was specified, where n is the number of
2133 variables that will be grid-searched. One list is per one variable or a
2134 group of variables set equal (see below):
2137 \item{PHI} -- the variable is a virtual-bond dihedral angle $\gamma$.
2138 \item{THE} -- the variable is a virtual-bond angle $\theta$.
2139 \item{ALP} -- the variable is a side-chain polar angle $\alpha$.
2140 \item{OME} -- the variable is a side-chain polar angle $\beta$.
2144 \item{RES1=number} (integer)
2145 \item{RES2=number} (integer)
2148 The range of residues for which the values will be set; all these variables
2149 will be set at the same value. It is required that RES2$>$RES1.
2152 \item{FROM=angle} (real)
2153 \item{TO=angle} (real)
2156 Lower and upper limit of scanning in grid search (in degrees)
2159 \item{NSTEP=number} (integer)
2162 Number of steps in scanning along this variable/group of variables.
2164 \subsection{Input coordinate files}
2165 \label{sect:input:coordfiles}
2167 (Text format; subroutine MOLREAD.)
2169 At present, geometry can be input either from the external files in the PDB
2170 format (with the PDBSTART option) or multiple conformations can be read
2171 as virtual-bond-valence and virtual-bond dihedral angles when the MULTCONF
2172 option is used (the latter, however, implies using standard virtual-bond
2173 lengths as initial values). The structure of internal-coordinate files
2174 is the same as that of output internal-coordinate files described in section
2177 \subsection{Other input files}
2178 \label{sect:input:otherfiles}
2180 CSA parameters can optionally be read in free format from file INPUT.CSA.in
2181 (see section 8.1.4). When a CSA run is restarted, the CSA-specific output files
2182 also serve as input files. INPUT is the prefix of input and output files
2183 as explained in section \ref{sect:command}.
2185 Restart files for MD and REMD simulations. They are read when the keyword
2186 RESTART appears on the MD/REMD data group (section \ref{sect:input:main:MD}).
2190 \section{OUTPUT FILES}
2193 UNRES ``main'' output files (INPUT.out\_\$\{POT\}[processor]) are log files from
2194 a run. They contain the information of the molecule, force field, calculation
2195 type, control parameters, etc.; however, not the structures produced during
2196 the run or their energies except single-point energy evaluation and
2197 minimization-related runs. The structural information is included in
2198 coordinate files (*.int, *.x, *.pdb, *.mol2, *.cx) and statistics files (*.stat),
2199 respectively; these files are further processed by other programs (WHAM,
2200 CLUSTER) or can be viewed by molecular viewers (pdb or mol2 files).
2202 \subsection{Coordinate files}
2203 \label{sect:output:coord}
2205 \subsubsection{The internal coordinate (INT) file}
2206 \label{sect:output:coord:int}
2208 This file contains the internal coordinates of the conformations produced
2209 by UNRES in non-MD runs. The virtual-bond lengths are assumed constant so
2210 only the angular variables are provided.
2212 IT,ENER,NSS,(IHPB(I),JHPB(I),I=1,NSS)\\
2213 (I5,F12.5,I2,9(1X,2I3))
2216 \item{IT} -- the number of the conformation.
2217 \item{ENER} -- total energy.
2218 \item{NSS} -- the number of disulfide bridges.
2219 \item{(IHPB(I),JHPB(I),I=1,NSS)} -- the positions of the pairs of half-cystines .
2220 forming the bridges. If NSS$>9$9, the remaining pairs are written in the
2221 following lines in the (3X,11(1X,2I3)) format.
2224 (THETA(I),I=3,NRES)\\
2227 The virtual-bond angles THETA (in degrees)
2232 The virtual-bond dihedral angles GAMMA (in degrees)
2234 (ALPH(I),I=2,NRES-1)\\
2235 (OMEG(I),I=2,NRES-1)\\
2238 The polar angles ALPHA and BETA of the side-chain centers (in degrees).
2240 \subsubsection{The plain Cartesian coordinate (X) files}
2241 \label{sect:output:coord:cart}
2243 (Subroutine CARTOUT.)
2245 This file contains the Cartesian coordinates of the $\alpha$-carbon and
2246 side-chain-center coordinates. All conformations from an MD/MREMD
2247 trajectory are collated to a single file. The structure of each
2248 conformation's record is as follows:
2250 1st line: time, potE, uconst, t\_bath,nss, (ihpb(j), jhpb(j), j=1,nss),
2251 nrestr, (qfrag(i), i=1,nfrag), (qpair(i), i=1,npair),
2252 (utheta(i), ugamma(i), uscdiff(i), i=1,nfrag\_back)
2255 \item{time:} MD time (in ``molecular time units'' 1 mtu = 4.89 fs),
2256 \item{potE:} potential energy,
2257 \item{uconst:} restraint energy corresponding to restraints on Q and backbone geometry,
2258 (see section \ref{sect:input:main:MD}),
2259 \item{t\_bath:} thermostat temperature,
2260 \item{nss:} number of disulfide bonds,
2261 \item{ihpb(j), jhpb(j):} the numbers of linked cystines for jth disulfide bond,
2262 \item{nrestr:} number of restraints on q and local geometry,
2263 \item{qfrag(i):} q value for ith fragment,
2264 \item{qpair(i):} q value for ith pair,
2265 \item{utheta(i):} sum of squares of the differences between the theta angles
2266 of the current conformation from those of the experimental conformation,
2267 \item{ugamma(i):} sum of squares of the differences beaten the gamma angles
2268 of the current conformation from those of the experimental conformation,
2269 \item{uscdiff(i):} sum of squares of the differences between the Cartesian difference
2270 of the unit vector of the C$^\alpha$-SC axis of the current conformation from
2271 those of the experimental conformation.
2274 Next lines: Cartesian coordinates of the C$^\alpha$ atoms (including dummy atoms)
2275 (sequentially, 10 coordinates per line)
2276 Next lines: Cartesian coordinates of the SC atoms (including glycines and
2277 dummy atoms) (sequentially, 10 coordinates per line)
2279 \subsubsection{The compressed Cartesian coordinate (CX) files}
2280 \label{sect:output:coord:cx}
2282 These files are compressed binary files (extension cx). For each conformation,
2283 the items are written in the same order as specified in section \ref{sect:output:coord:cx}. For
2284 MREMD runs, if TRAJ1FILE is specified on MREMD record (see section \ref{sect:input:main:MD}),
2285 snapshots from all trajectories are written every time the coordinates
2286 are dumped. Thus, the file contains snapshot 1 from trajectory 1, ...,
2287 snapshot 1 from trajectory M, snapshot 2 from trajectory 1, ..., etc.
2289 The compressed cx files can be converted to pdb file by using the xdrf2pdb
2290 auxiliary program (single trajectory files) or xdrf2pdb-m program (multiple
2291 trajectory files from MREMD runs generated by using the TRAJ1FILE option).
2292 The multiple-trajectory cx files are also input files for the auxiliary
2295 \subsubsection{The Brookhaven Protein Data Bank format (PDB) files}
2296 \label{sect:output:coord:PDB}
2298 (Subroutine PDBOUT.)
2301 These files are written in PDB standard (see. e.g.,
2302 \href{ftp://ftp.wwpdb.org/pub/pdb/doc/format_descriptions/Format_v33_Letter.pdf}{\textcolor{blue}{ftp://ftp.wwpdb.org/pub/pdb\-/doc/\-format\_descriptions}}). %\-/Format\_v33\_Letter.pdf}.
2303 The REMARK, ATOM, SSBOND, HELIX, SHEET, CONECT, TER, and ENDMDL are used.
2304 The C$^\alpha$ (marked CA) and SC (marked CB) coordinates are output. The CONECT
2305 records specify the C$^\alpha\cdots$C$^\alpha$ and C$^\alpha\cdots$SC virtual bonds. Secondary
2306 structure is detected based on peptide-group contacts, as specified in
2307 ref 12. Dummy residues are omitted from the output. If the program has
2308 multiple-chain function, the presence of a dummy residue in a sequence
2309 starts a new chain, which is assigned the next alphabet letter as ID, and
2310 residue numbering is started over.
2312 \subsubsection{The SYBYLL (MOL2) files}
2313 \label{sect:output:coord:subyll}
2315 See the description of mol2 format (e.g.,
2316 \href{http://tripos.com/data/support/mol2.pdf}{http://tripos.com/data/support/mol2.pdf}.
2317 Similar remarks apply as for
2318 the PDB format (section \ref{sect:output:coord:PDB}).
2320 \subsection{The summary (STAT) file}
2322 \subsubsection{Non-MD runs}
2324 This file contains a short summary of the quantities characterizing the
2325 conformations produced by UNRES/CSA. It is created for MULTCONF and MCM.
2327 NOUT,EVDW,EVDW2,EVDW1+EES,ECORR,EBE,ESCLOC,ETORS,ETOT,RMS,FRAC\\
2331 \item{NOUT} -- the number of the conformations
2332 \item{EVDW,EVDW2,EVDW1+EES,ECORR,EBE,ESCLOC,ETORS} -- energy components
2333 \item{ETOT} -- total energy
2334 \item{RMS} -- RMS deviation from the reference structure (if REFSTR was specified)
2335 \item{FRAC} -- fraction of side chain - side chain contacts of the reference
2336 structure present in this conformation (if REFSTR was specified)
2339 \subsubsection{MD and MREMD runs}
2340 \label{sect:output:coord:MD}
2342 Each line of the stat file generated by MD/MREMD runs contains the following
2346 \item{step} -- the number of the MD step
2347 \item{time} -- time [unit is MTU (molecular time unit) equal to 48.9 fs]
2348 \item{Ekin} -- kinetic energy [kcal/mol]
2349 \item{Epot} -- potential energy [kcal/mol]
2350 \item{Etot} -- total energy (Ekin+Epot)
2351 \item{H-H0} -- the difference between the cureent and initial extended Hamiltionian
2352 in Nose-Hoover or Nose-Poincare runs; not present for other thermostats.
2353 \item{RMSD} -- root mean square deviation from the reference structure (only in
2354 REFSTR has been specified)
2355 item{damax} -- maximum change of acceleration between two MD steps
2356 \item{fracn} -- fraction of native side-chain concacts (very crude, based on
2357 SC-SC distance only)
2358 \item{fracnn} -- fraction of non-native side-chain contacts
2359 \item{co} -- contact order
2360 \item{temp} -- actual temperature [K]
2361 \item{T0} -- initial (microcanonical runs) or thermostat (other run types)
2363 \item{Rgyr} -- radius of gyration based on C$^\alpha$ coordinates [A]
2364 \item{proc} -- in MREMD runs the number of the processor (the number of the
2365 trajectory less 1); not present for other runs.
2368 For an USAMPL run, the following items follow the above list:
2371 \item{iset} -- the number of the restraint set
2372 \item{uconst} -- restraint energy pertaining to q-values
2373 \item{uconst\_back} -- restraint energy pertaining to virtual-backbone restraints
2374 \item{(qfrag(i),i=1,nfrag)} -- q values of the specified fragments
2375 \item{(qpair(ii2),ii2=1,npair)} -- q values of the specified pairs of fragments
2376 \item{(utheta(i),ugamma(i),uscdiff(i),i=1,nfrag\_back)} -- virtual-backbone and
2377 side-chain-rotamer restraint energies of the fragments specified
2380 If PRINT\_COMPON has been specified, the energy components are printed
2381 after the items described above.
2383 \subsection{CSA-specific output files}
2384 \label{sect:output:coord:CSA}
2386 There are several output files from the CSA routine:
2387 INPUT.CSA.seed, INPUT.CSA.history, INPUT.CSA.bank, INPUT.CSA.bank1,
2388 INPUT.CSA.rbank INPUT.CSA.alpha, INPUT.CSA.alpha1.
2390 The most informative outfile is INPUT.CSA.history. This file first write down
2391 the parameters in INPUT.CSA.csa file. Later it shows the energies of random
2392 minimized conformations in its generation. After sorting the First\_bank
2393 in energy (ascending order), the energies of the First\_bank is re-written here.
2394 After this the output looks like:
2397 1 0 100 6048.2 1 100-224.124-114.346 202607 100 100
2398 1 0 700 5882.6 2 29-235.019-203.556 1130308 100 100
2399 1 0 1300 5721.5 2 18-242.245-212.138 2028008 100 100
2400 1 0 1900 5564.8 13 54-245.185-218.087 2897988 98 100
2401 1 0 2500 5412.4 13 61-246.214-222.068 3706478 97 100
2402 1 0 3100 5264.2 13 89-248.715-224.939 4514196 96 100
2405 Each line is written between each iteration (just after selection
2406 of seed conformations) containing following data:
2407 jlee,icycle,nstep,cutdif,ibmin,ibmax,ebmin,ebmax,nft,iuse,nbank
2408 ibmin and ibmax lists the index of bank conformations corresponding to the
2409 lowest and highest energies with ebmin and ebmax.
2410 nft is the total number of function evaluations so far.
2411 iuse is the total number of conformations which have not been used as seeds
2412 prior to calling subroutine select\_is which select seeds.
2414 Therefore, in the example shown above, one notes that so far 3100
2415 minimizations has been performed corresponding to the total of 4514196
2416 function evaluations. The lowest and highest energy in the Bank is
2417 -248.715 (\#13) and -224.939 (\#89), respectively. The number of conformations
2418 already used as seeds (not including those selected as seeds in this iteration)
2419 so far is 4 (100-96).
2421 The files INPUT.CSA.bank and INPUT.CSA.rbank contains data of Bank and
2422 First\_bank. For more information on these look subroutines write\_bank
2423 and write\_rbank. The file INPUT.CSA.bank is overwritten between each
2424 iteration whereas Bank is accumulated in INPUT.CSA.bank1 (not for every
2425 iteration but as specified in the subroutine together.f).
2427 The file INPUT.CSA.seed lists the index of the seed conformations with their
2428 energies. Files INPUT.CSA.alpha, INPUT.CSA.alpha1 are written only once
2429 at the beginning of the CSA run. These files contain some arrays used
2434 \section{TECHNICAL SUPPORT CONTACT INFORMATION}
2435 \label{sect:support}
2438 Faculty of Chemistry, University of Gdansk\\
2439 ul. Wita Stwosza 63, 80-308 Gdansk Poland.\\
2440 phone: +48 58 523 5124\\
2441 fax: +48 58 523 5012\\
2442 e-mail: \href{mailto:adam@sun1.chem.univ.gda.pl}{adam@sun1.chem.univ.gda.pl}\\
2444 Dr. Cezary Czaplewski\\
2445 Faculty of Chemistry, University of Gdansk\\
2446 ul. Wita Stwosza 63, 80-308 Gdansk Poland.\\
2447 phone: +48 58 523 5126\\
2448 fax: +48 58 523 5012\\
2449 e-mail: \href{mailto:cezary.czaplewski@ug.edu.pl}{cezary.czaplewski@ug.edu.pl}\\
2451 Dr. Adam Sieradzan\\
2452 Faculty of Chemistry, University of Gdansk\\
2453 ul. Wita Stwosza 63, 80-308 Gdansk Poland.\\
2454 phone: +48 58 523 5124\\
2455 fax: +48 58 523 5012\\
2456 e-mail: \href{mailto:adasko@sun1.chem.univ.gda.pl}{adasko@sun1.chem.univ.gda.pl}\\
2458 Dr. Stanislaw Oldziej\\
2459 Intercollegiate Faculty of Biotechnology\\
2460 University of Gdansk, Medical University of Gdansk\\
2461 ul. Kladki 22, 80-922 Gdansk, Poland\\
2462 phone: +48 58 523 5361\\
2463 fax: +48 58 523 5472\\
2464 e-mail: \href{mailto:stan@biotech.ug.edu.pl}{stan@biotech.ug.edu.pl}\\
2467 Korea Institute for Advanced Study\\
2468 207-43 Cheongnyangni 2-dong, Dongdaemun-gu,\\
2469 Seoul 130-722, Korea\\
2470 phone: +82-2-958-3890\\
2471 fax: +82-2-958-3731\\
2472 email: \href={mailto:jlee@kias.re.kr}{jlee@kias.re.kr}
2475 Prepared by Adam Liwo and Jooyoung Lee, 7/17/99\\
2476 Revised by Cezary Czaplewski 1/4/01\\
2477 Revised by Cezary Czaplewski and Adam Liwo 8/26/03\\
2478 Revised by Cezary Czaplewski and Adam Liwo 11/26/11\\
2479 Revised by Adam Liwo 02/19/12\\
2480 LaTeX version by Adam Liwo 09/25/12\\
2481 revised by Adam Liwo 12/04/14