Execution Configuration Options¶
Nipype gives you many liberties on how to create workflows, but the execution of them uses a lot of default parameters. But you have of course all the freedom to change them as you like.
Nipype looks for the configuration options in the local folder under the name nipype.cfg
and in ~/.nipype/nipype.cfg
(in this order). It can be divided into Logging and Execution options. A few of the possible options are the following:
Logging¶
workflow_level
: How detailed the logs regarding workflow should be
(possible values:INFO
andDEBUG
; default value:INFO
)utils_level
: How detailed the logs regarding nipype utils, like file operations (for example overwriting warning) or the resource profiler, should be
(possible values:INFO
andDEBUG
; default value:INFO
)interface_level
: How detailed the logs regarding interface execution should be
(possible values:INFO
andDEBUG
; default value:INFO
)filemanip_level
(deprecated as of 1.0): How detailed the logs regarding file operations (for example overwriting warning) should be
(possible values:INFO
andDEBUG
)log_to_file
: Indicates whether logging should also send the output to a file
(possible values:true
andfalse
; default value:false
)log_directory
: Where to store logs.
(string, default value: home directory)log_size
: Size of a single log file.
(integer, default value: 254000)log_rotate
: How many rotations should the log file make.
(integer, default value: 4)
Execution¶
plugin
: This defines which execution plugin to use.
(possible values:Linear
,MultiProc
,SGE
,IPython
; default value:Linear
)stop_on_first_crash
: Should the workflow stop upon the first node crashing or try to execute as many nodes as possible?
(possible values:true
andfalse
; default value:false
)stop_on_first_rerun
: Should the workflow stop upon the first node trying to recompute (by that we mean rerunning a node that has been run before - this can happen due changed inputs and/or hash_method since the last run).
(possible values:true
andfalse
; default value:false
)hash_method
: Should the input files be checked for changes using their content (slow, but 100% accurate) or just their size and modification date (fast, but potentially prone to errors)?
(possible values:content
andtimestamp
; default value:timestamp
)keep_inputs
: Ensures that all inputs that are created in the nodes working directory are kept after node execution
(possible values:true
andfalse
; default value:false
)single_thread_matlab
: Should all of the Matlab interfaces (including SPM) use only one thread? This is useful if you are parallelizing your workflow using MultiProc or IPython on a single multicore machine.
(possible values:true
andfalse
; default value:true
)display_variable
: Override the$DISPLAY
environment variable for interfaces that require an X server. This option is useful if there is a running X server, but$DISPLAY
was not defined in nipype’s environment. For example, if an X server is listening on the default port of 6000, setdisplay_variable = :0
to enable nipype interfaces to use it. It may also point to displays provided by VNC, xnest or Xvfb.
If neitherdisplay_variable
nor the$DISPLAY
environment variable is set, nipype will try to configure a new virtual server using Xvfb.
(possible values: any X server address; default value: not set)remove_unnecessary_outputs
: This will remove any interface outputs not needed by the workflow. If the required outputs from a node changes, rerunning the workflow will rerun the node. Outputs of leaf nodes (nodes whose outputs are not connected to any other nodes) will never be deleted independent of this parameter.
(possible values:true
andfalse
; default value:true
)try_hard_link_datasink
: When the DataSink is used to produce an organized output file outside of nipypes internal cache structure, a file system hard link will be attempted first. A hard link allows multiple file paths to point to the same physical storage location on disk if the conditions allow. By referring to the same physical file on disk (instead of copying files byte-by-byte) we can avoid unnecessary data duplication. If hard links are not supported for the source or destination paths specified, then a standard byte-by-byte copy is used.
(possible values:true
andfalse
; default value:true
)use_relative_paths
: Should the paths stored in results (and used to look for inputs) be relative or absolute. Relative paths allow moving the whole working directory around but may cause problems with symlinks.
(possible values:true
andfalse
; default value:false
)local_hash_check
: Perform the hash check on the job submission machine. This option minimizes the number of jobs submitted to a cluster engine or a multiprocessing pool to only those that need to be rerun.
(possible values:true
andfalse
; default value:true
)job_finished_timeout
: When batch jobs are submitted through, SGE/PBS/Condor they could be killed externally. Nipype checks to see if a results file exists to determine if the node has completed. This timeout determines for how long this check is done after a job finish is detected. (float in seconds; default value: 5)remove_node_directories
(EXPERIMENTAL): Removes directories whose outputs have already been used up. Doesn’t work with IdentiInterface or any node that patches data through (without copying)
(possible values:true
andfalse
; default value:false
)stop_on_unknown_version
: If this is set to True, an underlying interface will raise an error, when no version information is available. Please notify developers or submit a patch.parameterize_dirs
: If this is set to True, the node’s output directory will contain full parameterization of any iterable, otherwise parameterizations over 32 characters will be replaced by their hash.
(possible values:true
andfalse
; default value:true
)poll_sleep_duration
: This controls how long the job submission loop will sleep between submitting all pending jobs and checking for job completion. To be nice to cluster schedulers the default is set to 2 seconds.xvfb_max_wait
: Maximum time (in seconds) to wait for Xvfb to start, if the _redirect_x parameter of an Interface is True.crashfile_format
: This option controls the file type of any crashfile generated. Pklz crashfiles allow interactive debugging and rerunning of nodes, while text crashfiles allow portability across machines and shorter load time.
(possible values:pklz
andtxt
; default value:pklz
)
Resource Monitor¶
enabled
: Enables monitoring the resources occupation (possible values:true
andfalse
; default value:false
). All the following options will be dismissed if the resource monitor is not enabled.sample_frequency
: Sampling period (in seconds) between measurements of resources (memory, cpus) being used by an interface
(default value:1
)summary_file
: Indicates where the summary file collecting all profiling information from the resource monitor should be stored after execution of a workflow. Thesummary_file
does not apply to interfaces run independently. (unset by default, in which case the summary file will be written out to<base_dir>/resource_monitor.json
of the top-level workflow).summary_append
: Append to an existing summary file (only applies to workflows).
(default value:true
, possible values:true
orfalse
).
Example¶
[logging]
workflow_level = DEBUG
[execution]
stop_on_first_crash = true
hash_method = timestamp
display_variable = :1
[monitoring]
enabled = false
Workflow.config
property has a form of a nested dictionary reflecting the structure of the .cfg
file.
from nipype import Workflow
myworkflow = Workflow(name='myworkflow')
myworkflow.config['execution'] = {'stop_on_first_rerun': 'True',
'hash_method': 'timestamp'}
You can also directly set global config options in your workflow script. An example is shown below. This needs to be called before you import the pipeline or the logger. Otherwise, logging level will not be reset.
from nipype import config
cfg = dict(logging=dict(workflow_level = 'DEBUG'),
execution={'stop_on_first_crash': False,
'hash_method': 'content'})
config.update_config(cfg)
Enabling logging to file¶
By default, logging to file is disabled. One can enable and write the file to a location of choice as in the example below.
import os
from nipype import config, logging
config.update_config({'logging': {'log_directory': os.getcwd(),
'log_to_file': True}})
logging.update_logging(config)
The logging update line is necessary to change the behavior of logging such as output directory, logging level, etc.
Debug configuration¶
To enable debug mode, one can insert the following lines:
from nipype import config
config.enable_debug_mode()
In this mode the following variables are set:
config.set('execution', 'stop_on_first_crash', 'true')
config.set('execution', 'remove_unnecessary_outputs', 'false')
config.set('execution', 'keep_inputs', 'true')
config.set('logging', 'workflow_level', 'DEBUG')
config.set('logging', 'interface_level', 'DEBUG')
config.set('logging', 'utils_level', 'DEBUG')
The primary loggers (workflow
, interface
and utils
) are also reset to level DEBUG
.
You may wish to adjust these manually using:
from nipype import logging
logging.getLogger(<logger>).setLevel(<level>)
Global, workflow & node level¶
The configuration options can be changed globally (i.e. for all workflows), for just a workflow, or for just a node. The implementations look as follows (note that you should first create directories if you want to change crashdump_dir
and log_directory
):
At the global level:¶
from nipype import config, logging
import os
os.makedirs('/output/log_folder', exist_ok=True)
os.makedirs('/output/crash_folder', exist_ok=True)
config_dict={'execution': {'remove_unnecessary_outputs': 'true',
'keep_inputs': 'false',
'poll_sleep_duration': '60',
'stop_on_first_rerun': 'false',
'hash_method': 'timestamp',
'local_hash_check': 'true',
'create_report': 'true',
'crashdump_dir': '/output/crash_folder',
'use_relative_paths': 'false',
'job_finished_timeout': '5'},
'logging': {'workflow_level': 'INFO',
'filemanip_level': 'INFO',
'interface_level': 'INFO',
'log_directory': '/output/log_folder',
'log_to_file': 'true'}}
config.update_config(config_dict)
logging.update_logging(config)
At the workflow level:¶
from nipype import Workflow
wf = Workflow(name="config_test")
# Change execution parameters
wf.config['execution']['stop_on_first_crash'] = 'true'
# Change logging parameters
wf.config['logging'] = {'workflow_level' : 'DEBUG',
'filemanip_level' : 'DEBUG',
'interface_level' : 'DEBUG',
'log_to_file' : 'True',
'log_directory' : '/output/log_folder'}
At the node level:¶
from nipype import Node
from nipype.interfaces.fsl import BET
bet = Node(BET(), name="config_test")
bet.config = {'execution': {'keep_unnecessary_outputs': 'false'}}