ansible/docs/docsite/rst/playbooks_error_handling.rst

149 lines
5.2 KiB
ReStructuredText
Raw Normal View History

2013-09-30 01:10:28 +02:00
Error Handling In Playbooks
===========================
2012-05-13 17:00:02 +02:00
.. contents:: Topics
2013-10-03 04:03:15 +02:00
Ansible normally has defaults that make sure to check the return codes of commands and modules and
it fails fast -- forcing an error to be dealt with unless you decide otherwise.
2013-11-16 23:01:26 +01:00
2017-01-14 02:55:19 +01:00
Sometimes a command that returns different than 0 isn't an error. Sometimes a command might not always
need to report that it 'changed' the remote system. This section describes how to change
the default behavior of Ansible for certain tasks so output and error handling behavior is
as desired.
.. _ignoring_failed_commands:
Ignoring Failed Commands
````````````````````````
Generally playbooks will stop executing any more steps on a host that has a task fail.
Sometimes, though, you want to continue on. To do so, write a task that looks like this::
- name: this will not be counted as a failure
2013-07-15 19:50:48 +02:00
command: /bin/false
ignore_errors: yes
2015-10-27 19:29:37 +01:00
Note that the above system only governs the return value of failure of the particular task,
so if you have an undefined variable used or a syntax error, it will still raise an error that users will need to address.
Note that this will not prevent failures on connection or execution issues.
This feature only works when the task must be able to run and return a value of 'failed'.
.. _resetting_unreachable:
Resetting Unreachable Hosts
```````````````````````````
.. versionadded:: 2.2
Connection failures set hosts as 'UNREACHABLE', which will remove them from the list of active hosts for the run.
To recover from these issues you can use `meta: clear_host_errors` to have all currently flagged hosts reactivated,
so subsequent tasks can try to use them again.
2015-10-27 19:29:37 +01:00
2014-02-15 20:05:42 +01:00
.. _handlers_and_failure:
Handlers and Failure
````````````````````
When a task fails on a host, handlers which were previously notified
will *not* be run on that host. This can lead to cases where an unrelated failure
can leave a host in an unexpected state. For example, a task could update
a configuration file and notify a handler to restart some service. If a
task later on in the same play fails, the service will not be restarted despite
the configuration change.
You can change this behavior with the ``--force-handlers`` command-line option,
or by including ``force_handlers: True`` in a play, or ``force_handlers = True``
in ansible.cfg. When handlers are forced, they will run when notified even
if a task fails on that host. (Note that certain errors could still prevent
the handler from running, such as a host becoming unreachable.)
2013-10-12 17:20:56 +02:00
.. _controlling_what_defines_failure:
2013-10-12 16:51:32 +02:00
Controlling What Defines Failure
````````````````````````````````
2013-10-17 16:00:58 +02:00
Suppose the error code of a command is meaningless and to tell if there
2013-10-12 16:51:32 +02:00
is a failure what really matters is the output of the command, for instance
if the string "FAILED" is in the output.
2013-10-12 16:51:32 +02:00
Ansible provides a way to specify this behavior as follows::
2013-10-12 16:51:32 +02:00
- name: Fail task when the command error output prints FAILED
2013-10-12 16:51:32 +02:00
command: /usr/bin/example-command -x -y -z
register: command_result
failed_when: "'FAILED' in command_result.stderr"
or based on the return code::
- name: Fail task when both files are identical
raw: diff foo/file1 bar/file2
register: diff_cmd
failed_when: diff_cmd.rc == 0 or diff_cmd.rc >= 2
In previous version of Ansible, this can still be accomplished as follows::
2013-10-12 16:51:32 +02:00
- name: this command prints FAILED when it fails
command: /usr/bin/example-command -x -y -z
register: command_result
ignore_errors: True
- name: fail the play if the previous command did not succeed
fail: msg="the command failed"
when: "'FAILED' in command_result.stderr"
.. _override_the_changed_result:
2013-09-30 01:10:28 +02:00
Overriding The Changed Result
`````````````````````````````
2013-07-14 21:43:10 +02:00
2013-07-21 16:48:22 +02:00
When a shell/command or other module runs it will typically report
2013-07-24 00:49:27 +02:00
"changed" status based on whether it thinks it affected machine state.
2013-07-21 16:48:22 +02:00
Sometimes you will know, based on the return code
or output that it did not make any changes, and wish to override
the "changed" result such that it does not appear in report output or
does not cause handlers to fire::
2013-07-14 21:43:10 +02:00
tasks:
2013-07-21 16:48:22 +02:00
- shell: /usr/bin/billybass --mode="take me to the river"
register: bass_result
changed_when: "bass_result.rc != 2"
# this will never report 'changed' status
- shell: wall 'beep'
2013-10-18 20:13:13 +02:00
changed_when: False
2013-07-14 21:43:10 +02:00
Aborting the play
`````````````````
Sometimes it's desirable to abort the entire play on failure, not just skip remaining tasks for a host.
The ``any_errors_fatal`` play option will mark all hosts as failed if any fails, causing an immediate abort::
- hosts: somehosts
any_errors_fatal: true
roles:
- myrole
for finer-grained control ``max_fail_percentage`` can be used to abort the run after a given percentage of hosts has failed.
2012-05-13 17:00:02 +02:00
.. seealso::
:doc:`playbooks`
An introduction to playbooks
:doc:`playbooks_best_practices`
Best practices in playbooks
:doc:`playbooks_conditionals`
Conditional statements in playbooks
:doc:`playbooks_variables`
All about variables
`User Mailing List <http://groups.google.com/group/ansible-devel>`_
Have a question? Stop by the google group!
`irc.freenode.net <http://irc.freenode.net>`_
#ansible IRC chat channel
2012-05-13 17:00:02 +02:00