Re: etcd fails to start when trying to deploy diego with


Gwenn Etourneau
 

What bosh instances --ps give to you ?

On Wed, Jan 27, 2016 at 8:03 AM, Martin Jackson <martin(a)uncommonsense-uk.com
wrote:
Hi there I'm trying to deploy a CF release with Diego but I'm getting:

Started updating job etcd_z2 > etcd_z2/0 (canary). Failed: `etcd_z2/0' is
not running after update (00:10:46)
Error 400007: `etcd_z2/0' is not running after update

When I check the `etcd_ctl.err.log` on the failing node I can see:
```Error: cannot sync with the cluster using endpoints
https://etcd-z2-0.etcd.service.cf.internal:4001

# dig etcd-z2-0.etcd.service.cf.internal

; <<>> DiG 9.9.5-3ubuntu0.5-Ubuntu <<>> etcd-z2-0.etcd.service.cf.internal
;; global options: +cmd
;; Got answer:
;; ->>HEADER<<- opcode: QUERY, status: NXDOMAIN, id: 60655
;; flags: qr aa rd; QUERY: 1, ANSWER: 0, AUTHORITY: 0, ADDITIONAL: 0
;; WARNING: recursion requested but not available

;; QUESTION SECTION:
;etcd-z2-0.etcd.service.cf.internal. IN A

;; Query time: 1 msec
;; SERVER: 127.0.0.1#53(127.0.0.1)
;; WHEN: Tue Jan 26 09:14:40 UTC 2016
;; MSG SIZE rcvd: 52

So I can not resolv anything under the services domain.

deployment details:

+-----------+----------------------+----------------------------------------------+--------------+
| Name | Release(s) | Stemcell(s)
| Cloud Config |

+-----------+----------------------+----------------------------------------------+--------------+
| pulverize | cf/225 |
bosh-aws-xen-hvm-ubuntu-trusty-go_agent/3104 | none |
| | diego/0.1441.0 |
| |
| | etcd/18 |
| |
| | garden-linux/0.327.0 |
| |
| | nginx/2 |
| |

+-----------+----------------------+----------------------------------------------+--------------+

Regards

Martin Jackson

Join cf-dev@lists.cloudfoundry.org to automatically receive all group messages.