Re: cf-stub.yml example with minimum or required info


Gwenn Etourneau
 

Are you using CentOS?

/etc/resolvconf/resolv.conf.d/head

On Wed, Jun 17, 2015 at 2:21 PM, Ahmed Ali (ahmeali) <ahmeali(a)cisco.com>
wrote:

These are the errors I found under /var/vcap/sys/log:


/var/vcap/sys/log/consul_template_ctl.err.log:

[2015-06-17 05:10:03+0000] ------------ STARTING consul_template_ctl at
Wed Jun 17 05:10:03 UTC 2015 --------------
[2015-06-17 05:10:03+0000] 2015/06/17 05:10:03 [ERR] (runner) watcher
reported error: health services: error fetching: Get
http://127.0.0.1:8500/v1/health/service/ssh-proxy?wait=60000ms: dial tcp
127.0.0.1:8500: connection refused
[2015-06-17 05:10:08+0000] 2015/06/17 05:10:08 [ERR] (runner) watcher
reported error: health services: error fetching: Get
http://127.0.0.1:8500/v1/health/service/ssh-proxy?wait=60000ms: dial tcp
127.0.0.1:8500: connection refused
[2015-06-17 05:10:13+0000] 2015/06/17 05:10:13 [ERR] (runner) watcher
reported error: health services: error fetching: Get
http://127.0.0.1:8500/v1/health/service/ssh-proxy?wait=60000ms: dial tcp
127.0.0.1:8500: connection refused
[2015-06-17 05:10:14+0000] Received interrupt, cleaning up...
[2015-06-17 05:10:14+0000] /var/vcap/jobs/haproxy/bin/consul_template_ctl:
line 46: exit: : numeric argument required


/var/vcap/sys/log/monit/consul_agent.err.log:

/var/vcap/jobs/consul_agent/bin/agent_ctl: line 39:
/etc/resolvconf/resolv.conf.d/head: No such file or directory


/var/vcap/sys/log/metron_agent/metron_agent.stdout.log:

{"timestamp":1434518129.113139153,"process_id":2263,"source":"metron","log_level":"warn","message":"Failed to create client: Could not connect to NATS: nats: No servers available for connection","data":null,"file":"/var/vcap/data/compile/metron_agent/loggregator/src/github.com/cloudfoundry/loggregatorlib/cfcomponent/registrars/collectorregistrar/collector_registrar.go","line":51,"method":"github.com/cloudfoundry/loggregatorlib/cfcomponent/registrars/collectorregistrar.(*CollectorRegistrar).Run"}
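
The "connection refused" entries in consul_template_ctl.err.log above mean that nothing was accepting TCP connections on 127.0.0.1:8500 at that moment. A minimal Python sketch for checking this from the VM is shown here; it is only an illustration, the helper name port_is_open is made up, and it assumes a Python interpreter is available on the VM. It says nothing about why the agent is down, only whether the port answers.

```python
import socket


def port_is_open(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port can be established.

    Hypothetical helper for illustration; not part of BOSH or Consul.
    """
    try:
        # create_connection raises OSError (e.g. ConnectionRefusedError)
        # when nothing is listening or the attempt times out.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    # 8500 is the port that appears in the consul_template_ctl errors.
    print(port_is_open("127.0.0.1", 8500))
```

If this prints False while the consul_agent job is supposed to be running, the agent's own error log is the next place to look.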


Note: There is a proxy server between all CF VMs and the internet. Does
the deployment make any connections to the outside? If so, where in
cf-stub.yml can the proxy be configured?

Thank you!


From: Gwenn Etourneau <getourneau(a)pivotal.io>
Reply-To: "Discussions about the Cloud Foundry BOSH project." <
cf-bosh(a)lists.cloudfoundry.org>
Date: Tuesday, June 16, 2015 at 6:56 PM
To: "Discussions about the Cloud Foundry BOSH project." <
cf-bosh(a)lists.cloudfoundry.org>
Subject: Re: [cf-bosh] cf-stub.yml example with minimum or required info

Run bosh ssh to the haproxy VM and check the logs under /var/vcap/sys/log.

On Wed, Jun 17, 2015 at 5:14 AM, Ahmed Ali (ahmeali) <ahmeali(a)cisco.com>
wrote:

Thank you!


After adding environment to the meta section, the deployment moves
forward but times out on "Started updating job ha_proxy_z1 > ha_proxy_z1/0
(canary)".


What changed:

meta:
  environment: cf

properties:
  domain: foundry-appx.company.com  # domain used in the SSL cert



I think this could be related to the SSL cert and keys that should be
included in cf-stub.yml. I created the SSL certificate for "ssl_pem" by
following this link
<https://github.com/cloudfoundry/cf-release/tree/master/example_manifests>,
and also added the jwt signing_key, which was created using "ssh-keygen -t rsa".
I see another place (here
<http://docs.cloudfoundry.org/deploying/cf-stub-vsphere.html>) where an SSL
cert/key is needed in cf-stub.yml under "router > ssl_cert:", but I am not
sure whether it is the same as ssl_pem. Any idea what I'm missing here?


"bosh cck" comes out clean with no problems, and "bosh vms" shows all
VMs in the running state except "ha_proxy_z1", which is in the failing state.


# bosh deploy


Processing deployment manifest

------------------------------

Getting deployment properties from director...

Compiling deployment manifest...

Please review all changes carefully


Deploying

---------

Deployment name: `cf-deployment.yml'

Director name: `bosh2'

Are you sure you want to deploy? (type 'yes' to continue): yes


Director task 227

Started unknown

Started unknown > Binding deployment. Done (00:00:00)


Started preparing deployment

Started preparing deployment > Binding releases. Done (00:00:01)

Started preparing deployment > Binding existing deployment. Done
(00:00:00)

Started preparing deployment > Binding resource pools. Done (00:00:00)

Started preparing deployment > Binding stemcells. Done (00:00:00)

Started preparing deployment > Binding templates. Done (00:00:00)

Started preparing deployment > Binding properties. Done (00:00:00)

Started preparing deployment > Binding unallocated VMs. Done (00:00:00)

Started preparing deployment > Binding instance networks. Done
(00:00:00)


Started preparing package compilation > Finding packages to compile.
Done (00:00:00)


Started preparing dns > Binding DNS. Done (00:00:00)


Started preparing configuration > Binding configuration. Done
(00:00:02)


Started updating job ha_proxy_z1 > ha_proxy_z1/0 (canary). Failed: `ha_proxy_z1/0'
is not running after update (00:10:18)


Error 400007: `ha_proxy_z1/0' is not running after update




bosh task 227 --debug


E, [2015-06-16 19:41:51 #13416] [canary_update(ha_proxy_z1/0)] ERROR
-- DirectorJobRunner: Error updating canary instance:
#<Bosh::Director::AgentJobNotRunning: `ha_proxy_z1/0' is not running after
update>

I, [2015-06-16 19:41:51 #13416] [task:227] INFO -- DirectorJobRunner:
sending update deployment error event

D, [2015-06-16 19:41:51 #13416] [task:227] DEBUG -- DirectorJobRunner:
SENT: hm.director.alert
{"id":"52682ea9-0fa2-4edb-9611-128490279ba5","severity":3,"title":"director
- error during update deployment","summary":"Error during update
deployment for 'cf' against Director
'b9a1bf7b-952f-48e1-a496-f6543d7a782c':
#<Bosh::Director::AgentJobNotRunning: `ha_proxy_z1/0' is not running after
update>","created_at":1434483711}

E, [2015-06-16 19:41:51 #13416] [task:227] ERROR -- DirectorJobRunner:
`ha_proxy_z1/0' is not running after update

D, [2015-06-16 19:41:51 #13416] [task:227] DEBUG -- DirectorJobRunner:
(0.000495s) UPDATE "tasks" SET "state" = 'error', "timestamp" =
'2015-06-16 19:41:51.761481+0000', "description" = 'create deployment',
"result" = '`ha_proxy_z1/0'' is not running after update', "output" =
'/var/vcap/store/director/tasks/227', "checkpoint_time" = '2015-06-16
19:41:29.218401+0000', "type" = 'update_deployment', "username" = 'admin'
WHERE ("id" = 227)

Task 227 error

Thanks!


From: CF Runtime <cfruntime(a)gmail.com>
Reply-To: "Discussions about the Cloud Foundry BOSH project." <
cf-bosh(a)lists.cloudfoundry.org>
Date: Monday, June 15, 2015 at 2:10 PM
To: "Discussions about the Cloud Foundry BOSH project." <
cf-bosh(a)lists.cloudfoundry.org>
Subject: Re: [cf-bosh] cf-stub.yml example with minimum or required info

Hi Ahmed,

This property is coming from the "templates/cf-lamb.yml" within
cf-release. You are able to overwrite this property in your stub like so:

meta:
  environment: [name-of-environment]

Hope this helps,
Dan && James, CF Runtime Team

On Sun, Jun 14, 2015 at 7:13 PM, Ahmed Ali (ahmeali) <ahmeali(a)cisco.com>
wrote:

What should the value be then? Is it generated by spiff?

Should I place this in cf-stub.yml under properties to override what
spiff is doing, as follows?

properties:
  metron_agent:
    deployment: <???>

I found this link discussing the same issue:
https://github.com/cloudfoundry/bosh-lite/issues/265 but could not find
an answer there.

Thanks Gwenn



From: Gwenn Etourneau <getourneau(a)pivotal.io>
Reply-To: "Discussions about the Cloud Foundry BOSH project." <
cf-bosh(a)lists.cloudfoundry.org>
Date: Sunday, June 14, 2015 at 6:58 PM
To: "Discussions about the Cloud Foundry BOSH project." <
cf-bosh(a)lists.cloudfoundry.org>
Subject: Re: [cf-bosh] cf-stub.yml example with minimum or required info

It seems this cannot be null in the deployment manifest:

metron_agent:
  deployment: null

On Mon, Jun 15, 2015 at 10:50 AM, Ahmed Ali (ahmeali) <ahmeali(a)cisco.com>
wrote:
Both cf-stub.yml and the spiff generated cf-deployment.yml are
attached.

I noticed a section called "- default_networks:" inserted between jobs
in cf-deployment.yml. Does this look normal?

Note: I did not edit cf-deployment, it is what I get from spiff.

Environment info:
Ubuntu 14

BOSH 1.2977.0

cf version 6.11.3-cebadc9-2015-05-20T19:00:58+00:00

spiff version 1.0.6

ruby 1.9.3p484 (2013-11-22 revision 43786) [x86_64-linux]
vSphere 5.5


#bosh status

Config

/root/.bosh_config


Director

Name bosh2

URL https://10.195.166.12:25555

Version 1.2976.0 (00000000)

User admin

UUID b9a1bf7b-952f-48e1-a496-f6543d7a782c

CPI vsphere

dns enabled (domain_name: bosh)

compiled_package_cache disabled

snapshots enabled


Deployment

Manifest /root/deployment/cf-deployment.yml

#bosh releases


+------+------------+-------------+

| Name | Versions | Commit Hash |

+------+------------+-------------+

| cf | 211+dev.1* | 2121dc64+ |

+------+------------+-------------+

(*) Currently deployed

(+) Uncommitted changes




Thank you!

From: Gwenn Etourneau <getourneau(a)pivotal.io>
Reply-To: "Discussions about the Cloud Foundry BOSH project." <
cf-bosh(a)lists.cloudfoundry.org>
Date: Sunday, June 14, 2015 at 5:35 PM
To: "Discussions about the Cloud Foundry BOSH project." <
cf-bosh(a)lists.cloudfoundry.org>
Subject: Re: [cf-bosh] cf-stub.yml example with minimum or required
info

Please show us your manifest; it seems something is missing.



On Mon, Jun 15, 2015 at 4:08 AM, Ahmed Ali (ahmeali) <ahmeali(a)cisco.com>
wrote:
Hi Joseph,

Thank you! I changed to using two different networks, and now "bosh
deploy" works and all VMs are deployed successfully, but it looks like
there is a binding configuration error:

Binding configuration. Failed: Error filling in template
`metron_agent.json.erb' for `ha_proxy_z1/0' (line 5: Can't find property
`["metron_agent.deployment"]') (00:00:00)


Error 100: Error filling in template `metron_agent.json.erb' for
`ha_proxy_z1/0' (line 5: Can't find property `["metron_agent.deployment"]')


I also tried to connect using the cf CLI and could not:

root(a)cloudfoundry:~/deployment# cf api --skip-ssl-validation
foundry-appx.domain.com

Setting api endpoint to foundry-appx.domain.com...

FAILED
Error performing request: Get http://foundry-appx.domain.com/v2/info:
dial tcp x.x.166.18:80: connection refused


Full run console log:

root(a)cloudfoundry:~/deployment# bosh deploy


Processing deployment manifest

------------------------------

Getting deployment properties from director...

Unable to get properties list from director, trying without it...

Compiling deployment manifest...

Cannot get current deployment information from director, possibly a
new deployment

Please review all changes carefully


Deploying

---------

Deployment name: `cf-deployment.yml'

Director name: `bosh2'

Are you sure you want to deploy? (type 'yes' to continue): yes


Director task 172

Started unknown

Started unknown > Binding deployment. Done (00:00:00)


Started preparing deployment

Started preparing deployment > Binding releases. Done (00:00:00)

Started preparing deployment > Binding existing deployment. Done
(00:00:00)

Started preparing deployment > Binding resource pools. Done
(00:00:00)

Started preparing deployment > Binding stemcells. Done (00:00:00)

Started preparing deployment > Binding templates. Done (00:00:00)

Started preparing deployment > Binding properties. Done (00:00:00)

Started preparing deployment > Binding unallocated VMs. Done
(00:00:00)

Started preparing deployment > Binding instance networks. Done
(00:00:00)


Started preparing package compilation > Finding packages to compile.
Done (00:00:00)


Started preparing dns > Binding DNS. Done (00:00:00)


Started creating bound missing vms

Started creating bound missing vms > small_z1/0

Started creating bound missing vms > small_z1/1

Started creating bound missing vms > small_z1/2

Started creating bound missing vms > small_z2/0

Started creating bound missing vms > small_z2/1

Started creating bound missing vms > medium_z1/0

Started creating bound missing vms > medium_z1/1

Started creating bound missing vms > medium_z1/2

Started creating bound missing vms > medium_z1/3

Started creating bound missing vms > medium_z1/4

Started creating bound missing vms > medium_z1/5

Started creating bound missing vms > medium_z1/6

Started creating bound missing vms > medium_z1/7

Started creating bound missing vms > medium_z1/8

Started creating bound missing vms > medium_z2/0

Started creating bound missing vms > medium_z2/1

Started creating bound missing vms > medium_z2/2

Started creating bound missing vms > medium_z2/3

Started creating bound missing vms > medium_z2/4

Started creating bound missing vms > large_z1/0

Started creating bound missing vms > large_z2/0

Started creating bound missing vms > runner_z1/0

Started creating bound missing vms > runner_z2/0

Started creating bound missing vms > router_z1/0

Started creating bound missing vms > router_z1/1

Started creating bound missing vms > router_z2/0

Done creating bound missing vms > medium_z1/0 (00:00:30)

Done creating bound missing vms > medium_z1/5 (00:00:32)

Done creating bound missing vms > small_z2/1 (00:00:34)

Done creating bound missing vms > medium_z1/2 (00:00:34)

Done creating bound missing vms > medium_z2/4 (00:00:35)

Done creating bound missing vms > medium_z1/1 (00:00:45)

Done creating bound missing vms > medium_z1/7 (00:00:45)

Done creating bound missing vms > small_z1/0 (00:00:46)

Done creating bound missing vms > router_z1/1 (00:00:47)

Done creating bound missing vms > medium_z2/2 (00:00:49)

Done creating bound missing vms > medium_z2/3 (00:00:49)

Done creating bound missing vms > large_z2/0 (00:00:51)

Done creating bound missing vms > medium_z1/4 (00:00:52)

Done creating bound missing vms > router_z1/0 (00:00:52)

Done creating bound missing vms > small_z1/1 (00:00:55)

Done creating bound missing vms > router_z2/0 (00:00:55)

Done creating bound missing vms > small_z2/0 (00:00:59)

Done creating bound missing vms > large_z1/0 (00:00:59)

Done creating bound missing vms > medium_z2/1 (00:01:00)

Done creating bound missing vms > medium_z1/6 (00:01:00)

Done creating bound missing vms > medium_z1/3 (00:01:01)

Done creating bound missing vms > medium_z1/8 (00:01:01)

Done creating bound missing vms > medium_z2/0 (00:01:01)

Done creating bound missing vms > runner_z2/0 (00:01:02)

Done creating bound missing vms > runner_z1/0 (00:01:02)

Done creating bound missing vms > small_z1/2 (00:01:03)

Done creating bound missing vms (00:01:03)


Started binding instance vms

Started binding instance vms > ha_proxy_z1/0

Started binding instance vms > nats_z1/0

Started binding instance vms > nats_z2/0

Started binding instance vms > etcd_z1/0

Started binding instance vms > etcd_z1/1

Started binding instance vms > etcd_z2/0

Started binding instance vms > stats_z1/0

Started binding instance vms > nfs_z1/0

Started binding instance vms > postgres_z1/0

Started binding instance vms > uaa_z1/0

Started binding instance vms > uaa_z2/0

Started binding instance vms > api_z1/0

Started binding instance vms > api_z2/0

Started binding instance vms > clock_global/0

Started binding instance vms > api_worker_z1/0

Started binding instance vms > api_worker_z2/0

Started binding instance vms > hm9000_z1/0

Started binding instance vms > hm9000_z2/0

Started binding instance vms > runner_z1/0

Started binding instance vms > runner_z2/0

Started binding instance vms > loggregator_z1/0

Started binding instance vms > loggregator_z2/0

Started binding instance vms > loggregator_trafficcontroller_z1/0

Started binding instance vms > router_z1/0

Started binding instance vms > loggregator_trafficcontroller_z2/0

Started binding instance vms > router_z2/0

Done binding instance vms > etcd_z2/0 (00:00:00)

Done binding instance vms > ha_proxy_z1/0 (00:00:00)

Done binding instance vms > nats_z1/0 (00:00:00)

Done binding instance vms > nats_z2/0 (00:00:00)

Done binding instance vms > clock_global/0 (00:00:00)

Done binding instance vms > etcd_z1/0 (00:00:00)

Done binding instance vms > uaa_z1/0 (00:00:00)

Done binding instance vms > nfs_z1/0 (00:00:00)

Done binding instance vms > postgres_z1/0 (00:00:00)

Done binding instance vms > api_z2/0 (00:00:00)

Done binding instance vms > api_z1/0 (00:00:00)

Done binding instance vms > uaa_z2/0 (00:00:00)

Done binding instance vms > etcd_z1/1 (00:00:00)

Done binding instance vms > stats_z1/0 (00:00:00)

Done binding instance vms > hm9000_z2/0 (00:00:00)

Done binding instance vms > hm9000_z1/0 (00:00:00)

Done binding instance vms > runner_z1/0 (00:00:00)

Done binding instance vms > loggregator_z1/0 (00:00:00)

Done binding instance vms > loggregator_z2/0 (00:00:00)

Done binding instance vms > loggregator_trafficcontroller_z1/0
(00:00:00)

Done binding instance vms > loggregator_trafficcontroller_z2/0
(00:00:00)

Done binding instance vms > runner_z2/0 (00:00:00)

Done binding instance vms > router_z2/0 (00:00:00)

Done binding instance vms > router_z1/0 (00:00:00)

Done binding instance vms > api_worker_z1/0 (00:00:01)

Done binding instance vms > api_worker_z2/0 (00:00:01)

Done binding instance vms (00:00:01)


Started preparing configuration > Binding configuration. Failed: Error
filling in template `metron_agent.json.erb' for `ha_proxy_z1/0' (line 5:
Can't find property `["metron_agent.deployment"]') (00:00:00)


Error 100: Error filling in template `metron_agent.json.erb' for
`ha_proxy_z1/0' (line 5: Can't find property `["metron_agent.deployment"]')


Task 172 error


For a more detailed error report, run: bosh task 172 --debug

root(a)cloudfoundry:~/deployment# bosh cck

Performing cloud check...


Processing deployment manifest

------------------------------


Director task 173

Started scanning 26 vms

Started scanning 26 vms > Checking VM states. Done (00:00:00)

Started scanning 26 vms > 26 OK, 0 unresponsive, 0 missing, 0
unbound, 0 out of sync. Done (00:00:00)

Done scanning 26 vms (00:00:00)


Started scanning 0 persistent disks

Started scanning 0 persistent disks > Looking for inactive disks.
Done (00:00:00)

Started scanning 0 persistent disks > 0 OK, 0 missing, 0 inactive,
0 mount-info mismatch. Done (00:00:00)

Done scanning 0 persistent disks (00:00:00)


Task 173 done


Started 2015-06-14 18:34:55 UTC

Finished 2015-06-14 18:34:55 UTC

Duration 00:00:00


Scan is complete, checking if any problems found...

No problems found

root(a)cloudfoundry:~/deployment# cf api --skip-ssl-validation
foundry-appx.domain.com

Setting api endpoint to foundry-appx.domain.com...

FAILED

Error performing request: Get http://foundry-appx.domain.com/v2/info:
dial tcp x.x.166.18:80: connection refused




Note: the cluster is built inside a lab environment behind a proxy. I did
not configure CF with the proxy and am not sure if I need to do something
specific.



Thanks!




From: CF Runtime <cfruntime(a)gmail.com>
Reply-To: "Discussions about the Cloud Foundry BOSH project." <
cf-bosh(a)lists.cloudfoundry.org>
Date: Friday, June 12, 2015 at 5:39 PM
To: "Discussions about the Cloud Foundry BOSH project." <
cf-bosh(a)lists.cloudfoundry.org>
Subject: Re: [cf-bosh] cf-stub.yml example with minimum or required
info

BOSH assumes that any address in the network section that is not static
or reserved is available for other instances in that zone. Because your
two subnets overlap, and you have not partitioned them off using the
reserved sections, BOSH is using that IP for something in the other zone.

Normally, if you only have a single network, it is easier to set the
instance count for jobs in the second zone to zero and scale up the jobs
in the first zone to multiple instances if you want redundancy.
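
To make the overlap concrete, here is a rough Python sketch. It is not BOSH code: the helper names expand and dynamic_pool are made up, and it assumes BOSH treats every host address that is neither the gateway, static, nor reserved as free for dynamic use. Fed the cf1/cf2 ranges from the stub quoted further down, it shows that cf1's static block sits inside the pool cf2 may allocate from, which would match the duplicate 10.195.166.110 reported below.

```python
import ipaddress


def expand(span):
    """Expand "a - b" (or a single address) into a set of IPv4 addresses."""
    parts = [p.strip() for p in span.split(" - ")]
    first = int(ipaddress.ip_address(parts[0]))
    last = int(ipaddress.ip_address(parts[-1]))
    return {ipaddress.ip_address(n) for n in range(first, last + 1)}


def dynamic_pool(cidr, gateway, static, reserved):
    """Host addresses left after removing gateway, static and reserved spans.

    Assumption: this only approximates how BOSH chooses dynamic IPs.
    """
    pool = set(ipaddress.ip_network(cidr).hosts())
    pool.discard(ipaddress.ip_address(gateway))
    for span in static + reserved:
        pool -= expand(span)
    return pool


# Ranges copied from the cf1/cf2 network section in this thread.
RESERVED = ["10.195.166.2 - 10.195.166.104", "10.195.166.188 - 10.195.166.254"]
cf1_static = expand("10.195.166.105 - 10.195.166.135")
cf2_pool = dynamic_pool("10.195.166.0/23", "10.195.166.1",
                        ["10.195.166.136 - 10.195.166.166"], RESERVED)

# Addresses cf1 assigns statically that cf2 could also assign dynamically.
clash = cf1_static & cf2_pool
print(ipaddress.ip_address("10.195.166.110") in clash)
```

Adding the other network's static range to each network's reserved list would empty the clash set in this sketch.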

Joseph Palermo
CF Runtime Team

On Fri, Jun 12, 2015 at 1:29 PM, Ahmed Ali (ahmeali) <
ahmeali(a)cisco.com> wrote:

Sorry I did not see this reply from Gwenn Etourneau.

After "bosh cck" found 6 problems, I tried option 2 (reboot) and also
option 3 (recreate VM); neither fixed the issue. I noticed, however, that
the problematic VMs are using duplicate IPs. The network section in my
manifest has 2 networks (cf1 and cf2) and there is no overlap; see the
network section below.

For example the VM router_z1/0
(vm-c36efd49-7ac7-4b90-9779-b5192408e4a6) got the ip
10.195.166.110, and another VM from same deployment (
vm-2b532db0-36df-433d-8132-89c76a9c81c3) got the same ip
10.195.166.110

I also tried removing the "jobs" section and going with the defaults
generated by spiff, and ran into the same issue. Do I have to statically
assign an IP address to each job?


M-20JW:cf-release ahmeali$ bosh cck
Performing cloud check...

Processing deployment manifest
------------------------------

Director task 141
Started scanning 26 vms
Started scanning 26 vms > Checking VM states. Done (00:00:10)
Started scanning 26 vms > 20 OK, 6 unresponsive, 0 missing, 0
unbound, 0 out of sync. Done (00:00:00)
Done scanning 26 vms (00:00:10)

Started scanning 0 persistent disks
Started scanning 0 persistent disks > Looking for inactive disks.
Done (00:00:00)
Started scanning 0 persistent disks > 0 OK, 0 missing, 0 inactive,
0 mount-info mismatch. Done (00:00:00)
Done scanning 0 persistent disks (00:00:00)

Task 141 done

Started 2015-06-12 20:03:17 UTC
Finished 2015-06-12 20:03:27 UTC
Duration 00:00:10

Scan is complete, checking if any problems found...

Found 6 problems

Problem 1 of 6: Unknown VM
(vm-e4da0933-52ba-473f-903d-a9ee09d1671f) is not responding.
1. Ignore problem
2. Reboot VM
3. Recreate VM using last known apply spec
4. Delete VM reference (DANGEROUS!)
Please choose a resolution [1 - 4]: 2

Problem 2 of 6: hm9000_z1/0
(vm-5ea4c90d-247e-43ad-a189-ff4d2d781854) is not responding.
1. Ignore problem
2. Reboot VM
3. Recreate VM using last known apply spec
4. Delete VM reference (DANGEROUS!)
Please choose a resolution [1 - 4]: 2

Problem 3 of 6: router_z1/0
(vm-c36efd49-7ac7-4b90-9779-b5192408e4a6) is not responding.
1. Ignore problem
2. Reboot VM
3. Recreate VM using last known apply spec
4. Delete VM reference (DANGEROUS!)
Please choose a resolution [1 - 4]: 2

Problem 4 of 6: nfs_z1/0 (vm-868e2bb9-ac61-49f0-86fb-38d5c338201b)
is not responding.
1. Ignore problem
2. Reboot VM
3. Recreate VM using last known apply spec
4. Delete VM reference (DANGEROUS!)
Please choose a resolution [1 - 4]: 2

Problem 5 of 6: router_z2/0
(vm-773e83ee-c97f-4aa4-b163-d09a703a4678) is not responding.
1. Ignore problem
2. Reboot VM
3. Recreate VM using last known apply spec
4. Delete VM reference (DANGEROUS!)
Please choose a resolution [1 - 4]: 2

Problem 6 of 6: ha_proxy_z1/0
(vm-643f3c80-93cb-4ccf-9239-1f85a407a317) is not responding.
1. Ignore problem
2. Reboot VM
3. Recreate VM using last known apply spec
4. Delete VM reference (DANGEROUS!)
Please choose a resolution [1 - 4]: 2

Below is the list of resolutions you've provided
Please make sure everything is fine and confirm your changes

1. Unknown VM (vm-e4da0933-52ba-473f-903d-a9ee09d1671f) is not
responding
Reboot VM

2. hm9000_z1/0 (vm-5ea4c90d-247e-43ad-a189-ff4d2d781854) is not
responding
Reboot VM

3. router_z1/0 (vm-c36efd49-7ac7-4b90-9779-b5192408e4a6) is not
responding
Reboot VM

4. nfs_z1/0 (vm-868e2bb9-ac61-49f0-86fb-38d5c338201b) is not
responding
Reboot VM

5. router_z2/0 (vm-773e83ee-c97f-4aa4-b163-d09a703a4678) is not
responding
Reboot VM

6. ha_proxy_z1/0 (vm-643f3c80-93cb-4ccf-9239-1f85a407a317) is not
responding
Reboot VM

Apply resolutions? (type 'yes' to continue): yes
Applying resolutions...

Director task 142
Started applying problem resolutions
Started applying problem resolutions > unresponsive_agent 154:
Reboot VM. Failed: Agent is responding now, skipping resolution (00:00:00)
Started applying problem resolutions > unresponsive_agent 184:
Reboot VM. Failed: Agent is responding now, skipping resolution (00:00:00)
Started applying problem resolutions > unresponsive_agent 177:
Reboot VM. Done (00:00:30)
Started applying problem resolutions > unresponsive_agent 179:
Reboot VM. Failed: Agent is responding now, skipping resolution (00:00:00)
Started applying problem resolutions > unresponsive_agent 185:
Reboot VM. Done (00:00:11)
Started applying problem resolutions > unresponsive_agent 180:
Reboot VM. Done (00:00:29)
Failed applying problem resolutions (00:01:10)

Task 142 done

Started 2015-06-12 20:04:01 UTC
Finished 2015-06-12 20:05:11 UTC
Duration 00:01:10
Cloudcheck is finished


Network section in cf-stub.yml


---
name: cf
director_uuid: b9a1bf7b-952f-48e1-a496-f6543d7a782c

releases:
- {name: cf, version: latest}

networks:
- name: cf1
  subnets:
  - range: 10.195.166.0/23
    gateway: 10.195.166.1
    static:
    - 10.195.166.105 - 10.195.166.135
    reserved:
    # .1 is special
    - 10.195.166.2 - 10.195.166.104
    - 10.195.166.188 - 10.195.166.254
    # .255 is special
    dns: [10.166.168.183]
    cloud_properties:
      name: '10.195.166.x'
- name: cf2
  subnets:
  - range: 10.195.166.0/23
    gateway: 10.195.166.1
    static:
    - 10.195.166.136 - 10.195.166.166
    reserved:
    # .1 is special
    - 10.195.166.2 - 10.195.166.104
    - 10.195.166.188 - 10.195.166.254
    # .255 is special
    dns: [10.166.168.183]
    cloud_properties:
      name: '10.195.166.x'


Thank you


From: Gwenn Etourneau <getourneau(a)pivotal.io>
Reply-To: "Discussions about the Cloud Foundry BOSH project." <
cf-bosh(a)lists.cloudfoundry.org>
Date: Monday, June 8, 2015 at 6:16 PM

To: "Discussions about the Cloud Foundry BOSH project." <
cf-bosh(a)lists.cloudfoundry.org>
Subject: Re: [cf-bosh] cf-stub.yml example with minimum or required
info

Try to make bosh cck before bosh deploy.

On Tue, Jun 9, 2015 at 3:43 AM, Ahmed Ali (ahmeali) <
ahmeali(a)cisco.com> wrote:

I was able to fix this by copying from the generated
cf-deployment.yml into cf-stub.yml and then changing the cf release and
jobs sections.


Now "bosh deploy" runs but does not complete. I think the issue has to
do with misconfigured jobs/nodes. I am not sure which jobs/nodes I need
to include or which properties each one needs. Any idea where I can find
information about the list of jobs/nodes and the properties for each,
such as templates, networks, etc.?

"bosh vms" shows some VMs with an "unresponsive agent" state and an unknown job.


+------------------------------------+--------------------+---------------+----------------+
| Job/index                          | State              | Resource Pool | IPs            |
+------------------------------------+--------------------+---------------+----------------+
| unknown/unknown                    | unresponsive agent |               |                |
| unknown/unknown                    | unresponsive agent |               |                |
| unknown/unknown                    | unresponsive agent |               |                |
| unknown/unknown                    | unresponsive agent |               |                |
| unknown/unknown                    | unresponsive agent |               |                |
| unknown/unknown                    | unresponsive agent |               |                |
| unknown/unknown                    | running            | medium_z1     | 10.195.166.106 |
| unknown/unknown                    | running            | router_z1     | 10.195.166.108 |
| unknown/unknown                    | running            | small_z1      | 10.195.166.141 |
| api_z1/0                           | running            | large_z1      | 10.195.166.139 |
| api_z2/0                           | running            | large_z2      | 10.195.166.107 |
| clock_global/0                     | running            | medium_z1     | 10.195.166.140 |
| consul_z1/0                        | running            | medium_z1     | 10.195.166.122 |
| consul_z2/0                        | running            | medium_z2     | 10.195.166.153 |
| etcd_z1/0                          | running            | medium_z1     | 10.195.166.119 |
| etcd_z1/1                          | running            | medium_z1     | 10.195.166.120 |
| etcd_z2/0                          | running            | medium_z2     | 10.195.166.149 |
| hm9000_z1/0                        | running            | medium_z1     | 10.195.166.142 |
| hm9000_z2/0                        | running            | medium_z2     | 10.195.166.109 |
| loggregator_trafficcontroller_z1/0 | running            | small_z1      | 10.195.166.145 |
| loggregator_trafficcontroller_z2/0 | running            | small_z2      | 10.195.166.112 |
| loggregator_z1/0                   | running            | medium_z1     | 10.195.166.144 |
| loggregator_z2/0                   | running            | medium_z2     | 10.195.166.111 |
| login_z1/0                         | running            | medium_z1     | 10.195.166.138 |
| postgres_z1/0                      | running            | medium_z1     | 10.195.166.117 |
| routing_api_z1/0                   | running            | medium_z1     | 10.195.166.121 |
| routing_api_z2/0                   | running            | medium_z2     | 10.195.166.152 |
| runner_z1/0                        | running            | runner_z1     | 10.195.166.143 |
| runner_z2/0                        | running            | runner_z2     | 10.195.166.110 |
| stats_z1/0                         | running            | small_z1      | 10.195.166.136 |
| uaa_z1/0                           | running            | medium_z1     | 10.195.166.137 |
| uaa_z2/0                           | running            | medium_z2     | 10.195.166.105 |
+------------------------------------+--------------------+---------------+----------------+


VMs total: 32


Started preparing deployment > Binding existing deployment.
Failed: Timed out sending `get_state' to
eabc55f7-b402-42f7-85b5-e3732306b813 after 45 seconds (00:02:15)
Error 450002: Timed out sending `get_state' to
eabc55f7-b402-42f7-85b5-e3732306b813 after 45 seconds


Thanks
Ali

From: AHMED ALI <ahmeali(a)cisco.com>
Reply-To: "Discussions about the Cloud Foundry BOSH project." <
cf-bosh(a)lists.cloudfoundry.org>
Date: Friday, June 5, 2015 at 10:45 AM

To: "Discussions about the Cloud Foundry BOSH project." <
cf-bosh(a)lists.cloudfoundry.org>
Subject: Re: [cf-bosh] cf-stub.yml example with minimum or required
info

That fixed the error, thank you!

And I ran into another error :)

M-20JW:cf-release ali00$ bosh deployment cf-deployment.yml
Deployment set to
`/Users/ali00/deployments/cf-release/cf-deployment.yml'
M-20JW:cf-release ali00$ ./generate_deployment_manifest vsphere
cf-stub.yml > cf-deployment.yml
M-20JW:cf-release ali00$ bosh deploy

Processing deployment manifest
------------------------------
Getting deployment properties from director...
Compiling deployment manifest...
Please review all changes carefully

Deploying
---------
Deployment name: `cf-deployment.yml'
Director name: `bosh2'
Are you sure you want to deploy? (type 'yes' to continue): yes

Director task 38
Started unknown
Started unknown > Binding deployment. Done (00:00:00)

Error 140002: Template `consul_agent' (job `consul_z1') references
an unknown release `cf'

Task 38 error

For a more detailed error report, run: bosh task 38 --debug
M-20JW:cf-release ali00$


How can I construct the section in cf-stub to provide the correct cf
release, which I think should be cf-210 in my case (not sure)?


Modified networks part in my cf-stub.yml
-------------------------------------------------------
releases:
- name: cf-210
  version: latest

networks:
- name: cf1
  subnets:
  - range: 10.195.166.0/23
    gateway: 10.195.166.1
    static:
    - 10.195.166.104 - 10.195.166.125
    reserved:
    # .1 is special
    - 10.195.166.2 - 10.195.166.101
    - 10.195.166.147 - 10.195.166.254
    # .255 is special
    dns: [10.166.168.183]
    cloud_properties:
      name: '10.195.166.x'
- name: cf2
  subnets:
  - range: 10.195.166.0/23
    gateway: 10.195.166.1
    static:
    - 10.195.166.126 - 10.195.166.146
    reserved:
    # .1 is special
    - 10.195.166.2 - 10.195.166.101
    - 10.195.166.147 - 10.195.166.254
    # .255 is special
    dns: [10.166.168.183]
    cloud_properties:
      name: '10.195.166.x'


------------------------------------------------




Thanks
Ali


From: CF Runtime <cfruntime(a)gmail.com>
Reply-To: "Discussions about the Cloud Foundry BOSH project." <
cf-bosh(a)lists.cloudfoundry.org>
Date: Friday, June 5, 2015 at 10:11 AM
To: "Discussions about the Cloud Foundry BOSH project." <
cf-bosh(a)lists.cloudfoundry.org>
Subject: Re: [cf-bosh] cf-stub.yml example with minimum or required
info

Hi Ahmed,

It looks like you haven't allocated enough IPs in your network.
The line for static IPs, "10.166.166.104 - 10.166.166.115", should
be increased to have at least 19 IPs. You'll need to decrease the number of
reserved addresses as well in order to increase the number of available IPs
in your network. We recommend "10.166.166.104 - 10.166.166.123"
for available IPs and "10.166.166.124 - 10.166.167.254" for your
reserved range. If you're tracking our current develop branch and not the
final releases, you should look in
cf-release/spec/fixtures/vsphere/cf-stub.yml for the stub that we use to do
our vsphere acceptance tests.
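
The arithmetic behind this can be sketched in a few lines of Python. This is only an illustration: the helper names span_size and required_static_ips are made up, and it assumes the indices passed to spiff's static_ips(...) count from zero, which is consistent with static_ips(12) being unresolved against a 12-address range in the error quoted below.

```python
import ipaddress


def span_size(span):
    """Number of addresses in an inclusive "first - last" range."""
    first, last = (ipaddress.ip_address(p.strip()) for p in span.split(" - "))
    return int(last) - int(first) + 1


def required_static_ips(indices):
    """Addresses needed so that every index resolves (zero-based assumption)."""
    return max(indices) + 1


# Indices taken from the "unresolved nodes" output in this thread.
indices = [12, 16, 14, 15, 17, 18, 19]

print(span_size("10.166.166.104 - 10.166.166.115"))  # 12
print(span_size("10.166.166.104 - 10.166.166.123"))  # 20
print(required_static_ips(indices))                  # 20
```

Under that assumption the original range stops one short of index 12, while the recommended range covers index 19.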

Best,
Zachary Auerbach + Dan Lavine CF Runtime Team

On Thu, Jun 4, 2015 at 2:18 PM, Ahmed Ali (ahmeali) <ahmeali(a)cisco.com> wrote:

Thanks Joseph for your help, please see the error below:

20JXXW:cf-release ali00$ ./generate_deployment_manifest vsphere cf-stub.yml > cf-deployment.yml
2015/06/04 13:34:50 error generating manifest: unresolved nodes:
    (( static_ips(12) )) in ./templates/cf-infrastructure-vsphere.yml jobs.[5].networks.[0].static_ips
    (( static_ips(16) )) in ./templates/cf-infrastructure-vsphere.yml jobs.[8].networks.[0].static_ips
    (( static_ips(14, 15) )) in ./templates/cf-infrastructure-vsphere.yml jobs.[15].networks.[0].static_ips
    (( static_ips(17, 18, 19) )) in ./templates/cf-infrastructure-vsphere.yml jobs.[17].networks.[0].static_ips
    (( jobs.postgres_z1.networks.cf1.static_ips.[0] )) in dynaml properties.databases.address
    (( properties.databases.address )) in dynaml properties.ccdb.address
    (( properties.databases.address )) in dynaml properties.uaadb.address
M-2XX0JW:cf-release ali00$
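
For anyone hitting the same output, a small Python sketch can pull the static_ips indices out of the "unresolved nodes" lines to see how many static addresses the templates ask for. It is not a spiff feature; unresolved_indices is a made-up helper, the sample text is the four static_ips lines from the error above, and the "plus one" at the end assumes the indices are zero-based.

```python
import re

# The static_ips lines from the error output above, one node per line.
ERROR_OUTPUT = """\
(( static_ips(12) )) in ./templates/cf-infrastructure-vsphere.yml jobs.[5].networks.[0].static_ips
(( static_ips(16) )) in ./templates/cf-infrastructure-vsphere.yml jobs.[8].networks.[0].static_ips
(( static_ips(14, 15) )) in ./templates/cf-infrastructure-vsphere.yml jobs.[15].networks.[0].static_ips
(( static_ips(17, 18, 19) )) in ./templates/cf-infrastructure-vsphere.yml jobs.[17].networks.[0].static_ips
"""


def unresolved_indices(text):
    """Collect every index passed to static_ips(...) in the given text."""
    indices = set()
    for args in re.findall(r"static_ips\(([^)]*)\)", text):
        indices.update(int(a) for a in args.split(","))
    return sorted(indices)


indices = unresolved_indices(ERROR_OUTPUT)
addresses_needed = max(indices) + 1  # assumption: indices are zero-based

print(indices)
print(addresses_needed)
```

The lowest unresolved index is 12, which lines up with a static range of 12 addresses (10.166.166.104 - 10.166.166.115) covering only indices 0 through 11.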


I do not want to bug the cf-bosh alias with every error I run into, so
my ask is for a sample cf-stub.yml with all the minimum required values.
I'm sure I'm missing a lot :). When I first ran the online sample at
http://docs.cloudfoundry.org/deploying/cf-stub-vsphere.html, I got the
error “Error 40001: Required property `range' was not specified in object”;
after I added the “range” property I got the error above.

I'm looking to build a POC CF with minimum effort. I have one
network (10.166.166.0/23) and vSphere 5.x, and I want to use that network
for both CF networks (cf1 and cf2). I am not sure how many IPs I need on
each network, or whether I have to specify the node specs and vSphere info
in cf-stub, since I do not see a section for them.

I also tried bosh-lite and it worked fine on Ubuntu 14.

Here is my cf-stub.yml in case you want to have a look


# The following line helps maintain current documentation at http://docs.cloudfoundry.org.
# code_snippet cf-stub-vsphere start
---
name: cloudfoundry
director_uuid: b9a1bf7b-952f-48e1-a496-f6543d7a782c

releases:
- name: cf-210
  version: latest

networks:
- name: cf1
  subnets:
  - range: 10.166.166.0/23
    gateway: 10.195.76.1
    static:
    - 10.166.166.104 - 10.166.166.115
    reserved:
    # .1 is special
    - 10.166.166.2 - 10.166.166.101
    - 10.166.166.120 - 10.194.167.254
    # .255 is special
    dns: [10.166.168.183]
    cloud_properties:
      name: '10.166.166.x'
- name: cf2
  subnets:
  - range: 10.166.166.0/23
    gateway: 10.166.166.1
    static:
    - 10.166.166.120 - 10.166.166.140
    reserved:
    # .1 is special
    - 10.166.166.2 - 10.166.166.101
    - 10.166.166.120 - 10.195.167.254
    # .255 is special
    dns: [10.166.168.183]
    cloud_properties:
      name: '10.166.166.x'

jobs:
  ha_proxy_z1:
    properties:
      ha_proxy:
        disable_http: true
properties:
  cc:
    droplets:
      droplet_directory_key: the_key
    buildpacks:
      buildpack_directory_key: bd_key
    staging_upload_user: username
    staging_upload_password: password
    bulk_api_password: password
    db_encryption_key: the_key
  dea_next:
    disk_mb: 2048
    memory_mb: 1024
  loggregator_endpoint:
    shared_secret: loggregator_endpoint_secret
  nats:
    user: nats_user
    password: nats_password
  router:
    enable_ssl: true
    ssl_cert: |
      -----BEGIN CERTIFICATE-----
      MIIDBjCCAe4CCQCz3nn1SWrDdTANBgkqhkiG9w0BAQUFADBFMQswCQYDVQQGEwJB
      VTETMBEGA1UECBMKU29tZS1TdGF0ZTEhMB8GA1UEChMYSW50ZXJuZXQgV2lkZ2l0
      cyBQdHkgTHRkMB4XDTE1MDMwMzE4NTMyNloXDTE2MDMwMjE4NTMyNlowRTELMAkG
      A1UEBhMCQVUxEzARBgNVBAgTClNvbWUtU3RhdGUxITAfBgNVBAoTGEludGVybmV0
      IFdpZGdpdHMgUHR5IEx0ZDCCASIwDQYJKoZIhvcNAQEBBQADggEPADCCAQoCggEB
      AKtTK9xq/ycRO3fWbk1abunYf9CY6sl0Wlqm9UPMkI4j0itY2OyGyn1YuCCiEdM3
      b8guGSWB0XSL5PBq33e7ioiaH98UEe+Ai+TBxnJsro5WQ/TMywzRDhZ4E7gxDBav
      88ZY+y7ts0HznfxqEIn0Gu/UK+s6ajYcIy7d9L988+hA3K1FSdes8MavXhrI4xA1
      fY21gESfFkD4SsqvrkISC012pa7oVw1f94slIVcAG+l9MMAkatBGxgWAQO6kxk5o
      oH1Z5q2m0afeQBfFqzu5lCITLfgTWCUZUmbF6UpRhmD850/LqNtryAPrLLqXxdig
      OHiWqvFpCusOu/4z1uGC5xECAwEAATANBgkqhkiG9w0BAQUFAAOCAQEAV5RAFVQy
      8Krs5c9ebYRseXO6czL9/Rfrt/weiC1XLcDkE2i2yYsBXazMYr58o4hACJwe2hoC
      bihBZ9XnVpASEYHDLwDj3zxFP/bTuKs7tLhP7wz0lo8i6k5VSPAGBq2kjc/cO9a3
      TMmLPks/Xm42MCSWGDnCEX1854B3+JK3CNEGqSY7FYXU4W9pZtHPZ3gBoy0ymSpg
      mpleiY1Tbn5I2X7vviMW7jeviB5ivkZaXtObjyM3vtPLB+ILpa15ZhDSE5o71sjA
      jXqrE1n5o/GXHX+1M8v3aJc30Az7QAqWohW/tw5SoiSmVQZWd7gFht9vSzaH2WgO
      LwcpBC7+cUJEww==
      -----END CERTIFICATE-----
    ssl_key: |
      -----BEGIN RSA PRIVATE KEY-----
      MIIEpAIBAAKCAQEAq1Mr3Gr/JxE7d9ZuTVpu6dh/0JjqyXRaWqb1Q8yQjiPSK1jY
      7IbKfVi4IKIR0zdvyC4ZJYHRdIvk8Grfd7uKiJof3xQR74CL5MHGcmyujlZD9MzL
      DNEOFngTuDEMFq/zxlj7Lu2zQfOd/GoQifQa79Qr6zpqNhwjLt30v3zz6EDcrUVJ
      16zwxq9eGsjjEDV9jbWARJ8WQPhKyq+uQhILTXalruhXDV/3iyUhVwAb6X0wwCRq
      0EbGBYBA7qTGTmigfVnmrabRp95AF8WrO7mUIhMt+BNYJRlSZsXpSlGGYPznT8uo
      22vIA+ssupfF2KA4eJaq8WkK6w67/jPW4YLnEQIDAQABAoIBAQCDVqpcOoZKK9K8
      Bt3eXQKEMJ2ji2cKczFFJ5MEm9EBtoJLCryZbqfSue3Fzpj9pBUEkBpk/4VT5F7o
      0/Vmc5Y7LHRcbqVlRtV30/lPBPQ4V/eWtly/AZDcNsdfP/J1fgPSvaoqCr2ORLWL
      qL/vEfyIeM4GcWy0+JMcPbmABslw9O6Ptc5RGiP98vCLHQh/++sOtj6PH1pt+2X/
      Uecv3b1Hk/3Oe+M8ySorJD3KA94QTRnKX+zubkxRg/zCAki+as8rQc/d+BfVG698
      ylUT5LVLNuwbWnffY2Zt5x5CDqH01mJnHmxzQEfn68rb3bGFaYPEn9EP+maQijv6
      SsUM9A3lAoGBAODRDRn4gEIxjPICp6aawRrMDlRc+k6IWDF7wudjxJlaxFr2t7FF
      rFYm+jrcG6qMTyq+teR8uHpcKm9X8ax0L6N6gw5rVzIeIOGma/ZuYIYXX2XJx5SW
      SOas1xW6qEIbOMv+Xu9w2SWbhTgyRmtlxxjr2e7gQLz9z/vuTReJpInnAoGBAMMW
      sq5lqUfAQzqxlhTobQ7tnB48rUQvkGPE92SlDj2TUt9phek2/TgRJT6mdcozvimt
      JPhxKg3ioxG8NPmN0EytjpSiKqlxS1R2po0fb75vputfpw16Z8/2Vik+xYqNMTLo
      SpeVkHu7fbtNYEK2qcU44OyOZ/V+5Oo9TuBIFRhHAoGACkqHhwDRHjaWdR2Z/w5m
      eIuOvF3lN2MWZm175ouynDKDeoaAsiS2VttB6R/aRFxX42UHfoYXC8LcTmyAK5zF
      8X3SMf7H5wtqBepQVt+Gm5zGSSqLcEnQ3H5c+impOh105CGoxt0rk4Ui/AeRIalv
      C70AJOcvD3eu5aFq9gDe/1ECgYBAhkVbASzYGnMh+pKVH7rScSxto8v6/XBYT1Ez
      7JOlMhD667/qvtFJtgIHkq7qzepbhnTv5x3tscQVnZY34/u9ILpD1s8dc+dibEvx
      6S/gYLVorB5ois/DLMqaobRcew6Gs+XX9RPwmLahOJpZ9mh4XrOmCgPAYtP71YM9
      ExpHCQKBgQCMMDDWGMRdFMJgXbx1uMere7OoniBdZaOexjbglRh1rMVSXqzBoU8+
      yhEuHGAsHGWQdSBHnqRe9O0Bj/Vlw2VVEaJeL1ewRHb+jXSnuKclZOJgMsJAvgGm
      SOWIahDrATA4g1T6yLBWQPhj3ZXD3eCMxT1Q3DvpG1DjgvXwmXQJAA==
      -----END RSA PRIVATE KEY-----
    cipher_suites: TLS_RSA_WITH_RC4_128_SHA:TLS_RSA_WITH_AES_128_CBC_SHA
    status:
      user: router_user
      password: router_password
  login:
    logout:
      redirect:
        parameter:
          disable: false
  uaa:
    admin:
      client_secret: admin_secret
    batch:
      username: batch_username
      password: batch_password
    cc:
      client_secret: cc_client_secret
    clients:
      app-direct:
        secret: app-direct_secret
      developer_console:
        secret: developer_console_secret
      login:
        secret: login_client_secret
      notifications:
        secret: notification_secret
      doppler:
        secret: doppler_secret
      cloud_controller_username_lookup:
        secret: cloud_controller_username_lookup_secret
      gorouter:
        secret: gorouter_secret
    jwt:
      verification_key: vk
      signing_key: sk
    scim:
      users:
      - admin|fakepassword|scim.write,scim.read,openid,cloud_controller.admin,doppler.firehose

# code_snippet cf-stub-vsphere end
# The previous line helps maintain current documentation at http://docs.cloudfoundry.org.




Thank you

Ahmed





From: CF Runtime <cfruntime(a)gmail.com>
Reply-To: "Discussions about the Cloud Foundry BOSH project." <cf-bosh(a)lists.cloudfoundry.org>
Date: Wednesday, June 3, 2015 at 5:40 PM
To: "cf-bosh(a)lists.cloudfoundry.org" <cf-bosh(a)lists.cloudfoundry.org>
Subject: Re: [cf-bosh] cf-stub.yml example with minimum or required info

Hi Ali,

We try to keep those docs up to date, but it is possible they are
missing some pieces.

Can you tell me what errors you are getting?

Joseph Palermo
CF Runtime Team

_______________________________________________
cf-bosh mailing list
cf-bosh(a)lists.cloudfoundry.org
https://lists.cloudfoundry.org/mailman/listinfo/cf-bosh

