diego error: failed to initialize container --where to look?
Tom Sherrod <tom.sherrod@...>
After upgrading to or deploying Diego successfully in multiple environments, I'm a bit stumped by an error on a new install.

A cf docker-push results in:

2015-09-26T16:02:01.65+0000 [API/0] OUT Updated app with guid 7aa5800b-96d3-48a0-b707-1d85680e10c9 ({"state"=>"STARTED"})
2015-09-26T16:02:01.74+0000 [API/0] ERR Failed to stage application: insufficient resources

I backed up a step and Diego-enabled an app. It fails on start of the app:

2015-09-26T16:14:26.38+0000 [API/0] OUT App instance exited with guid be3f6daa-96d1-41a3-8a3a-2d0ff697f8c4 payload: {"instance"=>"8d1947c6-84cb-4144-4a97-9038a7978ebf", "index"=>0, "reason"=>"CRASHED", "exit_description"=>"failed to initialize container", "crash_count"=>3, "crash_timestamp"=>1443284066323748334, "version"=>"b5a0a268-82f7-445d-937d-4349415b2a70"}

I'm betting it is one machine not being able to talk to another. Which log might provide additional details on what's failing?

Thanks,
Tom
|
|
Re: postgres out of disk space
Aleksey Zalesov
It would be handy to have a configuration option for CC to prune unused droplets after, for example, 7 days of inactivity.
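To make the suggestion concrete, here is a minimal sketch of what "prune after 7 days of inactivity" could mean for a directory-backed blobstore. It is not how the Cloud Controller works and no such option is described in this thread; the function name `find_stale_files` is hypothetical, and file modification time is used as a stand-in for "inactivity", which is an assumption.

```python
import time
from pathlib import Path


def find_stale_files(root, max_age_days=7, now=None):
    """Return files under `root` whose modification time is older than the cutoff.

    Hypothetical helper: mtime is only a rough proxy for "unused".
    """
    now = time.time() if now is None else now
    cutoff = now - max_age_days * 24 * 60 * 60
    stale = []
    for path in Path(root).rglob("*"):
        # Only regular files are candidates; directories are left alone.
        if path.is_file() and path.stat().st_mtime < cutoff:
            stale.append(path)
    return sorted(stale)
```

The sketch only lists candidates and deletes nothing, since removing droplets that running apps still depend on would break restarts.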
|
|
Re: postgres out of disk space
CF Runtime
cf-release v213 fixed a bug where buildpack caches were not getting cleaned up properly. Are you running a version prior to 213?

If so, you might need to go into the `shared/run.10.10.2.37.xip.io-cc-droplets` directory and delete the `buildpack_cache` subdirectory. Deleting the contents of buildpack_cache will not cause failures in the system, the caches will simply get generated from scratch the next time apps stage.

If you upgrade to 213 or later, you can just call the api endpoint to clean out the buildpack cache. You can find the details in the v213 release notes: https://github.com/cloudfoundry/cf-release/releases/tag/v213

Joseph & Natalie
CF Release Integration Team

On Fri, Sep 25, 2015 at 11:53 AM, Matthias Ender <Matthias.Ender(a)sas.com> wrote:
ah, it’s not postgres:
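For readers who want to script the cleanup call mentioned above, here is a small sketch that only builds the HTTP request. The message does not name the endpoint; the path `/v2/blobstores/buildpack_cache` is my understanding of the v2 API and should be checked against the v213 release notes before use. The API URL and the function name are placeholders.

```python
import urllib.request


def build_cache_purge_request(api_url, authorization):
    """Build (but do not send) a DELETE request for the buildpack cache.

    `authorization` is the full Authorization header value, for example the
    output of `cf oauth-token`. The endpoint path is an assumption to verify.
    """
    url = api_url.rstrip("/") + "/v2/blobstores/buildpack_cache"
    return urllib.request.Request(
        url,
        method="DELETE",
        headers={"Authorization": authorization},
    )
```

Sending it would be a matter of passing the request to `urllib.request.urlopen` with an admin token. From the CLI, `cf curl -X DELETE /v2/blobstores/buildpack_cache` should be the equivalent, under the same assumption about the path.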
|
|
Re: postgres out of disk space
Matthew Sykes <matthew.sykes@...>
Droplets and app packages should be pruned when the applications they're associated with are deleted. Only one or two versions of the droplet are ever held, and only one copy of the application bits.

There's another blob store that holds application resources for the resource-matching flow when uploading an app. Those resources are not pruned, so you should make sure that you disable that in your manifest if you don't want that behavior.

The droplet blob store is clearly your issue - not events.

On Fri, Sep 25, 2015 at 2:53 PM, Matthias Ender <Matthias.Ender(a)sas.com> wrote:
ah, it’s not postgres:

--
Matthew Sykes
matthew.sykes(a)gmail.com
|
|
Re: postgres out of disk space
Matthias Ender <Matthias.Ender@...>
ah, it’s not postgres:

90M postgres

it’s actually:

94G shared/run.10.10.2.37.xip.io-cc-droplets

Does it keep a copy of each artifact ever pushed? Or is this part of the event storage, controlled by these properties:

app_events:
  cutoff_age_in_days: 31
app_usage_events:
  cutoff_age_in_days: 31
audit_events:
  cutoff_age_in_days: 31

??

From: Aleksey Zalesov [mailto:aleksey.zalesov(a)altoros.com]
Sent: Friday, September 25, 2015 12:23 PM
To: Discussions about Cloud Foundry projects and the system overall. <cf-dev(a)lists.cloudfoundry.org>
Subject: [cf-dev] Re: postgres out of disk space

Does your database occupy this 100 GB? Or something else like logs?

Aleksey Zalesov | CloudFoundry Engineer | Altoros
Tel: (617) 841-2121 ext. 5707 | Toll free: 855-ALTOROS
Fax: (866) 201-3646 | Skype: aleksey_zalesov
www.altoros.com | blog.altoros.com | twitter.com/altoros

On 25 Sep 2015, at 14:22, Matthias Ender <Matthias.Ender(a)sas.com> wrote:

I have a cf-aws-tiny cf-boshrelease deployment, and it’s been running well for over 4 months. We have about 40 apps, with a couple of dozen cf pushes each day. Yesterday pushing apps became spotty and then impossible, with various errors. Turned out the 100GB disk for the postgres instance on the data node was full. I increased the disk size and things are running again. But – what happened there? 100G and growing seems like an awfully large database for rather modest use. And I’m worried it’ll just happen again in a few months.

thanks,
Matthias
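The per-directory breakdown above is the step that located the problem. As a rough sketch, the same kind of check can be scripted as below. The function name `directory_sizes` is hypothetical, and it sums apparent file sizes rather than allocated disk blocks, so numbers can differ somewhat from what `du` reports.

```python
import os


def directory_sizes(root):
    """Return (name, bytes) for each immediate child of `root`, largest first."""
    sizes = []
    for entry in os.scandir(root):
        if entry.is_dir(follow_symlinks=False):
            total = 0
            # Walk the subtree and add up regular file sizes, skipping symlinks.
            for dirpath, _dirnames, filenames in os.walk(entry.path):
                for name in filenames:
                    path = os.path.join(dirpath, name)
                    if not os.path.islink(path):
                        total += os.path.getsize(path)
        elif entry.is_file(follow_symlinks=False):
            total = entry.stat(follow_symlinks=False).st_size
        else:
            continue
        sizes.append((entry.name, total))
    return sorted(sizes, key=lambda item: item[1], reverse=True)
```

Run against the persistent disk mount point, the first entry in the result is the directory to investigate.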
|
|
Re: Environment variable changes in DIEGO
Mike Heath
By the logic that the `application_uris` data are broken, VCAP_SERVICES is also broken and should be removed, as it doesn't reflect service binding changes.

In my (strong) opinion it doesn't make any sense to run an application in a dynamic environment and NOT expose its route(s). This has a lot of implications. For example, an application running on Diego can no longer register itself with a dynamic service registry, since the information needed to route to the application is totally obscured (e.g. Spring Cloud's integration with Eureka).

It makes more sense to me to match the behavior of service bind/unbind by notifying the user at route map/unmap time that they need to restart their app for it to see the changes.

-Mike

On Wed, Sep 16, 2015 at 9:09 AM Matthew Sykes <matthew.sykes(a)gmail.com> wrote:
The changes, in general, were intentional. The `application_uris` data was
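To illustrate what is at stake in Mike's message, here is a minimal sketch of how an application might read its routes from `VCAP_APPLICATION` while tolerating their absence. The exact shape of the data on Diego is not described in this excerpt, so treating the key as optional is an assumption, and the function name is hypothetical.

```python
import json
import os


def application_uris(environ=None):
    """Return the routes listed in VCAP_APPLICATION, or [] if unavailable."""
    environ = os.environ if environ is None else environ
    raw = environ.get("VCAP_APPLICATION")
    if not raw:
        return []
    try:
        data = json.loads(raw)
    except ValueError:
        return []
    if not isinstance(data, dict):
        return []
    # Fall back to the older "uris" key if "application_uris" is missing.
    return list(data.get("application_uris") or data.get("uris") or [])
```

An app that registers itself with a service registry would have nothing to register when this returns an empty list, which is the situation Mike objects to.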
|
|
Re: Running/Deploying UAA as a standalone service (not within CloudFoundry environment)
Filip Hanik
hi TJ,
Because the UAA supports multi-tenancy and denotes a tenant by a subdomain, we need to know what the base URL is. By default it is localhost, so

  http://tenant1.localhost:8080/uaa
  http://tenant2.localhost:8080/uaa

would support the 'default' zone at localhost, and the tenant1.localhost and tenant2.localhost tenants.

You can add more default hostnames (in addition to localhost) by adding them to a YAML configuration file:

  zones:
    internal:
      hostnames:
        - example.com
        - test.com

and this would now also support

  example.com (default tenant/zone)
  tenant1.example.com (tenant1)
  tenant2.example.com (tenant2)
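The hostname-to-tenant mapping described above can be sketched as follows. This is an illustration of the rule as stated in this message, not the UAA's actual implementation; the function name `resolve_zone` and the literal return value "default" are assumptions, and IPv6 host headers are not handled.

```python
def resolve_zone(host_header, base_hostnames=("localhost",)):
    """Map a Host header to a zone name using the subdomain rule.

    Returns "default" for a bare base hostname, the subdomain for
    <tenant>.<base>, and None when the host matches no base hostname.
    """
    # Drop an optional port and normalise case.
    host = host_header.split(":", 1)[0].lower().rstrip(".")
    bases = [base.lower() for base in base_hostnames]
    if host in bases:
        return "default"
    # Try longer base names first so the most specific one wins.
    for base in sorted(bases, key=len, reverse=True):
        suffix = "." + base
        if host.endswith(suffix):
            return host[: -len(suffix)]
    return None
```

With the hostnames from the YAML example, `tenant1.example.com` maps to `tenant1`. A host outside the configured list maps to nothing, which may be what happens when the UAA is reached by a hostname it has not been told about.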
On Fri, Sep 25, 2015 at 10:20 AM, TJ Brown <tjbrown(a)gmail.com> wrote:
I'm trying to evaluate UAA to be used as a user authentication /
|
|
Re: postgres out of disk space
Aleksey Zalesov
Does your database occupy this 100 GB? Or something else like logs?
Aleksey Zalesov | CloudFoundry Engineer | Altoros Tel: (617) 841-2121 ext. 5707 | Toll free: 855-ALTOROS Fax: (866) 201-3646 | Skype: aleksey_zalesov www.altoros.com <http://www.altoros.com/> | blog.altoros.com <http://blog.altoros.com/> | twitter.com/altoros <http://twitter.com/altoros>
|
|
Re: Running/Deploying UAA as a standalone service (not within CloudFoundry environment)
Frans Thamura
We modified the UAA (Spring Security) to work as an independent OAuth2 server.
Take a look at www.merv.id; the code is at github.com/meruvian/yama

F

--
Frans Thamura (曽志胜)
Java Champion
Shadow Master and Lead Investor
Meruvian.
Integrated Hypermedia Java Solution Provider.

Mobile: +628557888699
Blog: http://blogs.mervpolis.com/roller/flatburger (id)
FB: http://www.facebook.com/meruvian
TW: http://www.twitter.com/meruvian / @meruvian
Website: http://www.meruvian.org

"We grow because we share the same belief."
On Fri, Sep 25, 2015 at 11:20 PM, TJ Brown <tjbrown(a)gmail.com> wrote:
I'm trying to evaluate UAA to be used as a user authentication /
|
|
Running/Deploying UAA as a standalone service (not within CloudFoundry environment)
TJ Brown
I'm trying to evaluate UAA as a user authentication / authorization service within a microservice architecture, but not within a CloudFoundry environment. When running UAA using

$ ./gradlew run

the application seems to work locally but I can't access it externally. So, what do I need to configure to allow access from other hosts?

Also, are there instructions / guides / tutorials for how to deploy UAA on an existing Tomcat server?

Thanks for your help.
|
|
Re: F5 Load Balancer Configuration for Cloud Foundry Loggregator
Anthony
Thanks for all the responses. We ended up finding an f5 device that is on 11.5. With that, things worked out of the box with only the irule to insert x-forwarded-proto. Everything is working over 443.
Regards, Anthony
On Sep 22, 2015, at 11:35 PM, Mike Youngstrom <youngm(a)gmail.com> wrote:
If you are sharing a vip for http and websocket then 443 would be correct. But Anthony, you can try creating a layer 4 virtual server on 4443 that goes to the same pool on the back end and configure the CC to use that port instead for loggregator connections.

Mike

On Tue, Sep 22, 2015 at 10:32 PM, Johannes Hiemer <jvhiemer(a)gmail.com> wrote:
|
|
Re: postgres out of disk space
Matthew Sykes <matthew.sykes@...>
Just a thought: How large is your events table? Is it getting pruned at the correct interval for you? I believe events, app usage events, and audit events are pruned when they're more than 31 days old by default. You could change the cc properties in the deployment manifest to make that a little more aggressive if that's the problem.

On Fri, Sep 25, 2015 at 7:22 AM, Matthias Ender <Matthias.Ender(a)sas.com> wrote:
I have a cf-aws-tiny cf-boshrelease deployment, and it’s been running well

--
Matthew Sykes
matthew.sykes(a)gmail.com
|
|
Re: Running the app test suite within the CATs, and the admin_buildpack_lifecycle_test is failing
JT Archie <jarchie@...>
Jordan,
Try the CATs with the deployment, just to be sure, but that being said...

It is also possible that your deployment does not have all the buildpacks installed. If you run the command `cf buildpacks`, it returns a list of the buildpacks uploaded to your CF deployment. If one is missing, that is the likely cause.

Cheers,
JT and Amin
On Thu, Sep 24, 2015 at 6:17 PM, CF Runtime <cfruntime(a)gmail.com> wrote:
You can also check out the v208 tag of cf-release, then run the
|
|
Re: Instance crashing after running once. Error: "reason"=>"CRASHED", "exit_status"=>0, "exit_description"=>"app instance exited"
Daniel Mikusa
On Fri, Sep 25, 2015 at 6:11 AM, zooba Sir <myfakename90(a)gmail.com> wrote:
> I've pushed the app which uses redis service (A sample app which simply

Does it start and send / receive a set number of messages or does it do this forever?

> Instance is created at start and after successfully running once the

Exit code 0 generally means your application shut down cleanly, i.e. it finished what it was doing and exited. If that's unexpected, I would suggest that you increase the log level of the application to get more details.

> And after sometime another instance getting created automatically,

The system is restarting your application because it exited. The expectation of an application that runs on CF is that it won't exit, ever. Because of this, if CF sees that one of your applications has stopped, it assumes the app has "crashed" and will automatically try to restart it. That's what's happening here. Your app has exited, CF sees that and it's helpfully trying to restart your app.

Dan
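Dan's point is that a process on CF is expected to keep running. As a language-neutral illustration (the app in question is a Spring Boot jar, so this Python sketch shows only the pattern, not a fix for that app), a worker can loop until it is told to stop instead of returning after one unit of work. The function name and parameters are hypothetical.

```python
import threading


def run_until_stopped(do_work, stop_event, interval_seconds=1.0):
    """Call `do_work` repeatedly until `stop_event` is set.

    Returns the number of times `do_work` was called.
    """
    iterations = 0
    while not stop_event.is_set():
        do_work()
        iterations += 1
        # wait() returns early once the event is set, so shutdown is prompt.
        stop_event.wait(interval_seconds)
    return iterations
```

In a real process the stop event would typically be set from a SIGTERM handler, so the worker exits only when the platform asks it to.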
|
|
postgres out of disk space
Matthias Ender <Matthias.Ender@...>
I have a cf-aws-tiny cf-boshrelease deployment, and it's been running well for over 4 months.
We have about 40 apps, with a couple of dozen cf pushes each day.

Yesterday pushing apps became spotty and then impossible, with various errors. Turned out the 100GB disk for the postgres instance on the data node was full. I increased the disk size and things are running again.

But - what happened there? 100G and growing seems like an awfully large database for rather modest use. And I'm worried it'll just happen again in a few months.

thanks,
Matthias
|
|
Instance crashing after running once. Error: "reason"=>"CRASHED", "exit_status"=>0, "exit_description"=>"app instance exited"
Zuba Al <myfakename90@...>
I've pushed an app that uses a Redis service (a sample app that simply sends and receives a message through the Redis service). An instance is created at start, and after successfully running once the instance crashes with the error: "reason"=>"CRASHED", "exit_status"=>0, "exit_description"=>"app instance exited". After some time another instance is created automatically, runs successfully once, and crashes with the logs below. This goes on for some time.
my manifest.yml:

name: RedisApp
no-route: true
memory: 512M
random-route: true
instances: 1
path: target/gs-messaging-redis-0.1.0.jar
services:
- redislite

cf logs RedisApp command output:

2015-09-25T11:58:51.85+0200 [DEA/1] OUT Starting app instance (index 0) with guid aec41933-ef0c-4d5b-8e67-da6729ca3005
2015-09-25T11:58:56.58+0200 [App/0] OUT
2015-09-25T11:58:56.58+0200 [App/0] OUT . ____ _ __ _ _
2015-09-25T11:58:56.58+0200 [App/0] OUT /\\ / ___'_ __ _ _(_)_ __ __ _ \ \ \ \
2015-09-25T11:58:56.58+0200 [App/0] OUT ( ( )\___ | '_ | '_| | '_ \/ _` | \ \ \ \
2015-09-25T11:58:56.58+0200 [App/0] OUT \\/ ___)| |_)| | | | | || (_| | ) ) ) )
2015-09-25T11:58:56.58+0200 [App/0] OUT ' |____| .__|_| |_|_| |_\__, | / / / /
2015-09-25T11:58:56.58+0200 [App/0] OUT =========|_|==============|___/=/_/_/_/
2015-09-25T11:58:56.58+0200 [App/0] OUT :: Spring Boot :: (v1.2.6.RELEASE)
2015-09-25T11:58:56.69+0200 [App/0] OUT 2015-09-25 09:58:56.690 INFO 29 --- [ main] pertySourceApplicationContextInitializer : Adding 'cloud' PropertySource to ApplicationContext
2015-09-25T11:58:56.78+0200 [App/0] OUT 2015-09-25 09:58:56.786 INFO 29 --- [ main] nfigurationApplicationContextInitializer : Adding cloud service auto-reconfiguration to ApplicationContext
2015-09-25T11:58:56.80+0200 [App/0] OUT 2015-09-25 09:58:56.804 INFO 29 --- [ main] hello.Application : Starting Application on 18venf3o9v7 with PID 29 (/home/vcap/app started by vcap in /home/vcap/app)
2015-09-25T11:58:56.86+0200 [App/0] OUT 2015-09-25 09:58:56.869 INFO 29 --- [ main] s.c.a.AnnotationConfigApplicationContext : Refreshing org.springframework.context.annotation.AnnotationConfigApplicationContext@6d2ca421: startup date [Fri Sep 25 09:58:56 UTC 2015]; root of context hierarchy
2015-09-25T11:58:57.22+0200 [App/0] OUT 2015-09-25 09:58:57.223 WARN 29 --- [ main] .i.s.PathMatchingResourcePatternResolver : Skipping [/home/vcap/app/.java-buildpack/spring_auto_reconfiguration/spring_auto_reconfiguration-1.10.0_RELEASE.jar] because it does not denote a directory
2015-09-25T11:58:57.78+0200 [App/0] OUT 2015-09-25 09:58:57.780 INFO 29 --- [ main] urceCloudServiceBeanFactoryPostProcessor : Auto-reconfiguring beans of type javax.sql.DataSource
2015-09-25T11:58:57.79+0200 [App/0] OUT 2015-09-25 09:58:57.789 INFO 29 --- [ main] urceCloudServiceBeanFactoryPostProcessor : No beans of type javax.sql.DataSource found. Skipping auto-reconfiguration.
2015-09-25T11:58:57.79+0200 [App/0] OUT 2015-09-25 09:58:57.794 INFO 29 --- [ main] edisCloudServiceBeanFactoryPostProcessor : Auto-reconfiguring beans of type org.springframework.data.redis.connection.RedisConnectionFactory
2015-09-25T11:58:57.90+0200 [App/0] OUT 2015-09-25 09:58:57.905 INFO 29 --- [ main] edisCloudServiceBeanFactoryPostProcessor : Reconfigured bean redisConnectionFactory into singleton service connector org.springframework.data.redis.connection.jedis.JedisConnectionFactory@74ca9fd4
2015-09-25T11:58:58.29+0200 [App/0] OUT 2015-09-25 09:58:58.298 INFO 29 --- [ main] o.s.j.e.a.AnnotationMBeanExporter : Registering beans for JMX exposure on startup
2015-09-25T11:58:58.30+0200 [App/0] OUT 2015-09-25 09:58:58.308 INFO 29 --- [ main] o.s.c.support.DefaultLifecycleProcessor : Starting beans in phase 2147483647
2015-09-25T11:58:58.37+0200 [App/0] OUT 2015-09-25 09:58:58.375 INFO 29 --- [ main] hello.Application : Started Application in 2.491 seconds (JVM running for 3.401)
2015-09-25T11:58:58.37+0200 [App/0] OUT 2015-09-25 09:58:58.376 INFO 29 --- [ main] hello.Application : Sending message...
2015-09-25T11:58:58.39+0200 [App/0] OUT 2015-09-25 09:58:58.398 INFO 29 --- [ container-2] hello.Receiver : Received <Hello from Redis!>
2015-09-25T11:58:58.40+0200 [App/0] OUT 2015-09-25 09:58:58.400 INFO 29 --- [ Thread-2] s.c.a.AnnotationConfigApplicationContext : Closing org.springframework.context.annotation.AnnotationConfigApplicationContext@6d2ca421: startup date [Fri Sep 25 09:58:56 UTC 2015]; root of context hierarchy
2015-09-25T11:58:58.40+0200 [App/0] OUT 2015-09-25 09:58:58.401 INFO 29 --- [ Thread-2] o.s.c.support.DefaultLifecycleProcessor : Stopping beans in phase 2147483647
2015-09-25T11:58:58.40+0200 [App/0] OUT 2015-09-25 09:58:58.404 INFO 29 --- [ Thread-2] o.s.j.e.a.AnnotationMBeanExporter : Unregistering JMX-exposed beans on shutdown
2015-09-25T11:58:58.44+0200 [App/0] ERR
2015-09-25T11:58:58.49+0200 [API/0] OUT App instance exited with guid aec41933-ef0c-4d5b-8e67-da6729ca3005 payload: {"cc_partition"=>"default", "droplet"=>"aec41933-ef0c-4d5b-8e67-da6729ca3005", "version"=>"8985cc3d-6aa2-4e34-a11c-f64289aeace3", "instance"=>"0147066c534c4d3bb879fffd2c149529", "index"=>0, "reason"=>"CRASHED", "exit_status"=>0, "exit_description"=>"app instance exited", "crash_timestamp"=>1443175138}
|
|
Re: Proposal: Decomposing cf-release and Extracting Deployment Strategies
Mike Youngstrom <youngm@...>
Sounds good. Thanks for taking the time to discuss this with me.
Mike
On Mon, Sep 21, 2015 at 7:24 PM, Amit Gupta <agupta(a)pivotal.io> wrote:
This forces us to spread all clusterable nodes across 2 deploys and
certain jobs, like CC, use the job_name+index to uniquely identify a node
|
|
Re: Running the app test suite within the CATs, and the admin_buildpack_lifecycle_test is failing
CF Runtime
You can also check out the v208 tag of cf-release, then run the acceptance-tests from src/github.com/cloudfoundry/cf-acceptance-tests

Joseph
CF Release Integration Team

On Thu, Sep 24, 2015 at 2:53 PM, Christopher Piraino <cpiraino(a)pivotal.io> wrote:
Jordan,
|
|
Re: Running the app test suite within the CATs, and the admin_buildpack_lifecycle_test is failing
Christopher Piraino <cpiraino@...>
Jordan,
The Cloud Foundry bosh release comes with an errand called "acceptance_tests" that contains the version of CATs which that version of CF was tested with. You can run these by doing "bosh run errand acceptance_tests".

There are also some manifest properties that you might need to set for the CATs to run correctly. The list of all possible properties for the acceptance_tests errand can be found here: https://github.com/cloudfoundry/cf-release/blob/develop/jobs/acceptance-tests/spec

- Chris Piraino

On Thu, Sep 24, 2015 at 11:27 AM, Jordan Collier <jordanicollier(a)gmail.com> wrote:
I was unclear on what I am asking, the real question is as follows:
|
|
Re: Environment variables with special characters not handled correctly?
Jonas Rosland
Hi Daniel and Dieu,
After much trial and error I finally got it working. I created a user-provided service and then called on it from my application.

I've documented the steps for anyone else wanting to know how to work with these variables (clearer documentation with examples maybe?). Here's the documentation and example application: https://gist.github.com/jonasrosland/08b5758eaa9098a81cf8

Thanks for all your help!

Best regards,
Jonas Rosland
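For readers who do not follow the link, the general shape of the approach can be sketched like this: credentials of a user-provided service arrive as JSON in `VCAP_SERVICES`, so values with special characters do not pass through shell quoting. This is not Jonas's code; the service name `my-secrets` and the function name are hypothetical.

```python
import json
import os


def user_provided_credentials(service_name, environ=None):
    """Return the credentials dict of a user-provided service instance."""
    environ = os.environ if environ is None else environ
    services = json.loads(environ.get("VCAP_SERVICES", "{}"))
    # User-provided instances are grouped under the "user-provided" label.
    for instance in services.get("user-provided", []):
        if instance.get("name") == service_name:
            return instance.get("credentials", {})
    raise KeyError(service_name)
```

Such a service would typically be created with `cf create-user-provided-service my-secrets -p '{"password":"..."}'` and bound to the app before it is restarted.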
|
|