Re: add Diego into our monitoring system
Matthew Sykes <matthew.sykes@...>
There are dozens of metrics emitted by diego but I don't know of any
documentation for them in the open source repositories.
You can find most of them with a quick search of the diego-release
submodules under `src/github.com/cloudfoundry-incubator` with a pattern
like `metric\..*\(`. You will see metrics like `CrashedActualLRPs`,
`LRPsMissing`, and `LRPsExtra` in there.
toggle quoted message
Show quoted text
documentation for them in the open source repositories.
You can find most of them with a quick search of the diego-release
submodules under `src/github.com/cloudfoundry-incubator` with a pattern
like `metric\..*\(`. You will see metrics like `CrashedActualLRPs`,
`LRPsMissing`, and `LRPsExtra` in there.
On Thu, Mar 3, 2016 at 8:35 PM, Liu Rui <ibmmarmot(a)gmail.com> wrote:
Hello,
We need to add Diego into our current monitoring system. BOSH still can be
used. The previous Varz information as following from HM9000 is very useful
to us. Is there any substitute for it in Diego?
{
"name": "HM9000",
"numCPUS": 4,
"numGoRoutines": 82,
"memoryStats": {
"..."
},
"tags": {
"ip": "..."
},
"contexts": [
{
"name": "HM9000",
"metrics": [
{
"name": "StartEvacuating",
"value": 643631
},
{
"name": "StopEvacuationComplete",
"value": 564223
},
{
"name": "DesiredStateSyncTimeInMilliseconds",
"value": 1576.52033
},
{
"name": "ActualStateListenerStoreUsagePercentage",
"value": 4.43904
},
{
"name": "StartCrashed",
"value": 3396363
},
{
"name": "StartMissing",
"value": 1519145
},
{
"name": "StopDuplicate",
"value": 45593
},
{
"name": "StopExtra",
"value": 2040903
},
{
"name": "SavedHeartbeats",
"value": 14958857
},
{
"name": "ReceivedHeartbeats",
"value": 14958857
},
{
"name": "NumberOfAppsWithAllInstancesReporting",
"value": 66841
},
{
"name": "NumberOfAppsWithMissingInstances",
"value": 10
},
{
"name": "NumberOfUndesiredRunningApps",
"value": 7
},
{
"name": "NumberOfRunningInstances",
"value": 71574
},
{
"name": "NumberOfMissingIndices",
"value": 12
},
{
"name": "NumberOfCrashedInstances",
"value": 2139
},
{
"name": "NumberOfCrashedIndices",
"value": 373
},
{
"name": "NumberOfDesiredApps",
"value": 66851
},
{
"name": "NumberOfDesiredInstances",
"value": 71869
},
{
"name": "NumberOfDesiredAppsPendingStaging",
"value": 9
}
]
}
]
}
--
Matthew Sykes
matthew.sykes(a)gmail.com
Matthew Sykes
matthew.sykes(a)gmail.com