Operations
Google Cloud’s operations suite (formerly Stackdriver) is designed to monitor, troubleshoot, and improve cloud infrastructure, software, and application performance. Efficiently build and run workloads, keeping applications performant and available.
- Collect signals across Google Cloud internal and external apps, platforms, and services
- Analyze and monitor your operational telemetry
- Set up appropriate performance and availability indicators
- Use built-in observability to troubleshoot and improve your applications
- Automate ops using both out-of-the-box tools and tools customized through programmatic interfaces
Key features
For Ops, SecOps, SRE, DevOps
Cloud Logging is a fully managed service that performs at scale and can ingest application and system log data, as well as custom log data from thousands of VMs. Cloud Logging allows you to analyze and export selected logs to long-term storage in real time.
For Ops, SRE, DevOps
Cloud Monitoring provides visibility into the performance, uptime, and overall health of cloud-powered applications. Collect metrics, events, and metadata from Google Cloud services, hosted uptime probes, application instrumentation, and a variety of common application components.
For DevOps
Application Performance Management (APM) includes tools to help you reduce latency and cost, so you can run more efficient applications. With Cloud Trace, Cloud Debugger, and Cloud Profiler, you gain insight into how your code and services are functioning and troubleshoot if needed.
Customer stories
Highlights
-
Reduced time spent on infrastructure management using Cloud Monitoring
-
Increased productivity by managing all operations products in one centralized platform
-
Saved as much as 80% of time that would otherwise be spent troubleshooting
Partner
What’s new
Documentation
Get started with Cloud Logging
Guides and set-up docs to help you get up and running with Cloud Logging.
Cloud Audit Logs
Learn how Cloud Audit Logs maintains three audit logs: admin activity, data access, and system event.
Get started with Cloud Monitoring
Learn about Workspaces, monitoring agent, uptime checks, and other features.
Google Cloud metrics
See which metrics Cloud Monitoring supports.
Monitoring and logging support for GKE
Learn about Google Kubernetes Engine’s native integration with Cloud Monitoring and Cloud Logging.
Common use cases
Manage cloud operations
Build observability into your platform through the use of integrated logging, monitoring, and application performance management tools.
Centralize your logging and operations
Integrated logging provides critical insights into platform events for development, DevOps/SRE, and security teams. Ingest logs from Google Cloud services and external sources for short-term operations and long-term log analysis. Use integrated audit logging to perform detailed forensic analysis. Integrate with your third-party logging systems using real-time log exports.
Cloud Logging collects all logs, including audit logs, platform logs, user logs, and external logs sent to the API, which are sent to the Logs Router where they are delivered to Cloud Logging, BigQuery, or externally via integration with Pub/Sub.
Build observability into applications and infrastructure
Integrated observability provides critical insights into platform events for development, DevOps/SRE, and security teams. Cloud Monitoring provides centralized dashboards and alerting to efficiently operate services. Use integrated logging to power vulnerability detection and bring proactive intelligent monitoring to your security and operations team. Customize your log monitoring using Cloud Functions and the Data Loss Protection API.
Cloud Logging and Cloud Monitoring services provide your SRE/DevOps teams with the observability needed to monitor Google Cloud, on-premises, and third-party providers. Logging and Monitoring are integrated with Security Command Center to provide the security and operations teams the insights they need.
BindPlane is a registered trademark of Blue Medora.
Reduce latency and inefficiency with Application Performance Management
Reduce latency and cost for your applications by using Application Performance Management tools. By understanding in detail how they behave in production, you can help make your applications faster and more reliable whether they are hosted on Google Cloud or not. Use Cloud Trace’s distributed trace to understand how requests propagate through your application. Use Cloud Profiler to help identify latency and inefficiency in your code. Troubleshoot your application in production without stopping or slowing down your apps by using Cloud Debugger.
All features
| Log management | Logs Router allows customers to control where logs are sent. All logs, including audit logs, platform logs, and user logs, are sent to the Cloud Logging API where they pass through the log router. The log router checks each log entry against existing rules to determine which log entries to discard, which to ingest, and which to include in exports. |
|---|---|
| Log insights | Error Reporting analyzes and aggregates the errors in your cloud applications. Notifies you when new errors are detected. |
| Proactive monitoring | Cloud Monitoring allows you to create alerting policies to notify you when metrics, health check results, and uptime check results meet specified criteria. Integrated with a wide variety of notification channels, including Slack and PagerDuty. |
| Custom visualization | Cloud Monitoring Dashboards provides default out-of-the-box dashboards and allows you to define custom dashboards with powerful visualization tools to suit your needs. |
| Health check monitoring | Cloud Monitoring provides endpoint checks to web applications and other internet-accessible services running on your cloud environment. You can configure uptime checks associated with URLs, groups, or resources, such as instances and load balancers. |
| Service monitoring | Service Monitoring provides out-of-the-box telemetry and dashboards that allow troubleshooting in context through topology and context graphs, plus automation of health monitoring through SLOs and error budget management. |
| Latency management | Cloud Trace provides latency sampling and reporting for App Engine, including per-URL statistics and latency distributions. |
| Debugging | Cloud Debugger connects your application’s production data to your source code by inspecting the state of your application at any code location in production without stopping or slowing down your requests. |
| Performance and cost management | Cloud Profiler provides continuous profiling of resource consumption in your production applications, helping you identify and eliminate potential performance issues. |
| Security management | Cloud Audit Logs provides near real-time user activity visibility across Google Cloud. |
Pricing
Control your own usage and spending: pay only for what you use. Free usage allotments let you get started with no up-front fees or commitments.
Free usage allotments let you get started with no up-front fees or commitments.
View pricing detailsPartners
Get support from a rich and growing ecosystem of technology integrations to expand the IT ops, security, and compliance capabilities available to Google Cloud customers.
Take the next step
Get $300 in free credits to learn and build on Google Cloud for up to 12 months.