App Engine Modules in Python

App Engine Modules (or just "Modules" hereafter) is a feature that lets developers factor large applications into logical components that can share stateful services and communicate in a secure fashion. An app that handles customer requests might include separate modules to handle other tasks:

API requests from mobile devices
Internal, admin-like requests
Backend processing such as billing pipelines and data analysis

Modules can be configured to have different versions, performance levels, budgets, and authorization.

Application hierarchy
Instance scaling and class
Configuration
Uploading modules
Instance states
Instance uptime
Background threads
Monitoring resource usage
Logging
Communication between modules

Application hierarchy

At the highest level, an App Engine application includes one or more modules. Each module has one or more versions. Every module and version has a name which you assign. Module and version names are alphanumeric. Version names cannot be integers, they must contain at least one non-numeric character. While running, a particular module/version will have one or more instances:

Hierarchy graph of modules/versions/instances

Stateful services (such as Memcache, Datastore, and Task Queues) are shared by all the modules in an application. Every module, version, and instance has its own unique URI (for example, v1.my-module.my-app.appspot.com). Incoming user requests are routed to an instance of a particular module/version according to URL addressing conventions and an optional customized dispatch file.

Please note that in April of 2013, Google stopped issuing SSL certificates for double-wildcard domains hosted at appspot.com (i.e. *.*.appspot.com). If you rely on such URLs for HTTPS access to your application, please change any application logic to use "-dot-" instead of ".". For example, to access version "1" of application "myapp" use "https://1-dot-myapp.appspot.com" instead of "https://1.myapp.appspot.com." If you continue to use "https://1.myapp.appspot.com" the certificate will not match, which will result in an error for any User-Agent that expects the URL and certificate to match exactly.

Instance scaling and class

While an application is running, incoming requests are routed to an existing or new instance of the appropriate module/version. The scaling type of a module/version controls how instances are created. There are three scaling types: manual, basic, and automatic.

Manual Scaling

A module with manual scaling runs continuously, allowing you to perform complex initialization and rely on the state of its memory over time.

Basic Scaling

A module with basic scaling will create an instance when the application receives a request. The instance will be turned down when the app becomes idle. Basic scaling is ideal for work that is intermittent or driven by user activity.

Automatic Scaling

Automatic scaling is the scaling policy that App Engine has used since its inception. It is based on request rate, response latencies, and other application metrics. Previously users could use the Admin Console to configure the automatic scaling parameters (instance class, idle instances and pending latency) for an application's frontend versions only. These settings now apply to every version of every module that has automatic scaling.

Each scaling type offers a selection of instance classes, with different amounts of CPU and Memory. The following tables compare the three types of instance scaling, and the levels of service and cost of the various instance classes:

Scaling Types

Feature	Automatic Scaling	Manual Scaling	Basic Scaling
Deadlines	60-second deadline for HTTP requests, 10-minute deadline for tasks	Requests can run indefinitely. A manually-scaled instance can choose to handle `/_ah/start` and execute a program or script for many hours without returning an HTTP response code.	Same as manual scaling.
CPU/Memory	Configurable by selecting a F1, F2 or F4 instance class	Configurable by selecting a B1, B2, B4, B4_1G, or B8 instance class	Configurable by selecting a B1, B2, B4, B4_1G, or B8 instance class
Residence	Instances are evicted from memory based on usage patterns.	Instances remain in memory, and state is preserved across requests. When instances are restarted, an `/_ah/stop` request appears in the logs. If there is a registered stop callback method, it has 30 seconds to complete before shutdown occurs.	Instances are evicted based on the `‘idle_timeout’` parameter. If an instance has been idle, i.e. has not received a request, for more than `‘idle_timeout’`, then the instance is evicted.
Startup and Shutdown	Instances are created on demand to handle requests and automatically turned down when idle.	Instances are sent a start request automatically by App Engine in the form of an empty GET request to `/_ah/start`. An instance that is stopped with `‘appcfg stop’` (or via the Admin Console UI) has 30 seconds to finish handling requests before it is forcibly terminated.	Instances are created on demand to handle requests and automatically turned down when idle, based on the `‘idle_timeout’` configuration parameter. As with manual scaling, an instance that is stopped with `‘appcfg stop’` (or via the Admin Console UI) has 30 seconds to finish handling requests before it is forcibly terminated.
Instance Addressability	Instances are anonymous.	Instances are addressable at URLs with the form: `http://instance.version.module.app_id.appspot.com`. If you have set up a wildcard subdomain mapping for a custom domain, you can also address a module or any of its instances via a URL of the form `http://module.domain.com` or `http://instance.module.domain.com`. You can reliably cache state in each instance and retrieve it in subsequent requests.	Instances are addressable at URLs with the form: `http://instance.version.module.app_id.appspot.com`. If you have set up a wildcard subdomain mapping for a custom domain, you can also address a module or any of its instances via a URL of the form `http://module.domain.com` or `http://instance.module.domain.com`. You can reliably cache state in each instance and retrieve it in subsequent requests.
Scaling	App Engine scales the number of instances automatically in response to processing volume. This scaling factors in the `‘automatic_scaling’` settings that are provided on a per-version basis in the configuration file uploaded with the module version.	You configure the number of instances of each module version in that module’s configuration file. The number of instances usually corresponds to the size of a dataset being held in memory or the desired throughput for offline work. You can adjust the number of instances of a manually-scaled version very quickly, without stopping instances that are currently running, using the Modules API `‘set_num_instances’` function.	A basic scaling module version is configured with a maximum number of instances using the `‘max_instances’ basic_scaling` parameter. The number of live instances scales with the processing volume.

Instance classes

Instances are priced based on an hourly rate determined by the instance class.

Instance Class	Memory Limit	CPU Limit	Cost per Hour per Instance
B1	128 MB	600 Mhz	0.08
B2	256 MB	1.2 Ghz	0.16
B4	512 MB	2.4 Ghz	0.32
B4_1G	1024 MB	2.4 Ghz	0.48
B8	1024 MB	4.8 Ghz	0.64
F1	128 MB	600 Mhz	0.08
F2	256 MB	1.2 Ghz	0.16
F4	512 MB	2.4 Ghz	0.32

Manual and basic scaling instances are billed at hourly rates based on uptime. Billing begins when an instance starts and ends fifteen minutes after a manual instance shuts down or fifteen minutes after a basic instance has finished processing its last request. Runtime overhead is counted against the instance memory limit. This will be higher for Java than for Python.

Configuration

Python modules are defined in yaml files, one file per module. A yaml file usually takes the same name as the module it defines, but this is not required. Every yaml file must include a version parameter. To define the default module, you can explicitly include the parameter "module: default" or leave the module parameter out of the file.

Each module's configuration file defines the scaling type and instance class for a specific module/version. Different scaling parameters are used depending on which type of scaling you specify. If you do not specify scaling, automatic scaling is the default.

For each module you can also specify settings that map URL requests to specific python scripts and identify static files for better server efficiency. These settings are also included in the yaml file and are described in the App Config section. The following examples show how to configure modules for each scaling type.

Manual Scaling

application: simple-sample
module: my_module
version: uno
runtime: python27
instance_class: B8
manual_scaling:
  instances: 5

`instance_class:`

The Instance Class size for this module. When using manual scaling, the B1, B2, B4, B4_1G, and B8 instance classes are available. If you do not specify a class, B2 is assigned by default.

`manual_scaling:`

Required to enable manual scaling for a module.

`instances:`

The number of instances to give to assign to the module to start. This number can later be altered by using the Modules API set_num_instances() function.

Basic Scaling

application: simple-sample
module: my_module
version: uno
runtime: python27
instance_class: B8
basic_scaling:
  max_instances: 11
  idle_timeout: 10m

`instance_class:`

The instance class size for this module. When using basic scaling, the B1, B2, B4, B4_1G, and B8 instance classes are available. If you do not specify a class, B2 is assigned by default.

`basic_scaling:`

Required to enable basic scaling for a module

`max_instances:`

Required. The maximum number of instances for App Engine to create for this module version. This is useful to limit the costs of a module.

`idle_timeout:`

Optional. The instance will be shut down this amount of time after receiving its last request.

Automatic Scaling

application: simple-sample
module: my_module
version: uno
runtime: python27
instance_class: F2
automatic_scaling:
  min_idle_instances: 5
  max_idle_instances: automatic  # default value
  min_pending_latency: automatic  # default value
  max_pending_latency: 30ms

`automatic_scaling:`

Optional. Automatic scaling is assume by default.

`instance_class:`

The Instance Class size for this module. When using automatic scaling, only the F1, F2 and F4 instance classes are available. If you do not specify a class, F1 is assigned by default.

`min_idle_instances:`

The minimum number of idle instances that App Engine should maintain for this version. Only applicable to the default version, since other versions are not expected to receive significant traffic.

`max_idle_instances:`

The maximum number of idle instances that App Engine should maintain for this version.

`min_pending_latency:`

The minimum amount of time that App Engine should allow a request to wait in the pending queue before starting a new instance to handle it.

`max_pending_latency:`

The maximum amount of time that App Engine should allow a request to wait in the pending queue before starting a new instance to handle it.

The default module

Every application must have a single default module. To define the default module, include the setting module: default in the module's yaml file, or leave the setting out.

An example

Here is an example of how you would configure yaml files for an application that has three modules: a default module that handles web requests, plus two more modules that handle mobile requests and backend processing.

Start by defining a configuration file named app.yaml that will handle all web-related requests:

application: simple-sample
version: uno
runtime: python27
api_version: 1
threadsafe: true

This configuration is identical to existing App Engine application configurations, it creates a default module with automatic scaling. Its public address is http[s]://simple‑sample.appspot.com.

Next, assume that you want to create a module to handle mobile web requests. For the sake of the mobile users (in this example) the max pending latency will be just a second and we’ll always have at least two instances idle. To configure this you would create a mobile-frontend.yaml configuration file. with the following contents:

application: simple-sample
module: mobile-frontend
version: uno
runtime: python27
api_version: 1
threadsafe: true

automatic_scaling:
  min_idle_instances: 2
  max_pending_latency: 1s

The module this file creates can be reached at http[s]://mobile‑frontend.simple‑sample.appspot.com.

Finally, add a module, called my-module for handling static backend work. This could be a continuous job that exports data from Datastore to BigQuery. The amount of work is relatively fixed, therefore you simply need 1 resident module at any given time. Also, these jobs will need to handle a large amount of in-memory processing, thus you’ll want modules with an increased memory configuration. To configure this you would create a my-module.yaml configuration file with the following contents.

application: simple-sample
module: my-module
version: uno
runtime: python27
api_version: 1
threadsafe: true

instance_class: B8
manual_scaling:
  instances: 1

The module this file creates can be reached at http[s]://my-module.simple‑sample.appspot.com.

Notice the manual_scaling: setting. The instances: parameter tells App Engine how many instances to create for this module.

You may also want to download this Python demo app and take a look.

Uploading modules

To deploy the example above, use the appcfg update command. If you are uploading the app for the first time, the default module must uploaded first, or if you are listing multiple modules, the default module must be the first module in the file list:

cd simple-sample
appcfg update app.yaml mobile-frontend.yaml my-module.yaml

You will receive verification via the command line as each module is successfully deployed. Once the application has been successfully deployed you can access it at http://simple‑sample.appspot.com.

Similarly, you can access each of the modules individually:

http://default.simple‑sample.appspot.com
http://mobile‑frontend.simple-sample.appspot.com
http://my-module.simple‑sample.appspot.com

If you run multiple versions of a module, you can access a specific version by prepending the version name to the URI. For example, http://uno.mobile‑frontend.simple‑sample.appspot.com will target version uno of the mobile‑frontend module.

Instance states

A manual or basic scaled instance can be in one of two states: Running or Stopped. All instances of a particular module/version share the same state. You can change the state of all the instances belonging to a module/version using the appcfg command or the Modules API.

Startup

Each module instance is created in response to a start request, which is an empty GET request to /_ah/start. App Engine sends this request to bring an instance into existence; users cannot send a request to /_ah/start. Manual and basic scaling instances must respond to the start request before they can handle another request. The start request can be used for two purposes:

To start a program that runs indefinitely, without accepting further requests
To initialize an instance before it receives additional traffic

Manual scaling instances and basic scaling instances startup differently. When you start a manual scaling instance, App Engine immediately sends a /_ah/start request to each instance. When you start an instance of a basic scaling module, App Engine allows it to accept traffic, but the /_ah/start request is not sent to an instance until it receives its first user request. Multiple basic scaling instances are only started as necessary, in order to handle increased traffic.

When an instance responds to the /_ah/start request with an HTTP status code of 200–299 or 404, it is considered to have successfully started and can handle additional requests. Otherwise, App Engine terminates the instance. Manual scaling instances are restarted immediately, while basic scaling instances are restarted only when needed for serving traffic.

Shutdown

The shutdown process may be triggered by a variety of planned and unplanned events, such as:

You manually stop an instance using the 'appcfg stop' command or the Modules API stop_module function call.
Manually stop an instance from the Admin Console Versions page.
You update the module version using 'appcfg update'.
The instance exceeds the maximum memory for its configured instance_class.
Your application runs out of Instance Hours quota.
The machine running the instance is restarted, forcing your instance to move to a different machine.
App Engine needs to move your instance to a different machine to improve load distribution.

When App Engine needs to turn down a manual scaling instance it first notifies the instance. There are two ways to receive this notification. First, the is_shutting_down() method from google.appengine.api.runtime begins returning true. Second, if you have registered a shutdown hook, it will be called. It's a good idea to register a shutdown hook in your start request. After the notification is issued, existing requests are given 30 seconds to complete, and new requests immediately return 404.

If an instance is handling a request, App Engine pauses the request and runs the shutdown hook. If there is no active request, App Engine sends an /_ah/stop request, which runs the shutdown hook. The /_ah/stop request bypasses normal handling logic and cannot be handled by user code; its sole purpose is to invoke the shutdown hook. If you raise an exception in your shutdown hook while handling another request, it will bubble up into the request, where you can catch it.

If you are using threadsafe in Python 2.7, raising a deadline from a shutdown hook copies that exception to all threads. The following code sample demonstrates a basic shutdown hook:

from google.appengine.api import apiproxy_stub_map
from google.appengine.api import runtime

def my_shutdown_hook():
  apiproxy_stub_map.apiproxy.CancelApiCalls()
  save_state()
  # May want to raise an exception

runtime.set_shutdown_hook(my_shutdown_hook)
Alternatively, the following sample demonstrates how to use the is_shutting_down() method:
while more_work_to_do and not runtime.is_shutting_down():
  do_some_work()
  save_state()

Instance uptime

App Engine attempts to keep manual and basic scaling instances running indefinitely. However, at this time there is no guaranteed uptime for manual and basic scaling instances. Hardware and software failures that cause early termination or frequent restarts can occur without prior warning and may take considerable time to resolve; thus, you should construct your application in a way that tolerates these failures. The App Engine team will provide more guidance on expected instance uptime as statistics become available.

Good strategies for avoiding downtime due to instance restarts include:

Load balancing across multiple instances
Configuring more instances than are normally required to handle your traffic patterns
Writing fall-back logic that uses cached results when a manual scaling instance is unavailable
Reducing the amount of time it takes for your instances to start up and shutdown
Duplicating the same state in more than one instance.
For long running computations, checkpoint the state from time to time, so you can resume it it doesn't complete.

It's also important to recognize that the shutdown hook is not always able to run before an instance terminates. In rare cases, an outage can occur that prevents App Engine from providing 30 seconds of shutdown time. Thus, we recommend periodically checkpointing the state of your instance and using it primarily as an in-memory cache rather than a reliable data store.

Background threads

Code running on a manual scaling instance can start a background thread that may outlive the request that spawns it. This allows instances to perform arbitrary periodic or scheduled tasks or to continue working in the background after a request has returned to the user.

A background thread's os.environ and logging entries are independent of those of the spawning thread.

The BackgroundThread class is like the regular Python threading.Threadclass, but can "outlive" the request that spawns it.

from google.appengine.api import background_thread

def f(arg1, arg2, *kwargs):
  ...something useful...

t = background_thread.BackgroundThread(target=f, args=["foo", "bar"]) t.start()

There is a function start_new_background_thread which creates a background thread and starts it:

from google.appengine.api import background_thread

def f(arg1, arg2, *kwargs):
  ...something useful...

tid = background_thread.start_new_background_thread(f, ["foo", "bar"])

Monitoring resource usage

The Instances Console section of the Administration Console provides visibility into how instances are performing. By selecting your module and version in the dropdowns, you can see the memory and CPU usage of each instance, uptime, number of requests, and other statistics. You can also manually initiate the shutdown process for any instance.

You also can use the Runtime API to access statistics showing the CPU and memory usage of your instances. These statistics help you understand how resource usage responds to requests or work performed, and also how to regulate the amount of data stored in memory in order to stay below the memory limit of your instance class.

Logging

You can use the LogService API to access your app's request and application logs. In particular, the fetch() function allows you to retrieve logs using various filters, such as request ID, timestamp, module ID, and version ID.

Application (user/app-generated) logs are periodically flushed while manual and basic scaling instances handle requests. (since modules can run on a request a long time, logs may not flush for a while) You can tune the flush settings, or force an immediate flush, using the LogService API. When a flush occurs, a new log entry is created at the time of the flush, containing any log messages that have not yet been flushed. These entries show up in the Logs Console marked with "flush", and include the start time of the request that generated the flush.

Communication between modules

Modules can share state by using the Datastore and memcache. They can collaborate by assigning work between them using Task Queues. To access these shared services, use the corresponding App Engine APIs. Calls to these APIs are automatically mapped to the application’s namespace.

The Modules API provides functions to retrieve the address of a module, a version, or an instance. This allows an application to send requests from one module, version, or instance to another module, version, or instance. This works in both the development and production environments. The Modules API also provides functions that return information about the current operating environment (module, version, and instance).

The following code sample shows how to get the module name and instance id for a request:

from google.appengine.api import modules
module = modules.get_current_module_name()
instance = modules.get_current_instance_id()

The instance ID of an automatic scaled module will be returned as a unique base64 encoded value, e.g. 'e4b565394caa'.

You can communicate between modules in the same app by fetching the hostname of the target module:

import urllib2 from google.appengine.api import modules

url = "http://%s/" % modules.get_hostname(module="my-backend")
try:
  result = urllib2.urlopen(url)
  doSomethingWithResult(result)
except urllib2.URLError, e:
  handleError(e)

You can also use the URLFetch service.

To be safe, the receiving module should validate that the request is coming from a valid client. You can check that the Inbound-AppId header or user-agent-string matches the app-id fetched with the AppIdentity service. Or you can use OAuth to authenticate the request.

You can configure any manual or basic scaling module to accept requests from other modules in your app by adding the "login:admin" specification to the module's handler. In that case any URLFetch from any other module in the app will be automatically authenticated by App Engine, and any URLFetch call from a module that is not part of the application will be rejected.

If you want a module to receive requests from anywhere you must code your own secure solution as you would for any App Engine application. This is usually done by implementing a custom API and authentication mechanism.

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 3.0 License, and code samples are licensed under the Apache 2.0 License. For details, see our Site Policies.

Last updated September 9, 2013.

Google App Engine

Feedback on this document

App Engine Modules in Python

Application hierarchy

Instance scaling and class

Manual Scaling

Basic Scaling

Automatic Scaling

Scaling Types

Instance classes

Configuration

Manual Scaling

`instance_class:`

`manual_scaling:`

`instances:`

Basic Scaling

`instance_class:`

`basic_scaling:`

`max_instances:`

`idle_timeout:`

Automatic Scaling

`automatic_scaling:`

`instance_class:`

`min_idle_instances:`

`max_idle_instances:`

`min_pending_latency:`

`max_pending_latency:`

The default module

An example

Uploading modules

Instance states

Startup

Shutdown

Instance uptime

Background threads

Monitoring resource usage

Logging

Communication between modules

Authentication required

Signing you in...

Google App Engine Feedback on this document

App Engine Modules in Python

Application hierarchy

Instance scaling and class

Manual Scaling

Basic Scaling

Automatic Scaling

Scaling Types

Instance classes

Configuration

Manual Scaling

instance_class:

manual_scaling:

instances:

Basic Scaling

instance_class:

basic_scaling:

max_instances:

idle_timeout:

Automatic Scaling

automatic_scaling:

instance_class:

min_idle_instances:

max_idle_instances:

min_pending_latency:

max_pending_latency:

The default module

An example

Uploading modules

Instance states

Startup

Shutdown

Instance uptime

Background threads

Monitoring resource usage

Logging

Communication between modules

Authentication required

Signing you in...

Google App Engine

Feedback on this document

`instance_class:`

`manual_scaling:`

`instances:`

`instance_class:`

`basic_scaling:`

`max_instances:`

`idle_timeout:`

`automatic_scaling:`

`instance_class:`

`min_idle_instances:`

`max_idle_instances:`

`min_pending_latency:`

`max_pending_latency:`