
FAQ, statement_cache_size=0... #507

Open
bitmodeler opened this issue Nov 24, 2019 · 5 comments

@bitmodeler commented Nov 24, 2019

I have done a create_pool, passing the argument statement_cache_size=0; I am STILL periodically getting the DuplicatePreparedStatement error on statements prepared from a connection acquired from that pool.

Any other options? All other facilities appear to work reliably in your driver except this critically important prepared statement facility.

Running your latest driver, postgres 12; application is running in the context of a uwsgi server app. Connecting via a socket file (in /var/run...) to postgres...

@elprans (Member) commented Nov 24, 2019

We would need a bit more information to triage, such as sample code and steps to reproduce. Also, you don't usually need explicit prepared statements: asyncpg maintains an automatic prepared statement cache, which is sufficient in most cases.
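To illustrate the idea of the automatic cache mentioned above, here is a toy sketch of an LRU statement cache keyed by query text. This is not asyncpg's actual implementation; the class and method names are hypothetical.

```python
from collections import OrderedDict

class ToyStatementCache:
    """Toy LRU cache keyed by query text: repeating a query reuses the
    existing prepared-statement entry instead of preparing it again."""

    def __init__(self, max_size=100):
        self._max_size = max_size
        self._entries = OrderedDict()  # query text -> statement name

    def get_or_prepare(self, query, prepare):
        if query in self._entries:
            self._entries.move_to_end(query)   # mark as recently used
            return self._entries[query]
        name = prepare(query)                  # "prepare" only on a cache miss
        self._entries[query] = name
        if len(self._entries) > self._max_size:
            self._entries.popitem(last=False)  # evict least recently used
        return name

cache = ToyStatementCache(max_size=2)
prepared = []
make = lambda q: prepared.append(q) or f"stmt_{len(prepared)}"
name1 = cache.get_or_prepare("SELECT 1", make)
name2 = cache.get_or_prepare("SELECT 1", make)
# name1 == name2: the second call hit the cache, so only one prepare ran
```

Setting `statement_cache_size=0` disables this reuse, which is why the suggestion above is to simply run queries through the connection and let the cache do its job.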

@bitmodeler (Author) commented Nov 24, 2019

Thanks for your timely reply.

I made a Python class that does a prepare for named statements (this was discussed in another thread that you may have seen...). You don't have to examine all the code; the key things are MY "PreparedStatement" class and its __aenter__ and __aexit__ methods:

https://gitlab.com/osfda/asyncpg_utility/blob/master/asyncpg_utility.py

I set up a prepared statement in its __aenter__, assigning it to a member of the class:

self._statement = await self._connection.prepare(self._prepared_query._query)

Examining your source, I did not see any way to release the prepared statement, so in one version I had __aexit__ do nothing (figuring the connection release would clean it up...); then, just to force the issue, I also tried an explicit del of the prepared statement in __aexit__ (which is what you see at the link above...).
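The pattern described above can be sketched with a stub in place of a real connection (StubConnection and its method are hypothetical; no database is involved). Note that `del` in __aexit__ only drops the Python-side reference:

```python
import asyncio

class StubConnection:
    """Hypothetical stand-in for an asyncpg connection (no real database)."""
    def __init__(self):
        self.prepared = []

    async def prepare(self, query):
        self.prepared.append(query)
        return f"<statement for {query!r}>"

class PreparedStatement:
    """Sketch of the async context manager described above: prepare in
    __aenter__, drop the reference in __aexit__."""
    def __init__(self, connection, query):
        self._connection = connection
        self._query = query
        self._statement = None

    async def __aenter__(self):
        self._statement = await self._connection.prepare(self._query)
        return self._statement

    async def __aexit__(self, exc_type, exc, tb):
        del self._statement  # drops only the Python reference; the
        return False         # server-side statement is not deallocated

async def demo():
    conn = StubConnection()
    async with PreparedStatement(conn, "SELECT $1::int") as stmt:
        return stmt

result = asyncio.run(demo())
```

This mirrors the structure at the GitLab link; whether the server-side statement is actually released is exactly the open question in this thread.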

The code which calls that statement-preparing class is simply this:

async def allocate_pool(host, database, user, pw):
    return await asyncpg.create_pool(host=host,
                                     database=database,
                                     user=user,
                                     password=pw,
                                     statement_cache_size=0)  # Note the fixup...
# ...
pool = asyncio.get_event_loop().run_until_complete(allocate_pool(host, database, user, pw))

async def fetch_something(pool, criteria):
    async with pool.acquire() as db:
        async with PreparedStatement(db,
                                     class_with_a_positional_query_string_member
                                    ) as statement:
            return await statement.fetchval(named_parameter=criteria)
# ...
result = asyncio.get_event_loop().run_until_complete(fetch_something(pool, criteria))

The DuplicatePreparedStatement error gets triggered in the __aenter__ of MY PreparedStatement class, when it goes to make the prepare call.

It works a number of times; just not EVERY time, reliably!

@elprans (Member) commented Nov 24, 2019

asyncpg uses a simple monotonic counter to generate prepared statement names. The only two times I've seen DuplicatePreparedStatement were with PgBouncer, and with improper sharing of a connection across multiple threads. I cannot see any other reason why this error might occur in your case.
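A toy sketch of the counter scheme described above (the name format here is illustrative, not asyncpg's exact one) shows why duplicates cannot come from a single connection used correctly:

```python
import itertools

class NameGenerator:
    """Toy per-connection statement-name generator: a monotonic counter
    guarantees uniqueness within one connection, so a duplicate name can
    only appear when two clients end up sharing the same server session
    (e.g. behind a transaction-mode pooler like PgBouncer) or the same
    connection object across threads."""
    def __init__(self):
        self._counter = itertools.count(1)

    def next_name(self):
        return f"__stmt_{next(self._counter)}__"

gen = NameGenerator()
names = [gen.next_name() for _ in range(3)]
# every call yields a fresh, distinct name for this connection
```

The counter lives per connection, so the failure modes named above are exactly the cases where the server sees two "first" uses of the same name.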

@bitmodeler (Author) commented Nov 24, 2019

I have multiple functions concurrently calling fetch_something (each one independently executing its own run_until_complete...). Is pool.acquire thread-safe? (I had reasoned it would be...)

"pool" is stored globally at the main module level, and that module is a Falcon app served via uwsgi; uwsgi references an "app" global variable:

app = falcon.API(...

uwsgi will run MULTIPLE instances of the app; do you think that might be a problem?

I think each app module should have its own distinct global pool variable...
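One common way to get "each worker has its own pool" (the idea above) is to create the pool lazily in per-thread state instead of a shared module-level global. This is a sketch with a hypothetical FakePool standing in for a real asyncpg pool, which is bound to the event loop of the thread that created it:

```python
import threading

class FakePool:
    """Hypothetical stand-in for an asyncpg pool; records the creating
    thread, since a real pool is tied to that thread's event loop."""
    def __init__(self):
        self.owner = threading.get_ident()

_state = threading.local()

def get_pool():
    """Lazily create one pool per thread instead of sharing one global
    pool object across all uwsgi workers."""
    if not hasattr(_state, "pool"):
        _state.pool = FakePool()
    return _state.pool

pools = []
def worker():
    pools.append(get_pool())

threads = [threading.Thread(target=worker) for _ in range(2)]
for t in threads:
    t.start()
for t in threads:
    t.join()
# the two worker threads each got their own distinct pool
```

The same effect can be had structurally, as the next comment does, by giving each endpoint class its own pool.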

@bitmodeler (Author) commented Nov 24, 2019

Well, I did a rewrite.

In Falcon, each URL endpoint gets a class to handle it; so I added a dedicated pool for each endpoint/class, and bagged the module-level pool.

In testing: so far, so good. Having the endpoint classes share a module-level pool variable did not work out (I had presumed that the pool.acquire would handle the concurrency issues with that?)

In the process of doing that code restructuring, it got more efficient too (bagged some redundant connection acquisitions...)

As yet, I am not getting a performance gain from using it in Falcon's synchronous context; but if I were to dispatch asynchronous queries while simultaneously doing other asynchronous network tasks, then coordinate completion of those tasks in the synchronous endpoint routine, I would get the benefit (I'll do that eventually...).
