Slow RTT registers

Kaspar · 17 September 2020 15:10

Hey everyone and especially @Michel,

https://github.com/RIOT-OS/RIOT/pull/14013 once again brought up the subject of RTTs that have a usable frequency but are very slow to read (e.g., on sam0).

IIRC they impose a delay of up to 6 slow clock ticks for every read or write to the RTT registers.

1. 6 slow clock ticks at 32 kHz mean ~180 µs.
2. During the synchronization, the bus is blocked (nothing on the bus can interact with the system; in practice, that means most peripheral ISRs are blocked).
3. A read can be “cached”, resulting in an instant read of the value, which is then 0 to 5 ticks “old”.
4. With the current periph_rtt (used by ztimer), a set() requires a read and a write, so two syncs minimum. I think ztimer might even do a read or two more.
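
As a rough check of these numbers, here is a minimal C sketch. It assumes a 32.768 kHz slow clock (the post only says 32 kHz) and the worst case of 6 slow clock ticks per synchronized register access. The macro and function names are made up for illustration and are not part of RIOT.

```c
#include <assert.h>
#include <stdint.h>

/* Assumptions, not taken from a datasheet: 32.768 kHz slow clock and
 * a worst case of 6 slow clock ticks per synchronized access. */
#define SLOW_CLOCK_HZ   32768U
#define SYNC_TICKS_MAX  6U

/* Worst-case stall in microseconds for `accesses` synchronized
 * RTT register accesses (reads or writes). */
static inline uint32_t sync_delay_us(uint32_t accesses)
{
    /* Precondition: keeps accesses * 6 * 1000000 within 32 bits. */
    assert(accesses <= 700U);
    return (accesses * SYNC_TICKS_MAX * 1000000U) / SLOW_CLOCK_HZ;
}
```

Under these assumptions, sync_delay_us(1) gives 183 and sync_delay_us(2) gives 366, so a set() that needs one read and one write can stall for roughly 0.37 ms in the worst case.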

IMO this makes the RTT so slow as to be unusable for many use-cases.

I propose we brainstorm a bit on how to deal with it. Ideas?

Kaspar · 17 September 2020 15:26

Maybe synchronize some timers. E.g., if the board wakes up because of an RTT ISR, assume the current time is the previously set wakeup time. From then on, use a converted high-frequency timer. Only before entering sleep the next time, set() the RTT once and store the expected wakeup time.
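
A minimal sketch of how this could look, assuming a 32.768 kHz RTT and a 1 MHz fast timer. All names here (sync_point_t, sync_on_rtt_wakeup() and so on) are hypothetical and not existing RIOT APIs, and the timer values are passed in as plain numbers so that no hardware access is needed.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define RTT_FREQ_HZ   32768U    /* assumed slow clock frequency */
#define FAST_FREQ_HZ  1000000U  /* assumed fast timer frequency */

/* A known pair of (RTT value, fast timer value) for the same instant. */
typedef struct {
    uint32_t rtt_base;   /* RTT ticks at the last known RTT event */
    uint32_t fast_base;  /* fast timer ticks captured at that event */
} sync_point_t;

/* Called on wakeup from an RTT alarm: the RTT value is assumed to be the
 * alarm we programmed earlier, so no slow RTT read is needed. */
static inline void sync_on_rtt_wakeup(sync_point_t *sp,
                                      uint32_t programmed_alarm,
                                      uint32_t fast_now)
{
    assert(sp != NULL);
    sp->rtt_base = programmed_alarm;
    sp->fast_base = fast_now;
}

/* Estimate the current RTT value from the fast timer alone. */
static inline uint32_t rtt_now_estimate(const sync_point_t *sp,
                                        uint32_t fast_now)
{
    assert(sp != NULL);
    /* Unsigned subtraction also handles a wrapped fast timer. */
    uint32_t elapsed_fast = fast_now - sp->fast_base;
    uint64_t elapsed_rtt = ((uint64_t)elapsed_fast * RTT_FREQ_HZ) / FAST_FREQ_HZ;
    return sp->rtt_base + (uint32_t)elapsed_rtt;
}

/* Before sleeping: compute the alarm value to write to the RTT once.
 * The caller stores the result as the next expected wakeup time. */
static inline uint32_t rtt_alarm_before_sleep(const sync_point_t *sp,
                                              uint32_t fast_now,
                                              uint32_t delay_rtt_ticks)
{
    return rtt_now_estimate(sp, fast_now) + delay_rtt_ticks;
}
```

In this sketch the only slow access per sleep cycle is the single write of the alarm value. Fast timer drift and the conversion rounding are ignored here and would need to be bounded in a real implementation.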

Michel · 17 September 2020 16:06

Some brainstorming won’t hurt, so here are some of the things that pop up in my mind:

How bad the impact of (1) is depends entirely on the platform (and its config).
Completely blocking the bus (2) sounds really bad for many scenarios. Caching/shadowing combined with synchronization could help to mitigate that. Do you know whether, in this particular case, the 0-5 ticks age is deterministic, as in “sync every 5 ticks, then it just gets older till the next sync”, or more or less random?
In any case: it's not a property that is directly tied to the RTT API itself, so we need some other way to store and communicate this information in order to decide case by case.
This case-by-case problem is one aspect that made me think it would be nice to push these decisions inside the timer abstraction (or some utility function of it).
A simpler, lazier alternative could be to just define higher thresholds for when using the RTT is deemed reasonable.
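
The threshold idea could be sketched roughly as follows. The struct, its fields and the function are hypothetical and not part of any existing RIOT API; the overhead value would have to come from a datasheet or from measurements on the platform in question.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical per-platform timer properties. */
typedef struct {
    uint32_t freq_hz;          /* timer frequency */
    uint32_t set_overhead_us;  /* worst-case cost of one set() call */
} timer_props_t;

/* Treat the slow timer as reasonable only if the requested timeout is
 * at least `factor` times the cost of programming it. */
static inline bool rtt_is_reasonable(const timer_props_t *rtt,
                                     uint32_t timeout_us,
                                     uint32_t factor)
{
    assert(rtt != NULL);
    return (uint64_t)timeout_us >= (uint64_t)rtt->set_overhead_us * factor;
}
```

For example, with an assumed set() overhead of 366 µs and a factor of 10, a 1 ms timeout would be rejected and a 5 ms timeout accepted. The factor is the per-platform knob that would need to be chosen case by case.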

The underlying problem of varying hardware performance and limits is something we want to reflect through property access in the envisioned low-level timer API. I think Niels is about to post details on that very soon… As one of the next steps, after collecting and properly defining all the relevant metrics and culprits (of which there are many), we plan to run micro-benchmarks to measure such performance implications. That should help to decide what is possible on which platform.

> Maybe synchronize some timers. E.g. (…)

Yes. Probably not trivial, but worth a try. Pretty much what I already said here:

github.com/RIOT-OS/RIOT

Review by MichelRottleuthner - [WIP, RFC] doc/memos: Added RDM on high level timer API requirements and common features

RIOT-OS:master ← maribu:timer-rdm

> I think this is re-hashing the mailing list discussions from a year ago :) …

I know, and to be honest I don't want to bore people with lengthy discussions again. There is no point in these lengthy "abstract" discussions if not more of the developers jump in. Yet, the arguments are not less valid than before. It appears to me that we all hold single pieces of the big picture without an explicit agreement on how the big picture looks like. This document will eventually help with that I think.

> Otherwise, I see split between domains, clock drift boundaries, high precision LF, mentioned. Implement! :)

On it (at least for some of these problems) just from a slightly different direction.. I'll explain during the summit.

> Well, how do you do that without explicit synchronization? (..)

Not. At least not without any synchronization. Of course they need to be synchronized. But why double sync? And who says that you need to do this explicitly? We may very well use an "opportunistic" approach where you sync based on events that happen anyway. I.e., you don't care how long reading 'now' takes for the slow timer if you read the fast timer close enough to a known event of the slow timer... I don't say this solves all cases everywhere. I'm just trying to think towards a solution instead of searching for reasons against it.

> I call that expensive

Yes reading LF timer can indeed be expensive. Apart from reading not being necessarily required, some would argue transitioning to a low power mode for arbitrarily small periods is "expensive" because of the delay when leaving lpm. Again, it all depends on the perspective ;)

> I'm strongly in favor of not making things complex, and teach users to use ms (..)

Fair enough. I get your point.