Skip to content

2026

Improvements in generation latency estimation

We have made an improvement regarding how we estimate the generation latency (i.e. estimated duration of a request excluding network latency) for a request. Our new default estimation is based on time to first token and throughput metrics collected on OpenRouter for the supporter providers and models.

The new generation latency calculation is now:

\[ \text{generation latency} = \text{time to first token} + \text{throughput} \times \text{output tokens} \]

With:

  • Time-to-first-token (TTFT) represents the average duration in seconds of the pre-fill phase for an LLM.
  • Throughput (TPS) represents the average number of output tokens generated per second, it helps estimate the duration of the decode phase for an LLM.

These two metrics are being collected from OpenRouter, a service that centralizes the access to many AI providers with a single API key. Since the service is widely adopted (over 30 trillion tokens per month) the average data should be representative of real-world conditions.

This work extends what was previously done to patch the energy and impacts overestimations we had in EcoLogits Calculator compared to the Python library. Having this new estimation method in our core methodology makes it more reliable and reusable in all projects that depend on EcoLogits.

It is important to note that the old method to estimate generation latency using the ML.ENERGY Leaderboard is still being used when TTFT and TPS values are not available on OpenRouter. This is the case for the Hugging Face inference provider that we support.

EcoLogits and CodeCarbon are joining forces

The year 2025 was a milestone for EcoLogits. We saw growing adoption, formed new partnerships, and made significant methodological progress. More organizations than ever are now using EcoLogits to estimate and reduce the environmental footprint of generative AI.

Today we are pleased to announce a major step forward: EcoLogits is joining the CodeCarbon non-profit. This collaboration unites two complementary initiatives, driven by a shared commitment to scientific excellence. Together, we aim to make environmental impact assessment for AI more accessible, transparent, and effective by building open source tools that are robust and widely trusted.

This alliance expands our technical and partnership capabilities, strengthens our methodological expertise, and increases our collective impact on the responsible digital ecosystem.

Follow EcoLogits as we continue this work within the CodeCarbon non-profit. 🤗