SSL Certificate Expiration Monitoring

Every TLS certificate has an expiration date. When it passes, browsers show warnings, API clients reject connections, and automated systems start throwing errors. The failure is always total, always visible, and almost always preventable.

SSL certificate expiration monitoring is the practice of continuously tracking the validity of every certificate on your infrastructure and alerting before expiry becomes an outage. This post covers what a real monitoring setup looks like, what signals matter, and how to avoid the common pitfalls. It is the expiry-focused deep dive within the broader SSL certificate monitoring picture, which also covers chains, issuance, and configuration.

Why Expired Certificates Keep Happening

Certificate expiry is the most predictable failure in infrastructure operations. You know the exact moment it will happen. And yet expired certificates have taken down production services at Microsoft, Spotify, and Cisco, and they keep hitting countless smaller operators every year.

The reasons are consistent:

Automation is configured but silently fails weeks before expiry, with no alerting
A certificate is deployed manually once and then inherited by the next team
Internal services (monitoring agents, load balancer backends, mTLS clients) are missed by public scanners
Short-lived certificates are renewing successfully on one host but not on the others behind the same DNS record
The person who set up renewal has left the company

None of these are technical problems. They're visibility problems. Monitoring is how you close that gap.

What Certificate Monitoring Actually Checks

A useful monitoring system goes beyond a single expiry date. Certificates can be broken in multiple ways, each with its own failure signature:

Expiration: the most obvious. Alert at a configurable threshold before notAfter is reached.
Chain integrity: an expired or missing intermediate certificate causes the same browser errors as an expired leaf. The full chain must be validated, not just the end certificate.
Hostname matching: a certificate served on the wrong hostname (common after load balancer changes or CDN migrations) is functionally expired from the client's perspective.
Revocation: a certificate that has been revoked by its CA is no longer valid, even if it hasn't expired. Revocation is published through a CRL, which the CA/Browser Forum has required of every CA since March 2024. OCSP is optional now and on its way out: Let's Encrypt shut down its responders in August 2025 and Google Trust Services stopped embedding an OCSP URI in most chains, though DigiCert, Sectigo and GoDaddy responders still answer.
CA trust: a certificate issued by a CA that browsers have distrusted (a rare but real event) will fail to validate.
Fingerprint change: an unexpected certificate rotation may indicate a successful renewal, or it may indicate a configuration change you didn't authorize.

A monitoring system that only tracks expiry dates misses five of the six failure modes above.
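
As a rough illustration, Python's standard ssl module already surfaces several of these failure modes during the handshake. The sketch below maps a few OpenSSL verification codes onto the categories above. The names FAILURE_MODES, classify, and probe are made up for this example, and the mapping is deliberately incomplete. Two caveats: a default context does not load CRLs, so the revocation code only appears if you configure CRL checking yourself, and a fingerprint change cannot be seen from a single handshake because it needs stored state.

```python
import socket
import ssl

# OpenSSL X509_V_ERR_* codes mapped to the failure modes listed above.
# Only a handful of codes are covered; anything else is reported as "other".
FAILURE_MODES = {
    10: "expiration",         # CERT_HAS_EXPIRED (leaf or an intermediate)
    19: "CA trust",           # SELF_SIGNED_CERT_IN_CHAIN (root not in trust store)
    20: "chain integrity",    # UNABLE_TO_GET_ISSUER_CERT_LOCALLY
    21: "chain integrity",    # UNABLE_TO_VERIFY_LEAF_SIGNATURE
    23: "revocation",         # CERT_REVOKED (needs CRL checking enabled)
    62: "hostname matching",  # HOSTNAME_MISMATCH
}


def classify(verify_code: int) -> str:
    """Translate an OpenSSL verification code into a failure mode name."""
    return FAILURE_MODES.get(verify_code, "other")


def probe(hostname: str, port: int = 443, timeout: float = 10.0) -> str:
    """Return "ok", or the failure mode reported by the TLS handshake."""
    context = ssl.create_default_context()  # validates the chain and the hostname
    try:
        with socket.create_connection((hostname, port), timeout=timeout) as sock:
            with context.wrap_socket(sock, server_hostname=hostname):
                return "ok"
    except ssl.SSLCertVerificationError as exc:
        return classify(exc.verify_code)
```

Connection errors other than certificate verification failures are left to propagate here, so a caller can tell "bad certificate" apart from "nothing is listening".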

Choosing the Right Alert Threshold

The single most important configuration decision is how far in advance to alert. Set it too far out and you'll get noise during normal renewal cycles. Set it too close and you won't have time to fix problems before users are affected.

The right threshold depends on certificate lifetime:

Certificate Lifetime	Suggested Alert Threshold	Why
398 days (legacy)	30 days	Plenty of runway; rarely noisy
200 days (current CA/B maximum)	21 days	Any renewal window has long since closed
90 days (Let's Encrypt current)	15 days	Past the automatic renewal window
47 days (post-2029 maximum)	7 days	Short but still actionable

The goal is to alert once automatic renewal has clearly missed its normal window, while leaving enough lead time to diagnose and fix the problem. As certificate lifetimes continue to shrink across the industry, that margin tightens, so alert thresholds need to shrink alongside certificate validity periods.
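
One way to apply the table is to derive the threshold from each certificate's own validity period rather than hard-coding a single number. This is a minimal sketch under a few assumptions: the function names are invented for illustration, lifetimes that fall between table rows take the threshold of the nearest shorter row, and lifetimes under 47 days fall back to roughly one sixth of the lifetime, which is not something the table specifies.

```python
from datetime import datetime, timedelta

# (certificate lifetime in days, alert threshold in days), from the table above.
THRESHOLDS = [(398, 30), (200, 21), (90, 15), (47, 7)]


def alert_threshold_days(lifetime_days: int) -> int:
    """Pick the threshold of the longest table row that fits the lifetime."""
    for row_lifetime, threshold in THRESHOLDS:
        if lifetime_days >= row_lifetime:
            return threshold
    # Assumption, not from the table: very short-lived certificates
    # alert at about one sixth of their lifetime, with a floor of one day.
    return max(1, lifetime_days // 6)


def should_alert(not_before: datetime, not_after: datetime, now: datetime) -> bool:
    """True when the time remaining is at or below the lifetime's threshold."""
    # Round to whole days: some CAs issue "90 days minus one second",
    # which would otherwise be truncated to 89 and land in the wrong row.
    lifetime_days = round((not_after - not_before).total_seconds() / 86400)
    remaining = not_after - now
    return remaining <= timedelta(days=alert_threshold_days(lifetime_days))
```

With these rules, a 90-day certificate with 20 days left stays quiet, and the same certificate with 10 days left raises an alert.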

Internal Certificates Are Often the Biggest Gap

Public-facing certificates on your website and API are typically the ones that get monitored first, because they're visible. But internal infrastructure often runs on certificates that are harder to track:

mTLS between microservices
Kubernetes ingress and internal service meshes
Database connections (PostgreSQL, MySQL, MongoDB) using TLS
LDAP directories
Internal VPN endpoints
Build and CI/CD systems with code-signing or artifact certificates

These certificates typically come from an internal CA, aren't visible to external scanners, and often get renewed manually. They cause the same kind of outages as public certificates, but they're harder to find and easier to forget. Any serious monitoring setup needs a way to cover internal infrastructure, whether through agents running inside the network or direct visibility into your internal CA.

Where Manual Checking Fails

A common starter approach is a cron job that runs openssl s_client or a curl probe once a day. It works for a handful of certificates, but it breaks down quickly:

It only checks the leaf certificate, not the full chain
It doesn't check alternate IPs behind a DNS round-robin, or STARTTLS protocols like SMTP and LDAP
Failures become cron output nobody reads, with no alerting layer
It won't catch a certificate that stops serving because a listener was removed

For a single-server blog, the cron approach is fine. For anything larger, you need a dedicated monitoring layer. You can also sanity-check any public certificate manually with the SSL Certificate Check on Mr. DNS, which shows the expiry date, the SANs, the key details, and the chain the server actually sends for any hostname.
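
Some of the gaps listed above can be narrowed without leaving the standard library. The sketch below resolves every address behind a hostname and performs a fully verified handshake against each one, so a stale certificate on one backend is not hidden by DNS round-robin. Function names are hypothetical, and STARTTLS protocols are not handled here.

```python
import socket
import ssl
from datetime import datetime, timezone


def days_until_expiry(not_after: str, now: datetime) -> float:
    """Convert a notAfter string, as returned by getpeercert(), into days left."""
    expires = datetime.fromtimestamp(ssl.cert_time_to_seconds(not_after), tz=timezone.utc)
    return (expires - now).total_seconds() / 86400


def resolve_all(hostname: str, port: int = 443) -> list:
    """Return every distinct IPv4/IPv6 address the name resolves to."""
    infos = socket.getaddrinfo(hostname, port, type=socket.SOCK_STREAM)
    return sorted({info[4][0] for info in infos})


def check_every_address(hostname: str, port: int = 443, timeout: float = 10.0) -> dict:
    """Map each address to days remaining, or to the error text if the check failed."""
    context = ssl.create_default_context()  # verifies chain and hostname
    now = datetime.now(timezone.utc)
    results = {}
    for address in resolve_all(hostname, port):
        try:
            with socket.create_connection((address, port), timeout=timeout) as sock:
                # Connect by IP, but send the real hostname for SNI and verification.
                with context.wrap_socket(sock, server_hostname=hostname) as tls:
                    not_after = tls.getpeercert()["notAfter"]
                    results[address] = days_until_expiry(not_after, now)
        except OSError as exc:  # includes ssl.SSLError and timeouts
            results[address] = f"{type(exc).__name__}: {exc}"
    return results
```

Because verification is left on, an already expired or mismatched certificate shows up as an error string for that address rather than as a negative number of days. A listener that has been removed shows up the same way, as a connection error, instead of disappearing from the results.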

Getting TLS Configuration Right

Monitoring catches problems once they exist. Good TLS configuration prevents a category of them in the first place: weak keys or signature algorithms that force certificate replacement, chain configuration errors that make valid certs appear broken, and HSTS policies that leave visitors no way to click past a certificate error. GoodTLS has production-ready configurations for common web servers (Nginx, Apache, Caddy, HAProxy) and mail servers (Postfix, Exim, Dovecot) that pair well with active monitoring.

Building the Monitoring Layer

The starting point is knowing what you have: an inventory that includes internal services, not just the public hostnames visible from outside. From there, the monitoring runs continuously: daily checks at minimum, covering expiry, chain integrity, and hostname matching across every IP behind your DNS records, not just the primary.
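
To make the inventory idea concrete, here is a small sketch of what one entry might hold and how the certificates collected from each address could be compared. Everything in it is an assumption for illustration: the InventoryEntry fields, the review function, and the idea of feeding it DER bytes (for example from getpeercert(binary_form=True) in Python's ssl module) gathered per address.

```python
import hashlib
from dataclasses import dataclass
from typing import Optional


@dataclass
class InventoryEntry:
    hostname: str
    port: int = 443
    owner: str = "unassigned"
    internal: bool = False
    last_fingerprint: Optional[str] = None  # SHA-256 seen on the previous run


def fingerprint(der_bytes: bytes) -> str:
    """SHA-256 fingerprint of a DER-encoded certificate, as lowercase hex."""
    return hashlib.sha256(der_bytes).hexdigest()


def review(entry: InventoryEntry, der_by_address: dict) -> list:
    """Compare the certificates served by each address behind one inventory entry."""
    findings = []
    prints = {addr: fingerprint(der) for addr, der in der_by_address.items()}
    if len(set(prints.values())) > 1:
        # Typical of a renewal that succeeded on one host but not the others.
        findings.append("addresses behind this name serve different certificates")
    if entry.last_fingerprint is not None and entry.last_fingerprint not in prints.values():
        # Could be a routine renewal or an unauthorized change; a human decides.
        findings.append("certificate changed since the last check")
    return findings
```

An empty list means every address serves the same certificate as last time. Recording an owner on each entry is what later lets an alert go to the right person.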

Alerting is where most DIY setups fall short. A check that fails silently is worse than no check, because it creates the illusion of coverage. Useful alerts go to whoever owns the certificate, with enough detail to diagnose without digging through logs.
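
The difference between a silent failure and a loud one often comes down to a few lines in the check runner. In this sketch, a check that crashes produces an alert of its own instead of vanishing, and every alert carries the owner and the failure detail. The Alert class and run_check function are hypothetical names, and delivery (email, chat, paging) is left out.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Alert:
    owner: str
    hostname: str
    failure_mode: str
    detail: str

    def render(self) -> str:
        """One line with enough context to act on without opening logs."""
        return f"[{self.failure_mode}] {self.hostname} (owner: {self.owner}): {self.detail}"


def run_check(hostname: str, owner: str,
              check: Callable[[str], Optional[str]]) -> Optional[Alert]:
    """Run one check; `check` returns None when healthy, or a problem description."""
    try:
        problem = check(hostname)
    except Exception as exc:
        # The check itself broke. Report that instead of swallowing it,
        # otherwise coverage silently disappears.
        return Alert(owner, hostname, "check failed to run", repr(exc))
    if problem is None:
        return None
    return Alert(owner, hostname, "certificate problem", problem)
```

For example, a check that reports "expires in 3 days" for api.example.com owned by platform renders as "[certificate problem] api.example.com (owner: platform): expires in 3 days".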

Generator Labs certificate monitoring covers all of this, including internal infrastructure via on-premise agents. Alerts include the certificate details and the specific failure mode so whoever is on call can act without needing to look up context.

An expired certificate is one of the few infrastructure failures with a known, scheduled date. There's no reason to be surprised by it.