Trouble getting HTTPS / letsencrypt working with 0.9.0-beta.4

Seth_Nickell · March 11, 2020, 1:40am

I’ve been using JH 0.8 (included as a requirements.yaml for my own service) on google cloud, tried playing with 0.9.0-beta.4, and have not been able to get https working after an upgrade. It looks like autohttps isn’t properly serving the acme challenge on port 80 is 0.9.0-betas?

When I visit https://improc.ceresimaging.net (which maps correctly to 35.203.130.226) I get an SSL protocol error (basically https server had an internal error). If I visit http://improc.ceresimaging.net, I get redirected to 443/https.

All the logs look ok, except the autohttps pod is failing to complete the letsencript http challenge, timing out trying to access .well-know/acme-challenge on port 80:

Running wget on my computer produces the same results:

➜  ~ wget http://improc.ceresimaging.net/.well-known/acme-challenge/018QQqoEpMphNo8_7J61TOcmQ7oGhZ7WOAl3VMfJuJc
--2020-03-10 15:37:04--  http://improc.ceresimaging.net/.well-known/acme-challenge/018QQqoEpMphNo8_7J61TOcmQ7oGhZ7WOAl3VMfJuJc
Resolving improc.ceresimaging.net (improc.ceresimaging.net)... 35.203.130.226
Connecting to improc.ceresimaging.net (improc.ceresimaging.net)|35.203.130.226|:80... connected.
HTTP request sent, awaiting response... 404 Not Found
2020-03-10 15:38:06 ERROR 404: Not Found.

The services look like they’re up and healthy, as do the pods.

Relevant part of values.yaml is as follows:

  proxy:
    https:
      hosts:
        - improc.ceresimaging.net
      letsencrypt:
        contactEmail: seth@ceresimaging.net
    secretToken: "SECRETS DELETED"
    service:
      loadBalancerIP: 35.203.130.226

In case its helpful, here’s the full values.yaml with secrets elided:

jupyterhub:
  proxy:
    https:
      hosts:
        - improc.ceresimaging.net
      letsencrypt:
        contactEmail: seth@ceresimaging.net
    secretToken: "SECRETS DELETED"
    service:
      loadBalancerIP: 35.203.130.226
	  
  singleuser:
    defaultUrl: "/lab"
    image:
      name: gcr.io/ceres-imaging-science/improc-notebook
      tag: latest

    extraEnv:
      JUPYTER_ENABLE_LAB: "yes"
      GRANT_SUDO: "yes"

    storage:
      homeMountPath: /home/{username}
      extraVolumes:
        - name: ceres-flights
          persistentVolumeClaim:
            claimName: ceres-flights
      extraVolumeMounts:
        - name: ceres-flights
          mountPath: /home/{username}/flights

    cmd: "start-singleuser.sh"

    # start as root, we drop privs once NB_USER is set by CustomGoogleOAuthenticator below
    uid: 0
  hub:
    image:
      name: gcr.io/ceres-imaging-science/improc-hub
      tag: latest
    imagePullSecret:
      registry: gcr.io
      username: _json_key
      password: |-
        {
          "type": "service_account",
		  # SECRETS DELETED
        }
    extraConfig:
      logo: |
        c.JupyterHub.logo_file = '/usr/local/share/jupyterhub/static/images/ceres-logo.svg'
      useCeresOAuthenticator: |
        c.JupyterHub.authenticator_class = CeresOAuthenticator
  prePuller:
    hook:
      enabled: false

  auth:
    admin:
      users:
        - SECRETS DELETED
    type: google
    google:
	  # SECRETS DELETED

    state:
      enabled: true
      cryptoKey: SECRETS DELETED

debug:
  enabled: true

Seth_Nickell · March 11, 2020, 1:43am

Which pod should be responding to the acme challenge, and what’s the path of loadbalancer/service/route that the request should be taking from public-proxy to that pod?

I notice the kube-lego pod(s) and service are no longer present, I’m guessing that autohttps is taking over this roll?

matthew.brett · July 2, 2020, 2:36pm

Hi,

I’m running into a very similar problem - the default LetsEncrypt step is failing at the challenge.

I’m using 0.9.0 - but I’ve also tried the latest 0.9.0 chart.

My config.yaml is as simple as I could make it:

proxy:
  secretToken: "need-to-know-basis"
  https:
    hosts:
      - uobhub.org
    letsencrypt:
      contactEmail: matthew.brett@gmail.com
  service:
    loadBalancerIP: 35.189.82.198

Log from kubectl logs pod/autohttps-7b465f7b8b-lp5ww traefik -f gives:

time="2020-07-02T14:21:55Z" level=info msg="Starting provider aggregator.ProviderAggregator {}"
time="2020-07-02T14:21:55Z" level=info msg="Starting provider *file.Provider {\"watch\":true,\"filename\":\"/etc/traefik/dynamic.toml\"}"
time="2020-07-02T14:21:55Z" level=info msg="Starting provider *acme.Provider {\"email\":\"matthew.brett@gmail.com\",\"caServer\":\"https://acme-v02.api.l
etsencrypt.org/directory\",\"storage\":\"/etc/acme/acme.json\",\"keyType\":\"RSA4096\",\"httpChallenge\":{\"entryPoint\":\"http\"},\"ResolverName\":\"le\
",\"store\":{},\"ChallengeStore\":{}}"
time="2020-07-02T14:21:55Z" level=info msg="Testing certificate renew..." providerName=le.acme
time="2020-07-02T14:21:55Z" level=info msg="Starting provider *traefik.Provider {}"
time="2020-07-02T14:22:11Z" level=error msg="Unable to obtain ACME certificate for domains \"uobhub.org\" : unable to generate a certificate for the doma
ins [uobhub.org]: acme: Error -> One or more domains had a problem:\n[uobhub.org] acme: error: 400 :: urn:ietf:params:acme:error:connection :: Fetching h
ttp://uobhub.org/.well-known/acme-challenge/btuQKX8X9Q6RlJGzpIgN7wi9RsCDxB8luT7r6oI2IE0: Timeout during connect (likely firewall problem), url: \n" provi
derName=le.acme
time="2020-07-02T14:22:24Z" level=error msg="Unable to obtain ACME certificate for domains \"uobhub.org\" : unable to generate a certificate for the doma
ins [uobhub.org]: acme: Error -> One or more domains had a problem:\n[uobhub.org] acme: error: 400 :: urn:ietf:params:acme:error:connection :: Fetching h
ttp://uobhub.org/.well-known/acme-challenge/btuQKX8X9Q6RlJGzpIgN7wi9RsCDxB8luT7r6oI2IE0: Timeout during connect (likely firewall problem), url: \n" provi
derName=le.acme

I can make LetsEncrypt work on my own Mac - here’s the result of a LetsEncrypt certification running on my home machine: https://jupyterhub.dynevor.org/

Any suggestions of what I could try next to debug?

Cheers,

Matthew

consideRatio · July 3, 2020, 5:21pm

When not using a very recent version of the Helm chart, newer than 0.9.0, the autohttps pod can save a failed attempt into a k8s secret and get stuck in a bad state. Due to this, I suggest:

Verify your domain points to the external IP you should see by writing kubectl get svc proxy-public.
Upgrade to use a Helm chart version like 0.9.0-n116.h1c766a1 or newer to get a version of the autohttps setup that avoid getting stuck in corrupt states by saving it to a secret which it reloads on startup.
Delete both the secret named proxy-public-tls-acme and the autohttps pod.

If done in order and these still fail, try deleting the autohttps pod some times. If this still fails:

Inspect logs of the autohttps pod
Inspect logs of proxy pod
Set helm chart configuration: debug.enabled: true and repeat for more details in logs.

This is the autohttps pod. With it around, traffic is routed from proxy-public svc to autohttps pod to proxy-http svc to proxy pod to whatever destination depending on path (/hub to hub pod, unknown paths to hub pod, and /user to user pods if they have servers running).

The TLS termination is done by the autohttps pod, which is now traefik v2 using the LEGO acme client library.

Upcoming fix in Traefik’s use of LEGO

The need to restart the autohttps pod is caused by this issue that I opened with Traefik. They have successfully reproduced this issue and a PR is now open to resolve it.

github.com/traefik/traefik

acme: sometimes two instead of one ACME order is placed, which sometimes leads to challenge failure

opened 01:21AM - 02 Jun 20 UTC

closed 10:54AM - 08 Jul 20 UTC

consideRatio

area/acme kind/bug/possible status/5-frozen-due-to-age

### Do you want to request a *feature* or report a *bug*? Bug ### What did y…ou do? I startup Traefik in the exact same way multiple times in a CI environment, but I end up with different outcomes in Traefik's ACME server interaction using the [go-acme/lego](https://github.com/go-acme/lego) library. ### What did you expect to see? I expected Traefik to use the ACME LEGO library to make _a single attempt_ to get a TLS certificate from the ACME server (one order), and then process it to succeed or fail. ### What did you see instead? Instead I sometimes (25-75% of the times) see two almost simultaneously initiated orders with the ACME server when Traefik starts, and when this happens, a ACME server interaction failure can follow about 50% of the times. ### My analysis I can spot when one or two ACME orders are placed by Traefik's use of the LEGO library against the ACME server by seeing either one or two lines of the following in the logs: __`acme: Obtaining bundled SAN certificate`__. The logged line above comes from [here in go-acme/lego](https://github.com/go-acme/lego/blob/2da1ce06ea5a6454980e26d1226e2af4c659145f/certificate/certificates.go#L89-L97). When that happens, Traefik probably have invoked it twice, and __I suspect [this section](https://github.com/containous/traefik/blob/7928e6d0cd4c7751a14bea5f74124f8a3cb829c0/pkg/provider/acme/provider.go#L320-L423) is ending up calling the [resolveCertifciate](https://github.com/containous/traefik/blob/7928e6d0cd4c7751a14bea5f74124f8a3cb829c0/pkg/provider/acme/provider.go#L425) function twice__, which in turn calls the go-acme/lego library's obtain function twice. ### Output of `traefik version`: (_What version of Traefik are you using?_) ``` Traefik version 2.2.1 built on 2020-04-29T18:02:09Z ``` ### What is your environment & configuration (arguments, toml, provider, platform, ...)? I've setup Pebble, Let's Encrypts ACME server meant for testing purposes, and start pebble first and await it to be ready, then start Traefik. This is done in separate VMs multiple times over as part of CI tests, and this is where I observe different outcomes between fully decoupled runs. ```yaml # DYNAMIC CONFIGURATION # dynamic.yaml: | http: middlewares: hsts: headers: stsIncludeSubdomains: false stsPreload: false stsSeconds: 15724800 redirect: redirectScheme: permanent: true scheme: https scheme: headers: customRequestHeaders: X-Scheme: https routers: default: entrypoints: - https middlewares: - hsts - scheme rule: PathPrefix(`/`) service: default tls: certResolver: default domains: - main: local.jovyan.org options: default insecure: entrypoints: - http middlewares: - redirect rule: PathPrefix(`/`) service: default services: default: loadBalancer: servers: - url: http://proxy-http:8000/ tls: options: default: cipherSuites: - TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384 - TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 - TLS_ECDHE_ECDSA_WITH_AES_128_GCM_SHA256 - TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 - TLS_ECDHE_ECDSA_WITH_CHACHA20_POLY1305 - TLS_ECDHE_RSA_WITH_CHACHA20_POLY1305 minVersion: VersionTLS12 sniStrict: true # STATIC CONFIGURATION # traefik.yaml: | accessLog: fields: headers: names: Authorization: redacted Cookie: redacted Set-Cookie: redacted X-Xsrftoken: redacted filters: statusCodes: - 500-599 certificatesResolvers: default: acme: caServer: https://pebble/dir email: jovyan@jupyter.test httpChallenge: entryPoint: http storage: /etc/acme/acme.json entryPoints: http: address: :80 https: address: :443 transport: respondingTimeouts: idleTimeout: 10m0s log: level: DEBUG providers: file: filename: /etc/traefik/dynamic.yaml ``` ### If applicable, please paste the log output in DEBUG level (`--log.level=DEBUG` switch) I pasted the logs of an example of two orders being placed below as identified by two lines of `acme: Obtaining bundled SAN certificate`. But other examples are available in the links. More are available: - [one order (always (?) success with one order)](https://travis-ci.org/github/jupyterhub/zero-to-jupyterhub-k8s/jobs/693668494#L851) - [two orders (success)](https://travis-ci.org/github/jupyterhub/zero-to-jupyterhub-k8s/jobs/693668495#L862-L863) - [two orders (failure - lacking DEBUG level on logs)](https://travis-ci.org/github/jupyterhub/zero-to-jupyterhub-k8s/jobs/693624730#L806-L807) ``` Configuration loaded from file: /etc/traefik/traefik.yaml" Traefik version 2.2.1 built on 2020-04-29T18:02:09Z" Static configuration loaded {\"global\":{\"checkNewVersion\":true},\"serversTransport\":{\"maxIdleConnsPerHost\":200},\"entryPoints\":{\"http\":{\"address\":\":80\",\"transport\":{\"lifeCycle\":{\"graceTimeOut\":10000000000},\"respondingTimeouts\":{\"idleTimeout\":180000000000}},\"forwardedHeaders\":{},\"http\":{}},\"https\":{\"address\":\":443\",\"transport\":{\"lifeCycle\":{\"graceTimeOut\":10000000000},\"respondingTimeouts\":{\"idleTimeout\":600000000000}},\"forwardedHeaders\":{},\"http\":{}}},\"providers\":{\"providersThrottleDuration\":2000000000,\"file\":{\"watch\":true,\"filename\":\"/etc/traefik/dynamic.yaml\"}},\"log\":{\"level\":\"DEBUG\",\"format\":\"common\"},\"accessLog\":{\"format\":\"common\",\"filters\":{\"statusCodes\":[\"500-599\"]},\"fields\":{\"defaultMode\":\"keep\",\"headers\":{\"defaultMode\":\"drop\",\"names\":{\"Authorization\":\"redacted\",\"Cookie\":\"redacted\",\"Set-Cookie\":\"redacted\",\"X-Xsrftoken\":\"redacted\"}}}},\"certificatesResolvers\":{\"default\":{\"acme\":{\"email\":\"jovyan@jupyter.test\",\"caServer\":\"https://pebble/dir\",\"storage\":\"/etc/acme/acme.json\",\"keyType\":\"RSA4096\",\"httpChallenge\":{\"entryPoint\":\"http\"}}}}}" \nStats collection is disabled.\nHelp us improve Traefik by turning this feature on :)\nMore details on: https://docs.traefik.io/contributing/data-collection/\n" Starting provider aggregator.ProviderAggregator {}" Start TCP Server" entryPointName=http Start TCP Server" entryPointName=https Starting provider *file.Provider {\"watch\":true,\"filename\":\"/etc/traefik/dynamic.yaml\"}" Starting provider *acme.Provider {\"email\":\"jovyan@jupyter.test\",\"caServer\":\"https://pebble/dir\",\"storage\":\"/etc/acme/acme.json\",\"keyType\":\"RSA4096\",\"httpChallenge\":{\"entryPoint\":\"http\"},\"ResolverName\":\"default\",\"store\":{},\"ChallengeStore\":{}}" Testing certificate renew..." providerName=default.acme Starting provider *traefik.Provider {}" Configuration received from provider file: {\"http\":{\"routers\":{\"default\":{\"middlewares\":[\"hsts\",\"scheme\"],\"service\":\"default\",\"rule\":\"PathPrefix(`/`)\",\"tls\":{\"options\":\"default\",\"certResolver\":\"default\",\"domains\":[{\"main\":\"local.jovyan.org\"}]}},\"insecure\":{\"middlewares\":[\"redirect\"],\"service\":\"default\",\"rule\":\"PathPrefix(`/`)\"}},\"services\":{\"default\":{\"loadBalancer\":{\"servers\":[{\"url\":\"http://proxy-http:8000/\"}],\"passHostHeader\":null}}},\"middlewares\":{\"hsts\":{\"headers\":{\"stsSeconds\":15724800}},\"redirect\":{\"redirectScheme\":{\"scheme\":\"https\",\"permanent\":true}},\"scheme\":{\"headers\":{\"customRequestHeaders\":{\"X-Scheme\":\"https\"}}}}},\"tcp\":{},\"udp\":{},\"tls\":{\"options\":{\"default\":{\"minVersion\":\"VersionTLS12\",\"cipherSuites\":[\"TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384\",\"TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384\",\"TLS_ECDHE_ECDSA_WITH_AES_128_GCM_SHA256\",\"TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256\",\"TLS_ECDHE_ECDSA_WITH_CHACHA20_POLY1305\",\"TLS_ECDHE_RSA_WITH_CHACHA20_POLY1305\"],\"clientAuth\":{},\"sniStrict\":true}}}}" providerName=file Configuration received from provider default.acme: {\"http\":{},\"tls\":{}}" providerName=default.acme Configuration received from provider internal: {\"http\":{\"services\":{\"noop\":{}}},\"tcp\":{},\"tls\":{}}" providerName=internal No entryPoint defined for this router, using the default one(s) instead: [http https]" routerName=default No entryPoint defined for this router, using the default one(s) instead: [http https]" routerName=insecure Creating middleware" routerName=insecure@file middlewareName=pipelining middlewareType=Pipelining serviceName=default entryPointName=https Creating load-balancer" serviceName=default entryPointName=https routerName=insecure@file Creating server 0 http://proxy-http:8000/" serverName=0 entryPointName=https routerName=insecure@file serviceName=default Added outgoing tracing middleware default" entryPointName=https routerName=insecure@file middlewareName=tracing middlewareType=TracingForwarder Creating middleware" entryPointName=https routerName=insecure@file middlewareName=redirect@file middlewareType=RedirectScheme Setting up redirection to https " entryPointName=https routerName=insecure@file middlewareName=redirect@file middlewareType=RedirectScheme Adding tracing to middleware" routerName=insecure@file middlewareName=redirect@file entryPointName=https Creating middleware" middlewareType=Recovery entryPointName=https middlewareName=traefik-internal-recovery Creating middleware" entryPointName=http middlewareName=traefik-internal-recovery middlewareType=Recovery Creating Middleware (ResponseModifier)" middlewareType=Headers routerName=default@file middlewareName=hsts@file entryPointName=http Creating Middleware (ResponseModifier)" routerName=default@file middlewareName=scheme@file entryPointName=http middlewareType=Headers Creating middleware" entryPointName=http serviceName=default middlewareName=pipelining middlewareType=Pipelining routerName=default@file Creating load-balancer" routerName=default@file entryPointName=http serviceName=default Creating server 0 http://proxy-http:8000/" serviceName=default serverName=0 routerName=default@file entryPointName=http Added outgoing tracing middleware default" routerName=default@file middlewareType=TracingForwarder middlewareName=tracing entryPointName=http Creating middleware" middlewareName=scheme@file middlewareType=Headers entryPointName=http routerName=default@file Setting up customHeaders/Cors from %v{map[X-Scheme:https] map[] false [] [] [] [] 0 false [] [] false false map[] false 0 false false false false false false false}" entryPointName=http routerName=default@file middlewareName=scheme@file middlewareType=Headers Adding tracing to middleware" entryPointName=http middlewareName=scheme@file routerName=default@file Creating middleware" entryPointName=http routerName=default@file middlewareName=hsts@file middlewareType=Headers Setting up secureHeaders from %v{map[] map[] false [] [] [] [] 0 false [] [] false false map[] false 15724800 false false false false false false false}" routerName=default@file middlewareName=hsts@file middlewareType=Headers entryPointName=http Adding tracing to middleware" entryPointName=http routerName=default@file middlewareName=hsts@file Creating middleware" entryPointName=http middlewareName=traefik-internal-recovery middlewareType=Recovery Creating middleware" entryPointName=https middlewareName=traefik-internal-recovery middlewareType=Recovery No default certificate, generating one" Looking for provided certificate(s) to validate [\"local.jovyan.org\"]..." providerName=default.acme No default certificate, generating one" Creating middleware" serviceName=default middlewareType=Pipelining middlewareName=pipelining entryPointName=http routerName=insecure@file Creating load-balancer" entryPointName=http routerName=insecure@file serviceName=default Creating server 0 http://proxy-http:8000/" entryPointName=http routerName=insecure@file serviceName=default serverName=0 Added outgoing tracing middleware default" middlewareType=TracingForwarder entryPointName=http routerName=insecure@file middlewareName=tracing Creating middleware" middlewareName=redirect@file middlewareType=RedirectScheme entryPointName=http routerName=insecure@file Setting up redirection to https " middlewareType=RedirectScheme entryPointName=http routerName=insecure@file middlewareName=redirect@file Adding tracing to middleware" entryPointName=http routerName=insecure@file middlewareName=redirect@file Creating middleware" middlewareName=traefik-internal-recovery entryPointName=http middlewareType=Recovery Creating middleware" middlewareType=Recovery entryPointName=https middlewareName=traefik-internal-recovery Creating Middleware (ResponseModifier)" entryPointName=https routerName=default@file middlewareName=hsts@file middlewareType=Headers Creating Middleware (ResponseModifier)" middlewareType=Headers routerName=default@file middlewareName=scheme@file entryPointName=https Creating middleware" entryPointName=https routerName=default@file serviceName=default middlewareName=pipelining middlewareType=Pipelining Creating load-balancer" routerName=default@file serviceName=default entryPointName=https Creating server 0 http://proxy-http:8000/" serverName=0 entryPointName=https routerName=default@file serviceName=default Added outgoing tracing middleware default" entryPointName=https routerName=default@file middlewareName=tracing middlewareType=TracingForwarder Creating middleware" entryPointName=https middlewareName=scheme@file middlewareType=Headers routerName=default@file Setting up customHeaders/Cors from %v{map[X-Scheme:https] map[] false [] [] [] [] 0 false [] [] false false map[] false 0 false false false false false false false}" middlewareType=Headers routerName=default@file entryPointName=https middlewareName=scheme@file Adding tracing to middleware" entryPointName=https routerName=default@file middlewareName=scheme@file Creating middleware" routerName=default@file middlewareType=Headers middlewareName=hsts@file entryPointName=https Setting up secureHeaders from %v{map[] map[] false [] [] [] [] 0 false [] [] false false map[] false 15724800 false false false false false false false}" middlewareName=hsts@file entryPointName=https routerName=default@file middlewareType=Headers Adding tracing to middleware" routerName=default@file entryPointName=https middlewareName=hsts@file Creating middleware" entryPointName=https middlewareName=traefik-internal-recovery middlewareType=Recovery Creating middleware" entryPointName=http middlewareType=Recovery middlewareName=traefik-internal-recovery No default certificate, generating one" Looking for provided certificate(s) to validate [\"local.jovyan.org\"]..." providerName=default.acme Creating middleware" serviceName=default middlewareType=Pipelining middlewareName=pipelining entryPointName=https routerName=insecure@file Creating load-balancer" entryPointName=https routerName=insecure@file serviceName=default Creating server 0 http://proxy-http:8000/" serviceName=default serverName=0 entryPointName=https routerName=insecure@file Added outgoing tracing middleware default" entryPointName=https routerName=insecure@file middlewareType=TracingForwarder middlewareName=tracing Creating middleware" entryPointName=https routerName=insecure@file middlewareName=redirect@file middlewareType=RedirectScheme Setting up redirection to https " routerName=insecure@file middlewareName=redirect@file middlewareType=RedirectScheme entryPointName=https Adding tracing to middleware" middlewareName=redirect@file entryPointName=https routerName=insecure@file Creating middleware" middlewareName=traefik-internal-recovery middlewareType=Recovery entryPointName=https Creating middleware" middlewareName=traefik-internal-recovery middlewareType=Recovery entryPointName=http Creating Middleware (ResponseModifier)" entryPointName=http middlewareType=Headers routerName=default@file middlewareName=hsts@file Creating Middleware (ResponseModifier)" entryPointName=http routerName=default@file middlewareName=scheme@file middlewareType=Headers Creating middleware" middlewareType=Pipelining middlewareName=pipelining entryPointName=http routerName=default@file serviceName=default Creating load-balancer" entryPointName=http routerName=default@file serviceName=default Creating server 0 http://proxy-http:8000/" serverName=0 entryPointName=http routerName=default@file serviceName=default Added outgoing tracing middleware default" entryPointName=http routerName=default@file middlewareName=tracing middlewareType=TracingForwarder Creating middleware" middlewareType=Headers entryPointName=http routerName=default@file middlewareName=scheme@file Setting up customHeaders/Cors from %v{map[X-Scheme:https] map[] false [] [] [] [] 0 false [] [] false false map[] false 0 false false false false false false false}" entryPointName=http routerName=default@file middlewareName=scheme@file middlewareType=Headers Adding tracing to middleware" routerName=default@file middlewareName=scheme@file entryPointName=http Creating middleware" entryPointName=http routerName=default@file middlewareName=hsts@file middlewareType=Headers Setting up secureHeaders from %v{map[] map[] false [] [] [] [] 0 false [] [] false false map[] false 15724800 false false false false false false false}" routerName=default@file middlewareName=hsts@file middlewareType=Headers entryPointName=http Adding tracing to middleware" routerName=default@file middlewareName=hsts@file entryPointName=http Creating middleware" entryPointName=http middlewareName=traefik-internal-recovery middlewareType=Recovery Creating middleware" entryPointName=https middlewareName=traefik-internal-recovery middlewareType=Recovery No default certificate, generating one" Domains [\"local.jovyan.org\"] need ACME certificates generation for domains \"local.jovyan.org\"." providerName=default.acme No default certificate, generating one" Domains [\"local.jovyan.org\"] need ACME certificates generation for domains \"local.jovyan.org\"." providerName=default.acme Loading ACME certificates [local.jovyan.org]..." providerName=default.acme Looking for provided certificate(s) to validate [\"local.jovyan.org\"]..." providerName=default.acme No ACME certificate generation required for domains [\"local.jovyan.org\"]." providerName=default.acme Loading ACME certificates [local.jovyan.org]..." providerName=default.acme Building ACME client..." providerName=default.acme https://pebble/dir" providerName=default.acme level=info msg=Register... providerName=default.acme legolog: [INFO] acme: Registering account for jovyan@jupyter.test" Using HTTP Challenge provider." providerName=default.acme legolog: [INFO] [local.jovyan.org] acme: Obtaining bundled SAN certificate" legolog: [INFO] [local.jovyan.org] acme: Obtaining bundled SAN certificate" legolog: [INFO] [local.jovyan.org] AuthURL: https://pebble/authZ/Ry0-sVL2fgOhfZGNM6B-QHdSRaX3RucC_RfCMbhuWDI" legolog: [INFO] [local.jovyan.org] acme: Could not find solver for: tls-alpn-01" legolog: [INFO] [local.jovyan.org] acme: use http-01 solver" legolog: [INFO] [local.jovyan.org] acme: Trying to solve HTTP-01" legolog: [INFO] [local.jovyan.org] AuthURL: https://pebble/authZ/dlfrb8FCW9AeshOfOGuO_LEm8pt6TMRx1tVPFwvyvls" legolog: [INFO] [local.jovyan.org] acme: Could not find solver for: tls-alpn-01" legolog: [INFO] [local.jovyan.org] acme: use http-01 solver" legolog: [INFO] [local.jovyan.org] acme: Trying to solve HTTP-01" Retrieving the ACME challenge for token Gb56VvHeVTQV3EGQOIm1Y7wvuuS0atJGoWIhL1g0kWA..." providerName=default.acme Retrieving the ACME challenge for token Gb56VvHeVTQV3EGQOIm1Y7wvuuS0atJGoWIhL1g0kWA..." providerName=default.acme Retrieving the ACME challenge for token nxs1Z7D2pk44EjrOV__akhrdG2_WTHltyizUBn8ymlQ..." providerName=default.acme Retrieving the ACME challenge for token Gb56VvHeVTQV3EGQOIm1Y7wvuuS0atJGoWIhL1g0kWA..." providerName=default.acme Retrieving the ACME challenge for token nxs1Z7D2pk44EjrOV__akhrdG2_WTHltyizUBn8ymlQ..." providerName=default.acme Retrieving the ACME challenge for token nxs1Z7D2pk44EjrOV__akhrdG2_WTHltyizUBn8ymlQ..." providerName=default.acme legolog: [INFO] [local.jovyan.org] The server validated our request" legolog: [INFO] [local.jovyan.org] acme: Validations succeeded; requesting certificates" legolog: [INFO] Wait for certificate [timeout: 30s, interval: 500ms]" legolog: [INFO] [local.jovyan.org] Server responded with a certificate." Certificates obtained for domains [local.jovyan.org]" providerName=default.acme Configuration received from provider default.acme: {\"http\":{},\"tls\":{}}" providerName=default.acme Adding certificate for domain(s) local.jovyan.org" No default certificate, generating one" Creating middleware" routerName=insecure@file serviceName=default entryPointName=http middlewareName=pipelining middlewareType=Pipelining Creating load-balancer" routerName=insecure@file serviceName=default entryPointName=http Creating server 0 http://proxy-http:8000/" serverName=0 entryPointName=http routerName=insecure@file serviceName=default Added outgoing tracing middleware default" entryPointName=http routerName=insecure@file middlewareName=tracing middlewareType=TracingForwarder Creating middleware" entryPointName=http routerName=insecure@file middlewareName=redirect@file middlewareType=RedirectScheme Setting up redirection to https " routerName=insecure@file middlewareName=redirect@file middlewareType=RedirectScheme entryPointName=http Adding tracing to middleware" routerName=insecure@file middlewareName=redirect@file entryPointName=http Creating middleware" middlewareType=Recovery entryPointName=http middlewareName=traefik-internal-recovery Creating middleware" middlewareName=traefik-internal-recovery entryPointName=https middlewareType=Recovery Creating Middleware (ResponseModifier)" entryPointName=http routerName=default@file middlewareName=hsts@file middlewareType=Headers Creating Middleware (ResponseModifier)" middlewareType=Headers middlewareName=scheme@file entryPointName=http routerName=default@file Creating middleware" entryPointName=http routerName=default@file serviceName=default middlewareName=pipelining middlewareType=Pipelining Creating load-balancer" routerName=default@file serviceName=default entryPointName=http Creating server 0 http://proxy-http:8000/" routerName=default@file serviceName=default serverName=0 entryPointName=http Added outgoing tracing middleware default" entryPointName=http middlewareName=tracing middlewareType=TracingForwarder routerName=default@file Creating middleware" entryPointName=http routerName=default@file middlewareType=Headers middlewareName=scheme@file Setting up customHeaders/Cors from %v{map[X-Scheme:https] map[] false [] [] [] [] 0 false [] [] false false map[] false 0 false false false false false false false}" routerName=default@file middlewareType=Headers middlewareName=scheme@file entryPointName=http Adding tracing to middleware" middlewareName=scheme@file entryPointName=http routerName=default@file Creating middleware" middlewareType=Headers entryPointName=http routerName=default@file middlewareName=hsts@file Setting up secureHeaders from %v{map[] map[] false [] [] [] [] 0 false [] [] false false map[] false 15724800 false false false false false false false}" entryPointName=http routerName=default@file middlewareName=hsts@file middlewareType=Headers Adding tracing to middleware" routerName=default@file middlewareName=hsts@file entryPointName=http Creating middleware" middlewareName=traefik-internal-recovery middlewareType=Recovery entryPointName=http Creating middleware" middlewareName=traefik-internal-recovery middlewareType=Recovery entryPointName=https Looking for provided certificate(s) to validate [\"local.jovyan.org\"]..." providerName=default.acme No ACME certificate generation required for domains [\"local.jovyan.org\"]." providerName=default.acme legolog: [INFO] [local.jovyan.org] The server validated our request" legolog: [INFO] [local.jovyan.org] acme: Validations succeeded; requesting certificates" ```

matthew.brett · July 3, 2020, 6:01pm

@consideRatio - thank you!

I found and deleted the secret:

$ kubectl get secrets
$ kubectl delete secret proxy-public-tls-acme
$ kubectl get secrets

I found the latest chart from https://jupyterhub.github.io/helm-chart/#development-releases-jupyterhub, which was 0.9.0-n116.h1c766a1.

I then purged and restarted using this chart:

$ helm delete jhub-testing --purge
$ helm upgrade --install jhub-testing jupyterhub/jupyterhub   --namespace jhub-testing --version=0.9.0-n116.h1c766a1 --values config.yaml

Then I checked the logs, but got the same error:

$ kubectl logs pod/$(kubectl get pods -o custom-columns=POD:metadata.name | grep autohttps-) traefik -f

giving:

time="2020-07-03T17:46:42Z" level=error msg="Unable to obtain ACME certificate for domains \"testing.uobhub.org\" : unable to generate a certificate for th
e domains [testing.uobhub.org]: error: one or more domains had a problem:\n[testing.uobhub.org] acme: error: 400 :: urn:ietf:params:acme:error:connection :
: Fetching http://testing.uobhub.org/.well-known/acme-challenge/QfUNDgaKU_3dw_WvkDiPaAADbFAOciVMXCMG99nZCiI: Timeout during connect (likely firewall proble
m), url: \n" providerName=default.acme

Finally, I tried deleting the autohttps pod:

$ kubectl delete pods $(kubectl get pods -o custom-columns=POD:metadata.name | grep autohttps-)

And - hey presto - it worked! Thanks very much for your help.

Do you know why I had to delete, even with the newest chart? Is that something that will be easy to fix in due course?

Cheers,

Matthew

matthew.brett · July 3, 2020, 6:34pm

Hmm - interesting - the exact same procedure also worked for my not-testing cluster - I had to delete the autohttps pod once … I wonder - could it be starting up before the external IP is assigned?

Cheers,

Matthew

Yasharth_Bajpai · July 3, 2020, 7:32pm

I tried a similar approach but doesn’t seem to resolve the issue for me.

matthew.brett · July 3, 2020, 11:14pm

@Yasharth_Bajpai - maybe it’s worth posting the exact steps you took and their output, just in case you missed something, or I missed out a step in what I did?

For example I didn’t record the nslookup output, but it is correct, in that it matches my config.yaml and the output from kubectl get svc --namespace jhub-testing:

$ nslookup testing.uobhub.org
Server:         169.254.169.254
Address:        169.254.169.254#53
Non-authoritative answer:
Name:   testing.uobhub.org
Address: 34.89.20.96

consideRatio · July 5, 2020, 2:16pm

It could be one reason, but not the most common one I think. I’m not sure if Traefik make retries after a while, but if it does, that would only delay the process until a retry would be made.

The key reason for this issue is reported with Traefik, who sometimes end up making multiple requests to the ACME server when only one should be made, and then responds to the wrong challenge. It is on its way to be resolved already, and then we will update to use the new version of Traefik which avoids this issue when they use the LEGO as an ACME client interacting with Let’s Encrypt as an ACME server.

matthew.brett · July 5, 2020, 4:11pm

Interesting - thanks.

Is that multiple request issue compatible with my “Timeout during
connect” error?

Cheers,

Matthew

matthew.brett · July 21, 2020, 5:12pm

Just following up - I have this same problem every time I start my cluster.

For the last four times or so, I did not delete the stored secret, I only deleted the autohttps pod:

kubectl delete pods $(kubectl get pods -o custom-columns=POD:metadata.name | grep autohttps-)

The last time I did this, I had to do it twice.

Only to say then, that deleting the secret does not seem to be relevant in my case.

Cheers,

Matthew

consideRatio · February 25, 2022, 1:20am

See LetsEncrypt certificate generation failing on basic default z2jh / GKE setup · Issue #2601 · jupyterhub/zero-to-jupyterhub-k8s · GitHub as a followup. This issue related to the k8s clusters networking wasn’t setup quick enough after the Pod was scheduled to a node. This won’t happen in all k8s clusters, but is confirmed to be a problem in GKE clusters as 2022-02-25, both when using default settings for a GKE cluster and when using Calico which is an opt-in feature.

mr_z_ro · September 25, 2024, 7:30pm

Hi all, just wanted to share that (a) I really appreciate all of you for posting here and in the associated github issue, and (b) there’s a really critical comment that’s a bit buried in the GitHub thread, which is not replicated here.

Since this Discourse thread is now linked from all over the web, I want to share that comment from @consideRatio explicitly below as well, in hopes it will save someone else the tremendous amounts of time I spent troubleshooting over the past days, before coming across the GitHub comment.

consideRatio commented on Feb 25, 2022

I think I’ve nailed it as I could make sure it didn’t occur by introducing a delay from when the k8s Pod had been scheduled on a node and received an IP. Either by pulling a new image on a node, or by tweaking the startup command to first run sleep 10 before starting Traefik as usual.

I’ve proposed a new feature for Traefik in traefik/traefik#8803. But for now, a workaround could be to redeploy by setting new image tags to force restarts of the autohttps pod. Example config:

proxy:
  traefik:
    image:
      # tag modified to trigger a restart of the autohttps pod
      # and induce a delay while downloading the image
      # that ensures networking gets setup in time
      # which allows the requested ACME challenge
      # where the Pod will receive inbound network traffic
      # can succeed.
      tag: 2.6.0 # default is 2.6.1

Another option to this is to edit the autohttps deployment like this.

kubectl edit deploy autohttps
        # ...
       containers:
       - image: traefik:v2.6.1
+        command: ["sh", "-c", "sleep 10 && /entrypoint.sh traefik"]
         imagePullPolicy: IfNotPresent
         # ...

Topic		Replies	Views
Not able to setup HTTPS on jupyterhub on kubernetes Zero to JupyterHub on Kubernetes	2	755	January 18, 2021
Autohttps pod unable to obtain Lets Encrypt certificate Zero to JupyterHub on Kubernetes security	3	1310	August 2, 2022
TLJH \| Need some help with setting up https and letsencrypt The Littlest JupyterHub help-wanted	2	1377	June 7, 2022
Letsencrypt autohttps Failure in GKE Deployment Zero to JupyterHub on Kubernetes help-wanted	3	614	February 28, 2021
Connection is not secure with automatic HTTPS JupyterHub	3	1094	August 3, 2021

Trouble getting HTTPS / letsencrypt working with 0.9.0-beta.4

Upcoming fix in Traefik’s use of LEGO

Related topics