Kumo locking up not responding at all

wez · March 28, 2024, 3:14pm

so if you were pushing very hard, we could use more memory than the determined soft limit

Solmea · March 28, 2024, 3:14pm

alright.. that softlimit can be set to 2Gb or more?

Solmea · March 28, 2024, 3:14pm

but I will build me the latest verzion and see if that makes a difference

wez · March 28, 2024, 3:14pm

the soft limit defaults to 75% of the physical ram, but you can explicitly set it via cgroup or ulimit

Solmea · March 28, 2024, 3:15pm

right got it… well I upped the server from 4Gb to 8Gb

Solmea · March 28, 2024, 3:15pm

in order to get memory out of the way

wez · March 28, 2024, 3:17pm

depending on how hard you’re pushing, 8gb might be a little tight. I’d definitely recommend running with that commit to keep things within the soft limit

Solmea · March 28, 2024, 3:17pm

I will give it a try with the webhook.

Solmea · March 28, 2024, 3:17pm

and this latest version

Solmea · March 28, 2024, 3:54pm

Build and installed kumomta-dev.2024.03.28.065631.1b451e3a.Debian12.deb, enabled the webhooks again… lets wait and see. Thanks for getting back on this @yearning-hyena

Solmea · March 28, 2024, 3:55pm

and ofc.. thanks @free-spirited-yorksh

Mike · April 1, 2024, 3:42pm

@magnanimous-umbrella any update?

Solmea · April 2, 2024, 11:56am

So far so good.. no lockups yet.. but sending volume have been fairly low the last 4 days. However I just see a sendout which is doing a 30 messages per minute, which is now fully served by Kumo and it is still serving.

Solmea · April 3, 2024, 7:10am

Oh well the happy streak ended at 4:22 NL time. Last log in elastic is a Reception of a mail and in the ‘tailer --tail’ of the kumo logs I see the delivery, which did not make it into logstash:

  "type": "Delivery",
  "id": "0c7c5056f16111ee94ca02666fd40793",
  "sender": "return-to@news.bla.com",
  "recipient": "a.b.c@blabla.com",
  "queue": "webhook.log_hook",
  "site": "unspecified->webhook.log_hook@lua:make.webhook.log_hook",
  "size": 680,
  "response": {
    "code": 200,
    "enhanced_code": null,
    "content": "200 OK: ok",
    "command": null
  },
  "peer_address": {
    "name": "Lua via make.webhook.log_hook",
    "addr": "0.0.0.0"
  },
  "timestamp": 1712110961,
  "created": 1712110961,
  "num_attempts": 0,
  "bounce_classification": "Uncategorized",
  "egress_pool": "unspecified",
  "egress_source": "unspecified",
  "feedback_report": null,
  "meta": {},
  "headers": {},
  "delivery_protocol": "Lua",
  "reception_protocol": "LogRecord",
  "nodeid": "234f64a1-0345-419b-a259-8aad0bbf8ea3"
}

It simply stops responding Looking at memory usage it says that there is 5Gb is still available in zabbix. Systemctl shows: Memory: 836.4M

Solmea · April 3, 2024, 7:34am

Actually it was doing like 50 mails per minute. Logstash had double the amount to handle.. but usually logstash should be capable of that. But for the sake of science I now removed the webhook and will rely on my zabbix monitoring to detect if Kumo still runs. I am a little lost on what this can be.. Can’t be memory on the server.

Solmea · April 3, 2024, 7:50am

Oh well since I am now all in monitoring using te metrics.json.. I noticed that when the connection queue info fills up with domain entries.. I don’t see the memory values anymore in the json info.

Solmea · April 3, 2024, 8:13am

But the Zabbix monitoring show after the stop and start that it is simply sending mail again at a rate of 20-ish mails per minute. It started with 194 deliveries and then was fed new mails again.

wez · April 3, 2024, 1:19pm

Are you saying that the json is incomplete? How many domain entries are we talking about here? Is the regular prometheus data from /metrics complete in that situation?

Mike · April 3, 2024, 1:35pm

Can you post the full conf? It will help with troubleshooting.

wez · April 3, 2024, 1:43pm

Additionally, if/when it is next locked up, please obtain a stack trace. You need lldb for this.

$ lldb -p $(pgrep kumod) -o 'bt all' -o 'quit' > /tmp/kumo-bt.txt

then provide the /tmp/kumo-bt.txt file to us