pict-rs, magick and lemmy-ui consuming lots of CPU time

Hi guys,

so recently we had a bit of an influx of users on beehaw.org, which lead to some support questions due to the site being unstable.

I looked at the nginx config and introduced rate limiting and we scaled up from 2→4 CPU cores and 4→8GB RAM. That alleviated these issues a bit, but we still have a load of ~2.0 with just a few hundred/thousand users online.

In my experience with web services this is not a lot of users. What I found out is that a magick process is getting OOM killed frequently, that pict-rs often calls exiftool which causes lots of load and that basically the server.js from Lemmy-UI is not fast enough dealing with requests from nginx.

How do I debug where the bottleneck is? Do you already have an idea on how to reduce the load? We’re still running 0.17.2, but upgraded from pict-rs 0.3.0 to 0.3.1. Is it safe to upgrade to pict-rs 0.3.3 or even 0.4.0? Can you configure pict-rs to not use exiftool? I wouldn’t want to renice it manually if there’s a good way to optimise requests/s.

pinging @dessalines@lemmy.ml @nutomic@lemmy.ml @makotech222@lemmy.ml @CannotSleep420@lemmygrad.ml