Perhaps obvious to most everyone, but the default filter list was shorter than I expected:['ASCII','UTF-8'], in that order.c. 2024, 60% of websites globally declared charset 'UTF-8'So if you're experimenting with multibyte encodings other than UTF-8, you have to specify your detect_order, choosing from the list of implemented filters.