Crystal frequency is 4xE so E is not 1843.2 KHz but 921.6 KHz --> 57600 baud.
I also made this mistake, think that I had not only read but also highlighted with a yellow underliner the note in the datasheet where it clearly says 4xE, but then, resuming the project between one weekend and the next, I forgot about it
(in this... I can say that compared to paper notes, taking notes on Remarkable2 helps a lot)
so ... to fix the problem without re-desiging the whole PCB, I had to decouple the UART from the CPU and built a "baud-generator" circuit dedicated to its UART.
It uses a counter coupled with a digital oscillator. I have jumpers to select the desired bps.
- - -
Another note: I have developed several asynchronous serial lines in VHDL, and I must tell you that the RX part is much more incline to decode a bit incorrectly than the TX part, and to make it a little more robust I use 2x clock in order to sample the signal more stable.
This way I avoid more bits incorrectly detected, and I can measure it in terms of error rate on long continuous transmissions, where I evaluate the CRC, which is good, but.... it costs 2X clock oversampling.
If this trick is used in ASCI chips, it means that the TX part can be overclocked much more stably than the RX part