Thanks for the answers.
The Robert Feranec video shows the approach I took.
I used the xSignals DDR4 wizard to create the xSignals between all the devices, together with the Matched Length rules.
He also has a video about xSignals:
Then you have to route the traces as well as you can (keeping all signals of a byte lane on the same layer, etc.).
After that you can start length matching using the interactive length tuning tool.
But this is an iterative and tedious process that takes a long time.
Competing tools can do these two steps automatically within a few clicks, hence my BugCrunch idea.
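
To illustrate why the manual length matching is so repetitive, here is a minimal Python sketch of the bookkeeping involved for one byte lane. The net names, lengths, and tolerance are made-up example values, and `tuning_needed` is a hypothetical helper, not anything from Altium; it only assumes that every net is tuned up to the longest net in the group.

```python
def tuning_needed(lengths_mm, tolerance_mm):
    """Return the extra length (mm) each net needs to reach the longest net.

    Nets whose shortfall is already within the tolerance get 0.0.
    """
    target = max(lengths_mm.values())  # match everything to the longest net
    extra = {}
    for net, length in lengths_mm.items():
        shortfall = target - length
        # Only nets outside the tolerance window need tuning.
        extra[net] = round(shortfall, 3) if shortfall > tolerance_mm else 0.0
    return extra


# Hypothetical routed lengths for one byte lane (example values only).
byte_lane_0 = {"DQ0": 41.20, "DQ1": 39.85, "DQ2": 41.10, "DQS0": 41.30}
print(tuning_needed(byte_lane_0, tolerance_mm=0.25))
```

Every time a trace is rerouted the lengths change, so this check has to be repeated for each byte lane, which is the iterative part I would like the tool to automate.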
