All software DDR3 controller update....
16 read, 16 write ports with smart cache, configurable priority encoding and data width per port done.
4096x2160 screen scrolling on a 1080p 32bit color out HDMI with geometry test drawing program done.
Auto DDR3 PLL calibration finally done.

!!!400MHz with data integrity!!!

With a +/- 3 PLL tuning step valid data window.
Yes, Altera/Intel's software DDR3 solution is only rated at 300MHz. Not even the minimum 303MHz rated by the DDR3 spec.
It also only has a single read and write port.
It's Altera's hardware dependent PHY which has multiport and runs at 400MHz.
Things left to do:
Upgrade my software FIFOs in my multiport module as they currently have a bottleneck of 125MHz because of the way they are written.
Improve the cross clock domain boundary as the DDR3 sequencer sends commands to the DDR3 CK clock domain. (The solution I'm looking at might get the DDR3 running at 500MHz on a -6, but definitely working at >300MHz on a -8, even on old Cyclone III & IV.)
The goal for the above 2 is at least 300MHz allowing for a 'Full Rate' controller where as if you want the 400MHz (officially overclocking Altera's IOs), you will probably need to run it at 'Half Rate', or 'Quarter Rate'.
Without the Multiport module, IE, directly communicating with my DDR3 controller, 1 read & 1 write port, the controller consumes just under 3K logic gates for a 16bit DDR3 ram chip. And it is smart, keeps banks open and aware when you access across banks, it will keep banks open if it can, only close and open individual banks as it needs to.
I should be ready to post the source next week.
Since your video output on the final board will be limited to 720p@60hz, or 1080p@30hz, and you will have 2 DDR3 16 bit ram chips, here is what the maximum video layers you can produce at the following bit depths:
Color depth -> 1080p@60Hz, 720p@60hz/1080p@30hz, 480p@60hz
32bit 4 , 8 , 22
16bit 8 , 16 , 44
8bit 16 , 32 , 88 (WTF can you do with 88 layers?)
This does not include the additional 15 layers you get with the core ram's 128kb memory.
Anyways, @nockieboy, make sure your DECA board works and you can at least output the HDMI test program by sending it just the .sof file in the deca example source code demos. I will be providing a DDR3 demo .sof file in a few days.