Oh, you probably want a D2PAK then, not one of those PDFN5X6 variants. I don't know why ZthJC continues going up past 10ms, *possibly* the source bond clip soaking some heat, but in general ZthJC (to *case*) is over and done in a couple ms (here, about 10, less the source clip), and then it's up to whatever's around it.
Which, if you don't have solid metal backing up the tab, that's the end of it, ZthJA skyrockets into the 100ms range. You need ZthJA to do this estimation, not JC. (Note, it's no worse than 1:1, or 10x Zth for 10x t, being the constant heat capacity line assuming no new material is being heated.)
Anyway, D2PAK is more material. Thicker tab. That's it. Chip can be identical, same SOA. Could also use a metal core PCB, or one of those boutique fabs with metal pillars buried into it (or machined or whatever), but... right.
Also, you probably want to do a real simulation, because pulse power varies with time, and so does heat as it's spreading out. A transient thermal model can be fitted to the curve, and then connected to some manner of heatsink (to the extent one can be turned into a transient model). You will most likely find PCB alone isn't enough, and a direct clamped heatsink of some substance is required (even just a slug of metal).
Or put the TVS out in front, in series with a rectifier big enough to handle the surge automatically (same idea, it'll just have a higher Tj(max) and thicker leadframe). Saves MOSFET size. The rect + FET is going to be bigger than just using a larger MOSFET though.
Note that TO-220 (standing without a heatsink) is about the same as D2PAK, if verticality is any help. And can be clamped to a slug of metal, or heatsink proper, if needed.
Tim