I've just been talking to mk47 about possible signal fading, propagation delay and interference when daisy-chaining up to 8 TLCs (or 4 boards including the KL25Z interface board).
I might be a little late on that since you guys already ordered your prototype boards, but as a fallback idea, I thought about using the 3 remaining pins (PTC0,3,4 according to schematics ) on the KL25Z board for talking to the TLCs.
The idea is only to chain the 2 TLCs per board.
Assuming that PWM_GSCLK, PWM_XLAT, PWM_SLCK and PWM_BLANK can be connected in parallel to all TLCs throughout the system, only one additional PWM_SIN pin per board (or chain of 2 TLCs) is required.
The resulting 4 available chains of 2 TLCs each (or 4 boards, including the "main" board) would be able to provide the projected 128 outputs (not even including the chime outputs).
You can stick to bus wiring as it is and put 2 more pins on the header or create additional I/O headers (2 more would be required - for 3 expansion boards besides the KL25Z interface board), with one PWM_SIN line on each.
When sticking to bus wiring, one would be required to jumper the expansion board so that the correct PWM_SIN line in the bus is used. Worst case when jumpered wrong: 2 banks of outputs fire at the same time - go cry 
Not an expert on your source code, but I assume only minimal effort is required to let the software know how to use 4 different PWM_SIN lines on the output assigments (should even be calculatable on-the-fly with MOD dividing
)