Splicing the short 500s to the 750s individually and then connecting the 500s together should, if anything, improve the balance of the currents through the 500s and into the termination. That's because the larger resistance of the long 750s will dominate over the small resistances of the individual termination connections and therefore establish how well the currents are shared, even if these small termination resistances have mismatch.
That's a good point too.
If anything, requiring you to make perform your splice where you bring all wires together, is counterproductive to the intent of the context rules of parallel conductors.
The language of "joined at both ends" in the beginning of the 310.10(H) needs to use that language, for defining what it means to use conductors in parallel in the first place, but there is no physical basis for not allowing splicing/reducing within an individual set.. That is, as long as all the remaining sets also get spliced and reduced in the same manner. If that were the case, then every time you make a splice even between identical sized conductors, you'd need to rejoin them in parallel.
To give an example, suppose you had about 100 ft of 750's, and about 1 foot of 500's at the termination.
We know that there is going to be some practical amount of inconsistency in length on both of these dimensions, even if the code gives no margin of error to address how much error is allowable. Suppose the two lengths of 750's were 101 ft and 100 ft, and that the two lengths of 500's were 13" and 12". That takes a disciplined amount of skill to get it that close. Going by the approximation that resistance is proportional to length/kcmil, the following is how it divides:
Combining the 750's in this example on their own, results in a 50.25% / 49.75% division of the current.
Combining the short lengths of 500's on their own, results in a 52% / 48% division of the current.
Splice together the sizes without paralleling, and when the two longest lengths are in series, you get a 50.3%/49.7% division of the current.
So you can see that the 1" of difference for the short whip at the termination is a lot more significant than the 1 foot of difference for the majority of the run. By requiring you to parallel prior to splicing, you end up with a greater split between the way the current divides between the two parallel sets.