#7 up to #11 added, in particular w.r.t. discoverability
https://www.complang.tuwien.ac.at/ulrich/iso-prolog/max_arity#9
That is definitely a problem and most of the ISO tests suites I have available either fail here or ignore those cases.
The only way how all systems can discover this flag is via #11.
Which particular test suites are you referring to?
The comparison table suggests that there are many other unrelated issues to resolve. The flag as such is currently an implementation specific extension (5.5.8).