incorrect output and segfaults from sprintf with %*d (PR#13667) - R-devel

Wacek Kusnierczyk · 2009-04-23T13:05:21Z

Martin Maechler wrote: >>>>>> "vQ" == Wacek Kusnierczyk >>>>>> on Thu, 23 Apr 2009 11:49:54 +0200 writes: >>>>>> > > vQ> maechler at stat.math.ethz.ch wrote: > >> > vQ> sprintf has a documented limit on strings included in the output using the > vQ> format '%s'. It appears that there is a limit on the length of strings included > vQ> with, e.g., the format '%d' beyond which surprising things happen (output > vQ

Wacek Kusnierczyk

Thu, Apr 23, 2009 6:05 AM #

Martin Maechler wrote:

martin, ?sprintf says:

" There is a limit of 8192 bytes on elements of 'fmt' and also on
     strings included by a '%s' conversion specification."

for me, it's clear that the *elements of fmt* cannot be longer than 8192
bytes, and that each single bit included in the output in place of a %s
cannot be longer than 8192.  nowhere does it say that strings included
in the output in place of a %d, for example, cannot be longer than
8192.  the fact that %s is particularized makes me infer that there is
something specific to %s that does not apply to %d, for example,
otherwise the help would have been formulated differently.  (though
given how r help pages are written, nothing seems unlikely.)

and in fact, the limit does not seem to apply in an obvious way in cases
such as sprintf('%*d', 10000, 1), where the output is correct.  at the
very least, the documentation leaves the user ignorant as to what will
happen if the limit is exceeded.

maybe, i can't judge without carefully examining the code for sprintf.c
(which i am rather unwilling to do, having had a look at a sample).

would be interesting to see how this improves sprintf.

great, i'm going to torture the fix soon ;)

i don't see how i could be more polite here, i had absolutely no
intention to be impolite and didn't think i were. 

i gave a serious answer by means of a serious question.  increasing an
arbitrary, poorly documented limit of obscure effect is hardly any
solution.  suggesting that a bug is not a bug because some limit is not
likely to be exceeded in practice is not a particularly good idea.

no, i didn't read that into your posting, i'm just referring to the
state of the 'art' in r.

cheers,
vQ