The stencil pattern represents a vast variety of applications, ranging from geophysics to medical science. In application codes, the stencil kernel is often the part where most of the time is spent, thus forcing an efficient parallel implementation of it. On the other side we know that stencil computations are often memory-bound, which requires sophisticated parallelization techniques to get scalable solutions. In this paper we present the results of a stencil benchmark experiment run on two different systems by means of the PROVA! tool we are currently implementing. PROVA! aims for reproducible performance experiments and makes collaborative stencil benchmarking feasible through web repositories and interfaces.