Bernoulli sampling theorem #1645

hoheinzollern · 2025-06-19T15:26:46Z

This PR contains the Bernoulli sampling theorem as described in the paper:

Formalizing concentration inequalities in Rocq: infrastructure and automation

by Reynald Affeldt, Alessandro Bruni, Cyril Cohen, Pierre Roux, Takafumi Saikawa

Much of the code leading up to this formalization has already been integrated through various PRs in MathComp and MathComp-Analysis where appropriate. See Figure 2 of the paper for a detailed explanation of the formalized theories developed for this PR, and the list below for references to the merged PRs.

Structure of the proof

The proof is inspired by Rajani's pen and paper proof.

One key lemma is mmt_gen_fun_expectation, which establishes the expectation of the moment generating function of a Bernoulli trial using the product probability measure.
Then follows the proof bernoulli_trial_mmt_gen_fun, which establishes that the moment generating function of a Bernoulli trial is the product of each moment generating function.

One key step in the above proofs is to show:

$E_{\otimes_{n} P} [\prod_{i < n} X_{i}] = \prod_{i < n} E_{P} [X_{i}]$

i.e. that the expectation of the product of $n$ random variables on the power measure of P is the product of the expectations of each variable with probability measure P.

To prove our final results, we need to establish a sequence of key analytical lemmas, namely:

exp2_le8: $e^{2} \leq 8$ , inequality solved by using CoqInterval.
xlnx_lbound_i01: lower bound for $x \cdot \ln (x)$ in the interval $] 0, 1 [$ .
xlnx_ubound_i1y: upper bound for $x \cdot \ln (x)$ for $x > 1$ .

The proof itself is split into a sequence of intermediate (concentration) inequalities:

sampling_ineq1: Concentration inequality on a Bernoulli trial $X$ , bounding the probability of $X \geq (1 + δ) E_{\otimes_{n} P} [X]$ .
sampling_ineq2: Specialization of sampling_ineq1 using xlnx_lbound_i12
sampling_ineq3: Concentration inequality on a Bernoulli trial $X$ , bounding the probability of $X \leq (1 - δ) E_{\otimes_{n} P} [X]$
sampling_ineq4: Combines the previous two inequalities to obtain a bound on the probability of $| X - E_{\otimes_{n} P} [X] | \geq δ E_{\otimes_{n} P} [X]$

Finally, sampling is the main sampling theorem combining the above inequalities.

Notes on the current state of this PR

Much of this formalization has already been integrated in MathComp and MathComp-Analysis, and the goal of the current PR is to serve as a compendium to the paper. A few integration tasks currently remain:

first is the instantiation of semi-norms for Lp spaces, which, due to MathComp-Analysis aiming to be compatible with at least two version of MathComp, will not be integrated until support is dropped for MathComp 2.3. PR Lspace #1230 is meant to track the experiment of integrating Lp spaces in MathComp-Analysis with semi-norms, and is only compatible with MathComp 2.4 onwards.
second is that the theory of iterated product measures (here called power_measure), is not yet integrated into MathComp-Analysis as of version 1.12. The plan is to merge it into the next release.

As these elements get integrated into the main branches, this PR will shrink to contain only the Bernoulli sampling theorem.

PRs leading up to this one

This is a (somewhat complete) list of PRs to Analysis that have been branched out of this sampling theorem:

In additions to these PRs, this development contributed the following commits and PRs to MathComp:

math-comp/math-comp@5b293e7 (seminorm interface)
math-comp/math-comp@23ebefe (interval inference)
generalization of subset_itv math-comp#1380

replaces the sampling PR#1240

a consequence and integral_sum can be generalized from eqType to Type.

proux01 · 2025-07-09T12:54:46Z

CI green

affeldt-aist force-pushed the sampling_20250619 branch 4 times, most recently from bda9c53 to e04452c Compare June 28, 2025 10:27

affeldt-aist mentioned this pull request Jul 3, 2025

Countable product of measurable spaces #1214

Open

affeldt-aist force-pushed the sampling_20250619 branch from e04452c to 48017bb Compare July 7, 2025 03:05

hoheinzollern and others added 21 commits July 8, 2025 00:10

Sampling theorem

11e8a7d

replaces the sampling PR#1240

mfunM no longer needed

3723148

removed finite_prod_fin_num and finite_prod_ge0

6e8c01a

removing subset_itvW_bound

1bc1754

removed gtr0_derive1_homo and ger0_derive1_homo

c8be49c

removed bigcup_mkord_ord

1a7cd3f

removed measurability of tuples

1fbff46

cleaning

2947a5f

rm dup

a693dbd

generalize integral_sum so that integrable_sum_ord becomes

071657d

a consequence and integral_sum can be generalized from eqType to Type.

renaming

f15bf26

rebase

0b25b9e

rm dead code, cleaning, rebase

cf458de

tuple_of_pair

64d49d8

cleaning

ff9e066

generalize ipro to sigma_finite

9884905

cleaning

db9caa8

rm dup code

4ede82a

rebase

9452b23

remove preimage_set1

3b0d4bc

ipro -> power_measure

65a0eff

affeldt-aist force-pushed the sampling_20250619 branch from bf35f13 to 65a0eff Compare July 7, 2025 15:35

nitpicking

43eb2a9

hoheinzollern changed the title ~~Sampling theorem~~ Bernoulli sampling theorem Jul 8, 2025

proux01 and others added 5 commits July 9, 2025 08:36

Adding interval dependency to OPAM package

8baaf86

[CI] Add interval dependency to mathcomp-analysis-stdlib

089fb26

moving sampling.v to analysis_stdlib/

57d7978

[CI] New attempt

ea5d731

[CI] Further fix

2e4192e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Bernoulli sampling theorem #1645

Bernoulli sampling theorem #1645

hoheinzollern commented Jun 19, 2025 •

edited

Loading

Uh oh!

proux01 commented Jul 9, 2025

Uh oh!

Bernoulli sampling theorem #1645

Are you sure you want to change the base?

Bernoulli sampling theorem #1645

Conversation

hoheinzollern commented Jun 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Formalizing concentration inequalities in Rocq: infrastructure and automation

Structure of the proof

Notes on the current state of this PR

PRs leading up to this one

Uh oh!

proux01 commented Jul 9, 2025

Uh oh!

hoheinzollern commented Jun 19, 2025 •

edited

Loading