Add CSmith GitHub action #5700

tautschnig · 2020-12-27T08:52:53Z

Run 10 randomly generated CSmith tests as a GitHub action to perform an
integration test involving CBMC and goto-instrument --dump-c. Each test
is first compiled and run using GCC to obtain a checksum, which is then
included as an assertion for CBMC to verify.

Each commit message has a non-empty body, explaining why the change was made.
n/a Methods or procedures I have added are documented, following the guidelines provided in CODING_STANDARD.md.
n/a The feature or user visible behaviour I have added or modified has been documented in the User Guide in doc/cprover-manual/
Regression or unit tests are included, or existing tests cover the modified code (in this case I have detailed which ones those are in the commit message).
n/a My commit message includes data points confirming performance improvements (if claimed).
My PR is restricted to a single feature or bugfix.
n/a White-space or formatting changes outside the feature-related changed lines are in commits of their own.

codecov · 2020-12-27T09:32:11Z

Codecov Report

Merging #5700 (9b56cce) into develop (deba4ae) will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff            @@
##           develop    #5700   +/-   ##
========================================
  Coverage    69.54%   69.54%           
========================================
  Files         1243     1243           
  Lines       100700   100700           
========================================
  Hits         70036    70036           
  Misses       30664    30664

Flag	Coverage Δ
cproversmt2	`43.29% <ø> (ø)`
regression	`66.44% <ø> (ø)`
unit	`32.22% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update deba4ae...9b56cce. Read the comment docs.

hannes-steffenhagen-diffblue · 2021-01-11T15:47:33Z

This seems like a good idea but the script could use some more explanation for what is going on there.

tautschnig · 2021-01-11T18:09:21Z

This seems like a good idea but the script could use some more explanation for what is going on there.

Indeed. I've added comments!

martin-cs

I /really/ like the concept but ... non-deterministic things make me nervous. I would be happier if the scripts/csmith.sh took a random seed and csmith.yml set it to the hash of the diff or something similar that could be locally reproduced.

I'm also a little unsure about what you feel should happen if you find issues of this form. Is it the job of the person who submitted the PR to fix them? Raise issues?

hannes-steffenhagen-diffblue · 2021-01-12T14:19:55Z

@martin-cs I'd say failure for these is not PR raiser responsibility, but if something does come up we should definitely take note.

hannes-steffenhagen-diffblue

IMHO "cbmc matches csmith test behaviour" and "goto-cc --> dump-c should yield program with same behaviour" are two different tests and probably shouldn't be in the same script, but otherwise I think this is a good addition for finding potential problems hopefully before users run into them.

NlightNFotis

This is great!

Run 10 randomly generated CSmith tests as a GitHub action to perform an integration test involving CBMC and goto-instrument --dump-c. Each test is first compiled and run using GCC to obtain a checksum, which is then included as an assertion for CBMC to verify.

tautschnig · 2021-01-12T16:04:06Z

I /really/ like the concept but ... non-deterministic things make me nervous. I would be happier if the scripts/csmith.sh took a random seed and csmith.yml set it to the hash of the diff or something similar that could be locally reproduced.

Very good point - I've now modified the script to:

Generate a seed (using date +%s) and print that seed.
Add the ability to re-run with a particular seed.

I'm also a little unsure about what you feel should happen if you find issues of this form. Is it the job of the person who submitted the PR to fix them? Raise issues?

I've done multiple thousands of runs of CSmith in recent weeks, and am thus fairly confident that the likelihood of a PR tripping this up by a bug outside the changes introduced in said PR are fairly low. Thus I'd argue it is the job of the person who submitted the PR to get this back in working order.

tautschnig · 2021-01-12T16:06:44Z

IMHO "cbmc matches csmith test behaviour" and "goto-cc --> dump-c should yield program with same behaviour" are two different tests and probably shouldn't be in the same script, but otherwise I think this is a good addition for finding potential problems hopefully before users run into them.

Yes, having two separate jobs might be nicer, but I'm a bit hesitant to have almost-the-same-script twice in the repository. So I'd for now stick with the current solution. If we find that we regularly only run into bugs in one or the other part then that'll provide an incentive to improve this.

martin-cs · 2021-01-12T17:25:42Z

@tautschnig : sounds great; let's do it!

tautschnig added the Tests label Dec 27, 2020

tautschnig requested a review from a team as a code owner December 27, 2020 08:52

tautschnig force-pushed the csmith branch from 04a2e06 to 260b21a Compare December 27, 2020 11:59

tautschnig mentioned this pull request Jan 2, 2021

Make output of --dump-c --use-system-headers compilable #249

Closed

tautschnig force-pushed the csmith branch from 260b21a to a5095f8 Compare January 11, 2021 18:08

martin-cs approved these changes Jan 11, 2021

View reviewed changes

hannes-steffenhagen-diffblue approved these changes Jan 12, 2021

View reviewed changes

NlightNFotis approved these changes Jan 12, 2021

View reviewed changes

tautschnig force-pushed the csmith branch from a5095f8 to 0031f7e Compare January 12, 2021 15:52

Add CSmith GitHub action

9b56cce

Run 10 randomly generated CSmith tests as a GitHub action to perform an integration test involving CBMC and goto-instrument --dump-c. Each test is first compiled and run using GCC to obtain a checksum, which is then included as an assertion for CBMC to verify.

tautschnig force-pushed the csmith branch from 0031f7e to 9b56cce Compare January 12, 2021 16:00

tautschnig merged commit 5cac14f into diffblue:develop Jan 12, 2021

tautschnig deleted the csmith branch January 12, 2021 17:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add CSmith GitHub action #5700

Add CSmith GitHub action #5700

Uh oh!

tautschnig commented Dec 27, 2020

Uh oh!

codecov bot commented Dec 27, 2020 •

edited

Loading

Uh oh!

hannes-steffenhagen-diffblue commented Jan 11, 2021

Uh oh!

tautschnig commented Jan 11, 2021

Uh oh!

martin-cs left a comment

Uh oh!

hannes-steffenhagen-diffblue commented Jan 12, 2021

Uh oh!

hannes-steffenhagen-diffblue left a comment

Uh oh!

NlightNFotis left a comment

Uh oh!

tautschnig commented Jan 12, 2021

Uh oh!

tautschnig commented Jan 12, 2021

Uh oh!

martin-cs commented Jan 12, 2021

Uh oh!

Uh oh!

Add CSmith GitHub action #5700

Add CSmith GitHub action #5700

Uh oh!

Conversation

tautschnig commented Dec 27, 2020

Uh oh!

codecov bot commented Dec 27, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

hannes-steffenhagen-diffblue commented Jan 11, 2021

Uh oh!

tautschnig commented Jan 11, 2021

Uh oh!

martin-cs left a comment

Choose a reason for hiding this comment

Uh oh!

hannes-steffenhagen-diffblue commented Jan 12, 2021

Uh oh!

hannes-steffenhagen-diffblue left a comment

Choose a reason for hiding this comment

Uh oh!

NlightNFotis left a comment

Choose a reason for hiding this comment

Uh oh!

tautschnig commented Jan 12, 2021

Uh oh!

tautschnig commented Jan 12, 2021

Uh oh!

martin-cs commented Jan 12, 2021

Uh oh!

Uh oh!

codecov bot commented Dec 27, 2020 •

edited

Loading