Turn off regex default features. #1643

PyroLagus · 2019-10-09T15:51:29Z

Fulfills #1622

PyroLagus · 2019-10-09T15:55:27Z

Should the unicode features (or any others) stay enabled by default?

emilio · 2019-10-11T07:47:01Z

Thank you! This looks reasonable to me... I want to see if CI says something interesting, but right now it's busted because there's no rustfmt in rust nightly (https://rust-lang.github.io/rustup-components-history/).

Should the unicode features (or any others) stay enabled by default?

Perhaps, yeah...

est31 · 2019-10-16T13:29:52Z

@emilio rustfmt should be present again. Could the tests be re-run?

emilio · 2019-10-20T12:31:59Z

This doesn't seem to build:

error: std feature is currently required to build this crate

emilio · 2019-10-20T12:32:14Z

But this looks pretty ok to me otherwise.

PyroLagus · 2019-10-26T17:09:24Z

Odd, it built fine here, but I added std anyways. Maybe I was doing something wrong.

emilio

Thanks!

#1643 disabled many deafault features of the `regex` crate but left the `unicode` meta feature enabled. With the `unicode` feature enabled and `bindgen` as a build dependency, `regex-syntax` (a direct dependency of the `regex` crate) takes 7 seconds to compile as a build dependency in my application. The `unicode` feature includes support for many Unicode character class lookups which I find unlikely that bindgen uses. From https://docs.rs/regex/latest/regex/#unicode-features: > - unicode-age - Provide the data for the Unicode Age property. This > makes it possible to use classes like `\p{Age:6.0}` to refer to all > codepoints first introduced in Unicode 6.0 > - unicode-bool - Provide the data for numerous Unicode boolean > properties. The full list is not included here, but contains > properties like `Alphabetic`, `Emoji`, `Lowercase`, `Math`, > `Uppercase` and `White_Space`. > - unicode-case - Provide the data for case insensitive matching using > Unicode's "simple loose matches" specification. > - unicode-gencat - Provide the data for Unicode general categories. > This includes, but is not limited to, `Decimal_Number`, `Letter`, > `Math_Symbol`, `Number` and `Punctuation`. > - unicode-script - Provide the data for Unicode scripts and script > extensions. This includes, but is not limited to, `Arabic`, `Cyrillic`, > `Hebrew`, `Latin` and `Thai`. > - unicode-segment - Provide the data necessary to provide the > properties used to implement the Unicode text segmentation > algorithms. This enables using classes like `\p{gcb=Extend}`, > `\p{wb=Katakana}` and `\p{sb=ATerm}`. I have retained the `unicode-perl` feature, which gives support for `\w`, `\s` and `\d`, because these character classes were required to get tests to pass. Removing support for these character classes removes the need to compile many data tables, which should significantly reduce compile times.

highfive added the S-awaiting-review label Oct 9, 2019

Lokathor mentioned this pull request Oct 11, 2019

Update Cargo.toml #1645

Closed

PyroLagus force-pushed the disable-regex-default-features branch from 0fc34e8 to c7ffca0 Compare October 19, 2019 17:03

Turn off regex default features.

6a11867

PyroLagus force-pushed the disable-regex-default-features branch from c7ffca0 to 6a11867 Compare October 26, 2019 17:08

emilio approved these changes Oct 26, 2019

View reviewed changes

emilio merged commit 18a64e6 into rust-lang:master Oct 26, 2019

PyroLagus deleted the disable-regex-default-features branch October 27, 2019 12:08

kulp mentioned this pull request Jun 2, 2022

Turning off default features of regex #1622

Closed

lopopolo mentioned this pull request Dec 27, 2023

Deactivate many regex Unicode crate features #2702

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Turn off regex default features. #1643

Turn off regex default features. #1643

Uh oh!

PyroLagus commented Oct 9, 2019

Uh oh!

PyroLagus commented Oct 9, 2019

Uh oh!

emilio commented Oct 11, 2019

Uh oh!

est31 commented Oct 16, 2019

Uh oh!

emilio commented Oct 20, 2019

Uh oh!

emilio commented Oct 20, 2019

Uh oh!

PyroLagus commented Oct 26, 2019

Uh oh!

emilio left a comment

Uh oh!

Uh oh!

Turn off regex default features. #1643

Turn off regex default features. #1643

Uh oh!

Conversation

PyroLagus commented Oct 9, 2019

Uh oh!

PyroLagus commented Oct 9, 2019

Uh oh!

emilio commented Oct 11, 2019

Uh oh!

est31 commented Oct 16, 2019

Uh oh!

emilio commented Oct 20, 2019

Uh oh!

emilio commented Oct 20, 2019

Uh oh!

PyroLagus commented Oct 26, 2019

Uh oh!

emilio left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!