test: add new corner cases for linguist v7.26

test plan
 - go test -run '^Test_EnryOnLinguistCorpus$' github.com/go-enry/go-enry/v2
This commit is contained in:
Alex Bezzubov 2023-09-22 13:38:49 +02:00
parent cc878e354c
commit bd3630419c
2 changed files with 8 additions and 0 deletions

View File

@ -160,6 +160,8 @@ The `enry` library is based on the data from `github/linguist` version **v7.26.0
Parsing [linguist/samples](https://github.com/github/linguist/tree/master/samples) the following `enry` results are different from the Linguist:
- [Heuristic for ".plist" extension](https://github.com/github-linguist/linguist/blob/b5432ebc7e78f25415b98d48c2fbacddbf8df317/lib/linguist/heuristics.yml#L524) in 'XML Property List', due to unsupported backreference in RE2 regexp engine.
- [Heuristics for ".txt" extension](https://github.com/github/linguist/blob/8083cb5a89cee2d99f5a988f165994d0243f0d1e/lib/linguist/heuristics.yml#L521) in Vim Help File could not be parsed, due to unsupported negative lookahead in RE2 regexp engine.
- [Heuristics for ".sol" extension](https://github.com/github/linguist/blob/8083cb5a89cee2d99f5a988f165994d0243f0d1e/lib/linguist/heuristics.yml#L464) in Solidity could not be parsed, due to unsupported negative lookahead in RE2 regexp engine.

View File

@ -33,6 +33,12 @@ func (s *linguistCorpusSuite) TestLinguistSamples() {
"anti-facebook.txt": true,
"fake-news.txt": true,
"test_rules.txt": true,
// backreference in .plist heuristics for "XML Property List" language https://github.com/go-enry/go-enry/pull/169#discussion_r1319889500
// upsteam fix comming in https://github.com/go-enry/go-enry/pull/169#issuecomment-1708840755
"ff-man.plist": true,
"info.min.plist": true,
"info.plist": true,
"man.plist": true,
}
var total, failed, ok, other int