diff options
author | ThibG <thib@sitedethib.com> | 2018-12-17 19:19:45 +0100 |
---|---|---|
committer | Eugen Rochko <eugen@zeonfederated.com> | 2018-12-17 19:19:45 +0100 |
commit | e709b8da0d685d3cc48d430a9761896094f67d72 (patch) | |
tree | 8d60d548237c7478bb974e2813288fdda11a8862 /spec/fixtures/requests | |
parent | 4ede51743e5b9121a49e9131f91cf012fab410f8 (diff) |
Ignore low-confidence CharlockHolmes guesses when parsing link cards (#9510)
* Add failing test for windows-1251 link cards * Ignore low-confidence CharlockHolmes guesses Fixes #9466 * Fix no method error when charlock holmes cannot detect charset
Diffstat (limited to 'spec/fixtures/requests')
-rw-r--r-- | spec/fixtures/requests/windows-1251.txt | 17 |
1 files changed, 17 insertions, 0 deletions
diff --git a/spec/fixtures/requests/windows-1251.txt b/spec/fixtures/requests/windows-1251.txt new file mode 100644 index 000000000..f573e28b2 --- /dev/null +++ b/spec/fixtures/requests/windows-1251.txt @@ -0,0 +1,17 @@ +HTTP/1.1 200 OK +server: nginx +date: Wed, 12 Dec 2018 13:14:03 GMT +content-type: text/html +content-length: 190 +accept-ranges: bytes + +<!DOCTYPE html> +<html> +<head> + <meta http-equiv="Content-Type" content="text/html; charset=windows-1251" /> + <title>ñýìïë òåêñò</title> +</head> +<body> + <p>ñýìïë òåêñò</p> +</body> +</html> |