dirkf [Fri, 4 Aug 2023 21:54:12 +0000 (22:54 +0100)]
[S4C] Add extractor for Sianel Pedwar Cymru
* from https://github.com/yt-dlp/yt-dlp/pull/7730, thx ifan-t, bashonly
dirkf [Sun, 30 Jul 2023 20:50:52 +0000 (21:50 +0100)]
[utils] Advertise optional supported `Content-Encoding`s
dirkf [Sun, 30 Jul 2023 20:49:58 +0000 (21:49 +0100)]
[utils] Add and use `filter_dict()` from yt-dlp
dirkf [Sun, 30 Jul 2023 20:47:48 +0000 (21:47 +0100)]
[utils] Revise `isinstance()` tests (especially for str/unicode/bytes) to complete Linter fix
dirkf [Sun, 30 Jul 2023 20:45:57 +0000 (21:45 +0100)]
[utils] Avoid comparing `type(var)`, etc, to pass new Linter rules
dirkf [Fri, 28 Jul 2023 05:19:15 +0000 (06:19 +0100)]
[utils] Rework URL path munging for ., .. components
* move processing to YoutubeDLHandler
* also process `Location` header for redirect
* use tests from https://github.com/yt-dlp/yt-dlp/pull/7662
dirkf [Fri, 28 Jul 2023 05:03:14 +0000 (06:03 +0100)]
[utils] Rework decoding of `Content-Encoding`s
* support nested encodings
* support optional `br` encoding, if brotli package is installed
* support optional 'compress' encoding, if ncompress package is installed
* response `Content-Encoding` has only unprocessed encodings, or removed
* response `Content-Length` is decoded length (usable for filesize metadata)
* use zlib for both deflate and gzip decompression
* some elements taken from yt-dlp: thx especially coletdjnz
dirkf [Fri, 28 Jul 2023 09:52:04 +0000 (10:52 +0100)]
[workflows/ci.yml] Update to setup-java@v3
* avoid Node 12 deprecation
dirkf [Tue, 25 Jul 2023 14:11:15 +0000 (15:11 +0100)]
[build] Fix typo in `devscripts/fish-completion.py` (fix
2285605)
dirkf [Mon, 24 Jul 2023 23:22:54 +0000 (00:22 +0100)]
[build] Extend use of `devscripts/utils`
dirkf [Mon, 24 Jul 2023 23:17:15 +0000 (00:17 +0100)]
[compat] Use `compat_open()`
dirkf [Mon, 24 Jul 2023 22:43:36 +0000 (23:43 +0100)]
[build] Add and use `devscripts/utils`
dirkf [Mon, 24 Jul 2023 02:30:28 +0000 (03:30 +0100)]
[utils] Fix update_Request() with empty data (not None)
dirkf [Thu, 20 Jul 2023 18:32:29 +0000 (19:32 +0100)]
[test/test_execution.py] Use `compat_subprocess_get_DEVNULL()`
dirkf [Thu, 20 Jul 2023 17:49:48 +0000 (18:49 +0100)]
[build] Fix various Jython CI and test issues
dirkf [Thu, 20 Jul 2023 15:36:54 +0000 (16:36 +0100)]
[workflows/ci.yml] Build 3.12 with pyenv
dirkf [Thu, 20 Jul 2023 14:49:46 +0000 (15:49 +0100)]
[doc] Update developer guidance
* mention pynose
* mention traverse_obj and add/revise examples
[skip ci]
dirkf [Thu, 20 Jul 2023 11:42:46 +0000 (12:42 +0100)]
[workflows/ci.yml] Extend Python versions
* add 3.10 - 3.12
* use https://pypi.org/project/pynose/ for Py >= 3.9
* test Windows with 3.4
* set defaults (main, both) except push: (all, core)
dirkf [Thu, 20 Jul 2023 04:29:59 +0000 (05:29 +0100)]
[utils] Remove stray undocumented Host header in redirect (fix
46fde7c)
dirkf [Wed, 19 Jul 2023 13:14:50 +0000 (14:14 +0100)]
[InfoExtractor] Add `_match_valid_url()` class method and refactor
* API compatible with yt-dlp
* also support Sequence of patterns in _VALID_URL
* one place to compile _VALID_URL
* TODO: remove existing extractor shims
dirkf [Wed, 19 Jul 2023 12:01:02 +0000 (13:01 +0100)]
[utils] Fix broken Py 3.11+ compat in `traverse_obj()`
* inspect.getargspec is missing despite doc claiming backward compat
* replace with emulation of `Signature.bind()`
dirkf [Fri, 5 May 2023 18:25:42 +0000 (19:25 +0100)]
[InfoExtractor] Add search methods for Next/Nuxt.js from yt-dlp
* add _search_nextjs_data(), from https://github.com/yt-dlp/yt-dlp/pull/1386
thanks selfisekai
* add _search_nuxt_data(), from https://github.com/yt-dlp/yt-dlp/pull/1921,
thanks Lesmiscore, pukkandan
* add tests for the above
* also fix HTML5 type recognition and tests, from
https://github.com/yt-dlp/yt-dlp/commit/
222a230871fe4fe63f35c49590379c9a77116819,
thanks Lesmiscore
* update extractors in PR using above, fix tests.
dirkf [Wed, 3 May 2023 23:08:26 +0000 (00:08 +0100)]
[Clipchamp] Add new extractor back-ported from yt-dlp
dirkf [Wed, 3 May 2023 09:07:35 +0000 (10:07 +0100)]
[DLF] Add site extractors back-ported from yt-dlp
* from https://github.com/yt-dlp/yt-dlp/pull/6697, thanks nick-cd
dirkf [Wed, 3 May 2023 09:04:46 +0000 (10:04 +0100)]
[Whyp] Add extractor back-ported from yt-dlp
* from https://github.com/yt-dlp/yt-dlp/pull/6803, thanks CoryTibbettsDev
dirkf [Wed, 3 May 2023 09:02:25 +0000 (10:02 +0100)]
[GlobalPlayer] Add site extractors back-ported from yt-dlp
* from https://github.com/yt-dlp/yt-dlp/pull/6903, thanks garret1317
dirkf [Wed, 3 May 2023 12:08:58 +0000 (13:08 +0100)]
[InfoExtractor] Support groups in _`search_regex()`, etc
dirkf [Wed, 3 May 2023 09:54:52 +0000 (10:54 +0100)]
[utils] Minor updates (merge_dicts, T)
A couple of mods to ease yt-dlp back-ports:
* add kwargs to merge_dicts:
`unblank=True` (disallow empty string), `rev=False` (reverse the merge list)
* add `T(x)` shortcut for `{x}`, unsupported in Py2.6
dirkf [Wed, 3 May 2023 11:06:34 +0000 (12:06 +0100)]
[utils] Improve js_to_json, align with yt-dlp
* support variable substitution, from https://github.com/yt-dlp/yt-dlp/pull/#521 etc,
thanks ChillingPepper, Grub4k, pukkandan
* improve escape handling, from https://github.com/yt-dlp/yt-dlp/pull/#521
thanks Grub4k
* support template strings from https://github.com/yt-dlp/yt-dlp/pull/6623
thanks Grub4k
* add limited `!` evaluation (eg, !!0 -> false, see tests)
dirkf [Wed, 3 May 2023 11:40:09 +0000 (12:40 +0100)]
[utils] Align traverse_obj() with yt-dlp
Thanks Grub4k for these:
* traverse `Iterable`s, from https://github.com/yt-dlp/yt-dlp/pull/6902, etc
* traverse `set` key for transformations/filters, `re.Match` group names, from
https://github.com/yt-dlp/yt-dlp/commit/
776995bc109c5cd1aa56b684fada2ce718a386ec, etc
* traverse `re.Match`es, from https://github.com/yt-dlp/yt-dlp/pull/5174
* always return list when branching, from https://github.com/yt-dlp/yt-dlp/pull/5170
dirkf [Mon, 17 Jul 2023 19:39:11 +0000 (20:39 +0100)]
[compat] Fix old Pythons broken loading of valueless cookie attributes
Cookie string parsing in Py 2.6.9, probably earlier, requires `=`.
Also 3.2, though the CPython code appears to be OK: 3.1 was also wrong.
dirkf [Thu, 13 Jul 2023 19:14:50 +0000 (20:14 +0100)]
[test] Fixes for old Pythons
dirkf [Tue, 11 Jul 2023 20:51:32 +0000 (21:51 +0100)]
[doc] Warn against setting cookies with --add-header
bashonly [Tue, 4 Jul 2023 21:40:56 +0000 (16:40 -0500)]
[downloader/external] Fix cookie support
Simon Sawicki [Tue, 4 Jul 2023 19:41:04 +0000 (21:41 +0200)]
[core] Process header cookies on loading
bashonly [Tue, 4 Jul 2023 19:03:39 +0000 (14:03 -0500)]
[utils] `YoutubeDLCookieJar`: Add `get_cookie_header` and `get_cookies_for_url` methods
dirkf [Wed, 7 Jun 2023 18:38:54 +0000 (19:38 +0100)]
[core] Remove `Cookie` header on redirect to prevent leaks
Adated from yt-dlp/yt-dlp-ghsa-v8mc-9377-rwjj/pull/1/commits/
101caac
Thx coletdjnz
dirkf [Wed, 7 Jun 2023 13:51:50 +0000 (14:51 +0100)]
[core] Update redirect handling from yt-dlp
* Thx coletdjnz: https://github.com/yt-dlp/yt-dlp/pull/7094
* add test that redirected `POST` loses its `Content-Type`
dirkf [Fri, 7 Jul 2023 17:51:38 +0000 (18:51 +0100)]
[compat] Add Request and HTTPClient compat for redirect
* support `method` parameter of `Request.__init__` (Py 2 and old Py 3)
* support `getcode` method of compat_http_client.HTTPResponse (Py 2)
dirkf [Mon, 17 Jul 2023 19:47:58 +0000 (20:47 +0100)]
[test] Make skipped tests in test_execution work with Py 2.6
dirkf [Fri, 7 Jul 2023 17:45:31 +0000 (18:45 +0100)]
[core] Align error reporting methods with yt-dlp
dirkf [Thu, 6 Jul 2023 14:46:22 +0000 (15:46 +0100)]
[utils] Add {expected_type} and Iterable support to traverse_obj()
dirkf [Fri, 7 Jul 2023 17:41:32 +0000 (18:41 +0100)]
[core] Revert version display from
b8a86dc
dirkf [Tue, 4 Jul 2023 15:06:21 +0000 (16:06 +0100)]
[Misc] Fixes for 2.6 compatibility
dirkf [Fri, 30 Jun 2023 02:52:39 +0000 (03:52 +0100)]
[workflows/ci.yml] Fix test support for Py 2.6
dirkf [Thu, 29 Jun 2023 14:27:12 +0000 (15:27 +0100)]
[workflows/ci.yml] Restore test support for Py 3.2
dirkf [Thu, 22 Jun 2023 23:15:06 +0000 (00:15 +0100)]
[workflows/ci.yml] Restore test support for Py 3.3, 3.4, and add 2.6
dirkf [Thu, 22 Jun 2023 22:10:04 +0000 (23:10 +0100)]
[YouTube] Avoid crash in author extraction
dirkf [Thu, 22 Jun 2023 16:24:48 +0000 (17:24 +0100)]
[workflows/ci.yml] Replace actions/setup-python for legacy Pythons
Thanks MatteoH2O1999: https://github.com/MatteoH2O1999/setup-python
pukkandan [Thu, 22 Jun 2023 07:53:31 +0000 (13:23 +0530)]
[YouTube] Improve nsig function name extraction
Fixes player
b7910ca8, using `,` vs `;`
See https://github.com/ytdl-org/youtube-dl/issues/32292#issuecomment-
1602231170
Co-authored-by: dirkf
dirkf [Sat, 17 Jun 2023 23:52:18 +0000 (00:52 +0100)]
[YouTube] Improve fix for
ae8ba2c
Thx: https://github.com/yt-dlp/yt-dlp/commit/
01aba25
dirkf [Sat, 17 Jun 2023 14:36:39 +0000 (15:36 +0100)]
[YouTube] Fix `KeyError QV` in signature extraction failed
* temporarily force missing global definition into sig JS
* improve test: thanks https://github.com/yt-dlp/yt-dlp/issues/7327#issuecomment-
1595274615
* resolves #32314
dirkf [Sat, 17 Jun 2023 14:43:10 +0000 (15:43 +0100)]
[jsinterp] Don't find unrelated objects
dirkf [Sat, 17 Jun 2023 14:34:11 +0000 (15:34 +0100)]
[jsinterp] Fix test for failed match in extract_object()
dirkf [Sun, 11 Jun 2023 12:33:50 +0000 (13:33 +0100)]
[jsinterp] Fix div bug breaking player
8c7583ff
Thx bashonly: https://github.com/ytdl-org/youtube-dl/issues/32292#issuecomment-
1585639223
Fixes #32292
dirkf [Fri, 26 May 2023 19:25:25 +0000 (20:25 +0100)]
[core] Revise
1f7c6f8 to help downstream merger (possibly)
dirkf [Tue, 23 May 2023 16:11:22 +0000 (17:11 +0100)]
[compat] Fix casefold import __all__ syntax in
a19855f
dirkf [Tue, 23 May 2023 15:19:55 +0000 (16:19 +0100)]
[ITV] Fix UA capitalisation in
384f632
dirkf [Fri, 19 May 2023 12:12:59 +0000 (13:12 +0100)]
[core] Further improve platform debug log
* see
d1c6c5c
dirkf [Fri, 19 May 2023 12:09:18 +0000 (13:09 +0100)]
[jsinterp] Small updates for
a85a875
* update signature tests
* clarify NaN handling
dirkf [Thu, 11 May 2023 20:17:31 +0000 (21:17 +0100)]
[core] Improve platform debug log, based on yt-dlp
dirkf [Thu, 11 May 2023 20:02:01 +0000 (21:02 +0100)]
[jsinterp] Add short-cut evaluation for common expression
* special handling for (d%e.length+e.length)%e.length speeds up ~6%
dirkf [Thu, 11 May 2023 19:59:30 +0000 (20:59 +0100)]
[jsinterp] Handle NaN in bitwise operators
* also add _NaN
* also pull function naming from yt-dlp
dirkf [Thu, 11 May 2023 19:53:07 +0000 (20:53 +0100)]
[utils] Fix `compiled_regex_type` in
249f2b6
dirkf [Sun, 23 Apr 2023 21:58:35 +0000 (22:58 +0100)]
[YouTube] Support Releases tab
dirkf [Fri, 21 Apr 2023 13:04:30 +0000 (14:04 +0100)]
[jsinterp] Minimally handle arithmetic operator precedence
Resolves #32066
dirkf [Wed, 12 Apr 2023 23:15:07 +0000 (00:15 +0100)]
[DashSegmentsFD] Correctly detect errors when `fragment_retries` == 0
* use the success flag instead of the retry count
* establish the fragment_url outside the retry loop
* only report skipping a fragment once.
* resolves #32033
dirkf [Tue, 11 Apr 2023 16:36:27 +0000 (17:36 +0100)]
[YouTube] Simplify signature patterns
Gabriel Nagy [Wed, 12 Apr 2023 22:40:38 +0000 (01:40 +0300)]
[core] Sanitize info dict before dumping JSON (fixes
fe7e130) (#32032)
* follow up to
fe7e130 which didn't fix everything.
Co-authored-by: dirkf <fieldhouse@gmx.net>
dirkf [Mon, 10 Apr 2023 16:12:31 +0000 (17:12 +0100)]
[core] Add and use sanitize_info() method from yt-dlp
dirkf [Thu, 6 Apr 2023 18:49:46 +0000 (19:49 +0100)]
[core] No longer importing copy
dirkf [Thu, 6 Apr 2023 18:42:36 +0000 (19:42 +0100)]
[core] Avoid deepcopy of ctx dict (fix
f35b757) (Pt 2)
dirkf [Thu, 6 Apr 2023 13:11:18 +0000 (14:11 +0100)]
[core] Avoid deepcopy of ctx dict (fix
f35b757)
* may now contain `LazyList`s
* resolves #31999
dirkf [Wed, 5 Apr 2023 17:50:25 +0000 (18:50 +0100)]
[options] Add --mtime option, unsets default --no-mtime
* resolves #1709 (!)
dirkf [Wed, 5 Apr 2023 17:47:49 +0000 (18:47 +0100)]
[devscripts] Improve hack to convert command-line options to API options
* define equality for DateRange
* don't show default DateRange
dirkf [Wed, 5 Apr 2023 17:39:54 +0000 (18:39 +0100)]
[compat] Add and use `compat_open()` like Py3 `open()`
* resolves FIXME: ytdl-org/youtube-dl/commit/
dfe5fa4
dirkf [Wed, 5 Apr 2023 17:29:24 +0000 (18:29 +0100)]
[FragmentFD] Fix iteration with infinite limit
* fixes ytdl-org/youtube-dl/
baa6c5e
* resolves #31885
dirkf [Mon, 3 Apr 2023 20:07:10 +0000 (21:07 +0100)]
[test] Update tests for Ubuntu 20.04
* 18.04 test runner was withdrawn
* for now, disable Py 3.3/3.4 tests
pukkandan [Thu, 9 Mar 2023 16:39:23 +0000 (22:09 +0530)]
[extractor/youtube] Bypass throttling for `-f17`
and related cleanup
Thanks @AudricV for the finding
Ref: yt-dlp/yt-dlp/commit/
c9abebb
pukkandan [Sun, 12 Mar 2023 09:16:09 +0000 (14:46 +0530)]
[extractor/youtube] Construct fragment list lazily
Ref: yt-dlp/yt-dlp/commit/
e389d17
See: yt-dlp/yt-dlp#6517
dirkf [Sun, 19 Mar 2023 02:27:46 +0000 (02:27 +0000)]
[utils] Ensure `allow_types` for `variadic()` is a tuple
dirkf [Sun, 19 Mar 2023 00:51:44 +0000 (00:51 +0000)]
[downloader/http] Only check for resumability when actually resuming
dirkf [Tue, 14 Mar 2023 13:01:32 +0000 (13:01 +0000)]
[AENetworksBaseIE] Report missing show data instead of crash
dirkf [Tue, 14 Mar 2023 00:58:59 +0000 (00:58 +0000)]
[devscripts] Add a hack to convert command-line options to API options
dirkf [Mon, 13 Mar 2023 19:45:54 +0000 (19:45 +0000)]
[core] Handle `/../` sequences in HTTP URLs
* use Python's RFC implementation for embedded sequences
* hack: strip unbalanced leading `../` from path, like eg Firefox
See https://github.com/yt-dlp/yt-dlp/issues/3355
dirkf [Sat, 11 Mar 2023 12:17:00 +0000 (12:17 +0000)]
[FragmentFD] Respect `--no-continue`
* discard partial fragment on `--no-continue`
* continue with correct progress display otherwise
Resolves #21467
dirkf [Sat, 11 Mar 2023 12:09:55 +0000 (12:09 +0000)]
[downloader] Let _ffmpeg_ handle DASH segments
Fixes https://github.com/ytdl-org/youtube-dl/issues/31792 after
3da1783.
dirkf [Thu, 9 Mar 2023 16:40:30 +0000 (16:40 +0000)]
[doc] Improve "guidance" on bug reporting
Sophira [Tue, 7 Mar 2023 15:49:31 +0000 (15:49 +0000)]
[doc] Recommend "Get cookies.txt LOCALLY" extension in README.md (#31763)
* remove link to suspect "Get cookies.txt" extension, dropped from Chrome store
* link to new Manifest V3-compatible open-source "Get cookies.txt LOCALLY" extension.
Fixes #31465.
dirkf [Sun, 5 Mar 2023 23:07:07 +0000 (23:07 +0000)]
[jsinterp] Fix regexp parsing and .replace[All] method
* For performance, make regexp object instantiation lazy
* Other small performance improvements
pukkandan [Fri, 3 Mar 2023 11:18:54 +0000 (16:48 +0530)]
[jsinterp] Handle `Date` at epoch 0
See yt-dlp/yt_dlp#6400
pukkandan [Tue, 28 Feb 2023 17:33:44 +0000 (23:03 +0530)]
[Youtube] Construct dash formats with `range` query
See yt-dlp/yt_dlp#6369
dirkf [Fri, 24 Feb 2023 02:48:37 +0000 (02:48 +0000)]
[YouTube] Support @owner format in uploader_id etc
* implement https://github.com/ytdl-org/youtube-dl/issues/31530#issuecomment-
1435734719
* update affected tests
* misc clean-ups
dirkf [Fri, 24 Feb 2023 02:32:40 +0000 (02:32 +0000)]
[test] Support test-case with volatile ID (eg live show)
Signalled by regexp ID value, eg: `'id': r're:[\da-zA-Z_-]{8,}'`
pukkandan [Tue, 1 Jun 2021 12:35:41 +0000 (18:05 +0530)]
Escape URLs in `sanitized_Request`, not `sanitize_url`
d2558234cf5dd12d6896eed5427b7dcdb3ab7b5a added escaping of URLs while sanitizing. However, `sanitize_url` may not always receive an actual URL. Eg: When using `youtube-dl "search query" --default-search ytsearch`, `search query` gets escaped to `search%20query` before being prefixed with `ytsearch:` which is not the intended behavior. So the escaping is moved to `sanitized_Request` instead.
df [Mon, 20 Feb 2023 01:41:46 +0000 (01:41 +0000)]
[Vimeo] Fix
e19ec52 for tween-age Pythons
* a check in older Pythons in the 2.7 and earlier, 3.3, 3.4 series caused "sre_constants.error: nothing to repeat"
* satisfy the check by avoiding nested qualifiers that can match empty string
Resolves #31597
dirkf [Sun, 19 Feb 2023 13:47:49 +0000 (13:47 +0000)]
[jsinterp] Fix dict comprehension for Py2.6
Resolves #31600
dirkf [Fri, 17 Feb 2023 11:16:54 +0000 (11:16 +0000)]
[YouTube] Avoid crash if uploader_id extraction fails
See #31530.
dirkf [Tue, 14 Feb 2023 16:33:01 +0000 (16:33 +0000)]
[test] Avoid name TestIE which causes a pytest warning
See: https://github.com/yt-dlp/yt-dlp/commit/
060ac76257a8c1f7370a8a571821c1d73377701f
dirkf [Tue, 14 Feb 2023 02:47:09 +0000 (02:47 +0000)]
[InfoExtractor] Handle unquoted values in OpenGraph searches