Robotix [Wed, 6 Dec 2023 01:17:57 +0000 (02:17 +0100)]
[Epidemic Sound] Add new extractor (#32628)
* Add simple extractor
* Support separate tracks
* Use index as id instead of slug
---------
Co-authored-by: dirkf <fieldhouse@gmx.net>
dirkf [Tue, 5 Dec 2023 20:02:30 +0000 (20:02 +0000)]
[Imgur] Overhaul extractor module (#32612)
Revise extractors for new API and page formats
mimvahedi [Sat, 2 Dec 2023 15:25:09 +0000 (18:55 +0330)]
[telewebion] Fix extraction (#32634)
* [telewebion] fix extraction
Resolves https://github.com/ytdl-org/youtube-dl/issues/5135#issuecomment-
932952119
---------
Co-authored-by: dirkf <fieldhouse@gmx.net>
dirkf [Tue, 28 Nov 2023 17:26:37 +0000 (17:26 +0000)]
[utils] Make restricted filenames ignore characters in Unicode categories Mark, Other
Resolves #32629
dirkf [Tue, 28 Nov 2023 18:06:40 +0000 (18:06 +0000)]
[workflows/ci.yml] Use setup-python for now released Python 3.12
dirkf [Tue, 28 Nov 2023 16:17:07 +0000 (16:17 +0000)]
[workflows/ci.yml] Actually use default values for push and pull_request
dirkf [Wed, 18 Oct 2023 13:28:10 +0000 (14:28 +0100)]
[workflows/ci.yml] Run apt-get update before installing
ReenigneArcher [Wed, 15 Nov 2023 23:54:31 +0000 (18:54 -0500)]
[Youtube] Update consent cookie handling to match site
Apologies for force push!
[skip ci]
dirkf [Sun, 24 Sep 2023 21:00:13 +0000 (22:00 +0100)]
[downloader] Fix
baa6c5e: show ETA of http download as ETA instead of total d/l time
dirkf [Sun, 3 Sep 2023 22:15:19 +0000 (23:15 +0100)]
[utils] Revert
bbd3e7e, updating docstring, test instead
dirkf [Sun, 3 Sep 2023 00:18:22 +0000 (01:18 +0100)]
[utils] Properly handle list values in update_url()
An actual list value in a query update could have been treated
as a list of values because of the key:list parse_qs format.
dirkf [Sun, 3 Sep 2023 00:13:40 +0000 (01:13 +0100)]
[test] Remove redundancy from lambda expected value regex
dirkf [Sun, 27 Aug 2023 18:08:28 +0000 (19:08 +0100)]
[S4C] Add thumbnail extraction, extract series as playlist
Based on https://github.com/yt-dlp/yt-dlp/pull/7776: thx ifan-t, bashonly
dirkf [Thu, 31 Aug 2023 16:16:47 +0000 (17:16 +0100)]
[ci.yml] Improve conditions for nosetest installations
dirkf [Fri, 4 Aug 2023 21:54:12 +0000 (22:54 +0100)]
[S4C] Add extractor for Sianel Pedwar Cymru
* from https://github.com/yt-dlp/yt-dlp/pull/7730, thx ifan-t, bashonly
dirkf [Sun, 30 Jul 2023 20:50:52 +0000 (21:50 +0100)]
[utils] Advertise optional supported `Content-Encoding`s
dirkf [Sun, 30 Jul 2023 20:49:58 +0000 (21:49 +0100)]
[utils] Add and use `filter_dict()` from yt-dlp
dirkf [Sun, 30 Jul 2023 20:47:48 +0000 (21:47 +0100)]
[utils] Revise `isinstance()` tests (especially for str/unicode/bytes) to complete Linter fix
dirkf [Sun, 30 Jul 2023 20:45:57 +0000 (21:45 +0100)]
[utils] Avoid comparing `type(var)`, etc, to pass new Linter rules
dirkf [Fri, 28 Jul 2023 05:19:15 +0000 (06:19 +0100)]
[utils] Rework URL path munging for ., .. components
* move processing to YoutubeDLHandler
* also process `Location` header for redirect
* use tests from https://github.com/yt-dlp/yt-dlp/pull/7662
dirkf [Fri, 28 Jul 2023 05:03:14 +0000 (06:03 +0100)]
[utils] Rework decoding of `Content-Encoding`s
* support nested encodings
* support optional `br` encoding, if brotli package is installed
* support optional 'compress' encoding, if ncompress package is installed
* response `Content-Encoding` has only unprocessed encodings, or removed
* response `Content-Length` is decoded length (usable for filesize metadata)
* use zlib for both deflate and gzip decompression
* some elements taken from yt-dlp: thx especially coletdjnz
dirkf [Fri, 28 Jul 2023 09:52:04 +0000 (10:52 +0100)]
[workflows/ci.yml] Update to setup-java@v3
* avoid Node 12 deprecation
dirkf [Tue, 25 Jul 2023 14:11:15 +0000 (15:11 +0100)]
[build] Fix typo in `devscripts/fish-completion.py` (fix
2285605)
dirkf [Mon, 24 Jul 2023 23:22:54 +0000 (00:22 +0100)]
[build] Extend use of `devscripts/utils`
dirkf [Mon, 24 Jul 2023 23:17:15 +0000 (00:17 +0100)]
[compat] Use `compat_open()`
dirkf [Mon, 24 Jul 2023 22:43:36 +0000 (23:43 +0100)]
[build] Add and use `devscripts/utils`
dirkf [Mon, 24 Jul 2023 02:30:28 +0000 (03:30 +0100)]
[utils] Fix update_Request() with empty data (not None)
dirkf [Thu, 20 Jul 2023 18:32:29 +0000 (19:32 +0100)]
[test/test_execution.py] Use `compat_subprocess_get_DEVNULL()`
dirkf [Thu, 20 Jul 2023 17:49:48 +0000 (18:49 +0100)]
[build] Fix various Jython CI and test issues
dirkf [Thu, 20 Jul 2023 15:36:54 +0000 (16:36 +0100)]
[workflows/ci.yml] Build 3.12 with pyenv
dirkf [Thu, 20 Jul 2023 14:49:46 +0000 (15:49 +0100)]
[doc] Update developer guidance
* mention pynose
* mention traverse_obj and add/revise examples
[skip ci]
dirkf [Thu, 20 Jul 2023 11:42:46 +0000 (12:42 +0100)]
[workflows/ci.yml] Extend Python versions
* add 3.10 - 3.12
* use https://pypi.org/project/pynose/ for Py >= 3.9
* test Windows with 3.4
* set defaults (main, both) except push: (all, core)
dirkf [Thu, 20 Jul 2023 04:29:59 +0000 (05:29 +0100)]
[utils] Remove stray undocumented Host header in redirect (fix
46fde7c)
dirkf [Wed, 19 Jul 2023 13:14:50 +0000 (14:14 +0100)]
[InfoExtractor] Add `_match_valid_url()` class method and refactor
* API compatible with yt-dlp
* also support Sequence of patterns in _VALID_URL
* one place to compile _VALID_URL
* TODO: remove existing extractor shims
dirkf [Wed, 19 Jul 2023 12:01:02 +0000 (13:01 +0100)]
[utils] Fix broken Py 3.11+ compat in `traverse_obj()`
* inspect.getargspec is missing despite doc claiming backward compat
* replace with emulation of `Signature.bind()`
dirkf [Fri, 5 May 2023 18:25:42 +0000 (19:25 +0100)]
[InfoExtractor] Add search methods for Next/Nuxt.js from yt-dlp
* add _search_nextjs_data(), from https://github.com/yt-dlp/yt-dlp/pull/1386
thanks selfisekai
* add _search_nuxt_data(), from https://github.com/yt-dlp/yt-dlp/pull/1921,
thanks Lesmiscore, pukkandan
* add tests for the above
* also fix HTML5 type recognition and tests, from
https://github.com/yt-dlp/yt-dlp/commit/
222a230871fe4fe63f35c49590379c9a77116819,
thanks Lesmiscore
* update extractors in PR using above, fix tests.
dirkf [Wed, 3 May 2023 23:08:26 +0000 (00:08 +0100)]
[Clipchamp] Add new extractor back-ported from yt-dlp
dirkf [Wed, 3 May 2023 09:07:35 +0000 (10:07 +0100)]
[DLF] Add site extractors back-ported from yt-dlp
* from https://github.com/yt-dlp/yt-dlp/pull/6697, thanks nick-cd
dirkf [Wed, 3 May 2023 09:04:46 +0000 (10:04 +0100)]
[Whyp] Add extractor back-ported from yt-dlp
* from https://github.com/yt-dlp/yt-dlp/pull/6803, thanks CoryTibbettsDev
dirkf [Wed, 3 May 2023 09:02:25 +0000 (10:02 +0100)]
[GlobalPlayer] Add site extractors back-ported from yt-dlp
* from https://github.com/yt-dlp/yt-dlp/pull/6903, thanks garret1317
dirkf [Wed, 3 May 2023 12:08:58 +0000 (13:08 +0100)]
[InfoExtractor] Support groups in _`search_regex()`, etc
dirkf [Wed, 3 May 2023 09:54:52 +0000 (10:54 +0100)]
[utils] Minor updates (merge_dicts, T)
A couple of mods to ease yt-dlp back-ports:
* add kwargs to merge_dicts:
`unblank=True` (disallow empty string), `rev=False` (reverse the merge list)
* add `T(x)` shortcut for `{x}`, unsupported in Py2.6
dirkf [Wed, 3 May 2023 11:06:34 +0000 (12:06 +0100)]
[utils] Improve js_to_json, align with yt-dlp
* support variable substitution, from https://github.com/yt-dlp/yt-dlp/pull/#521 etc,
thanks ChillingPepper, Grub4k, pukkandan
* improve escape handling, from https://github.com/yt-dlp/yt-dlp/pull/#521
thanks Grub4k
* support template strings from https://github.com/yt-dlp/yt-dlp/pull/6623
thanks Grub4k
* add limited `!` evaluation (eg, !!0 -> false, see tests)
dirkf [Wed, 3 May 2023 11:40:09 +0000 (12:40 +0100)]
[utils] Align traverse_obj() with yt-dlp
Thanks Grub4k for these:
* traverse `Iterable`s, from https://github.com/yt-dlp/yt-dlp/pull/6902, etc
* traverse `set` key for transformations/filters, `re.Match` group names, from
https://github.com/yt-dlp/yt-dlp/commit/
776995bc109c5cd1aa56b684fada2ce718a386ec, etc
* traverse `re.Match`es, from https://github.com/yt-dlp/yt-dlp/pull/5174
* always return list when branching, from https://github.com/yt-dlp/yt-dlp/pull/5170
dirkf [Mon, 17 Jul 2023 19:39:11 +0000 (20:39 +0100)]
[compat] Fix old Pythons broken loading of valueless cookie attributes
Cookie string parsing in Py 2.6.9, probably earlier, requires `=`.
Also 3.2, though the CPython code appears to be OK: 3.1 was also wrong.
dirkf [Thu, 13 Jul 2023 19:14:50 +0000 (20:14 +0100)]
[test] Fixes for old Pythons
dirkf [Tue, 11 Jul 2023 20:51:32 +0000 (21:51 +0100)]
[doc] Warn against setting cookies with --add-header
bashonly [Tue, 4 Jul 2023 21:40:56 +0000 (16:40 -0500)]
[downloader/external] Fix cookie support
Simon Sawicki [Tue, 4 Jul 2023 19:41:04 +0000 (21:41 +0200)]
[core] Process header cookies on loading
bashonly [Tue, 4 Jul 2023 19:03:39 +0000 (14:03 -0500)]
[utils] `YoutubeDLCookieJar`: Add `get_cookie_header` and `get_cookies_for_url` methods
dirkf [Wed, 7 Jun 2023 18:38:54 +0000 (19:38 +0100)]
[core] Remove `Cookie` header on redirect to prevent leaks
Adated from yt-dlp/yt-dlp-ghsa-v8mc-9377-rwjj/pull/1/commits/
101caac
Thx coletdjnz
dirkf [Wed, 7 Jun 2023 13:51:50 +0000 (14:51 +0100)]
[core] Update redirect handling from yt-dlp
* Thx coletdjnz: https://github.com/yt-dlp/yt-dlp/pull/7094
* add test that redirected `POST` loses its `Content-Type`
dirkf [Fri, 7 Jul 2023 17:51:38 +0000 (18:51 +0100)]
[compat] Add Request and HTTPClient compat for redirect
* support `method` parameter of `Request.__init__` (Py 2 and old Py 3)
* support `getcode` method of compat_http_client.HTTPResponse (Py 2)
dirkf [Mon, 17 Jul 2023 19:47:58 +0000 (20:47 +0100)]
[test] Make skipped tests in test_execution work with Py 2.6
dirkf [Fri, 7 Jul 2023 17:45:31 +0000 (18:45 +0100)]
[core] Align error reporting methods with yt-dlp
dirkf [Thu, 6 Jul 2023 14:46:22 +0000 (15:46 +0100)]
[utils] Add {expected_type} and Iterable support to traverse_obj()
dirkf [Fri, 7 Jul 2023 17:41:32 +0000 (18:41 +0100)]
[core] Revert version display from
b8a86dc
dirkf [Tue, 4 Jul 2023 15:06:21 +0000 (16:06 +0100)]
[Misc] Fixes for 2.6 compatibility
dirkf [Fri, 30 Jun 2023 02:52:39 +0000 (03:52 +0100)]
[workflows/ci.yml] Fix test support for Py 2.6
dirkf [Thu, 29 Jun 2023 14:27:12 +0000 (15:27 +0100)]
[workflows/ci.yml] Restore test support for Py 3.2
dirkf [Thu, 22 Jun 2023 23:15:06 +0000 (00:15 +0100)]
[workflows/ci.yml] Restore test support for Py 3.3, 3.4, and add 2.6
dirkf [Thu, 22 Jun 2023 22:10:04 +0000 (23:10 +0100)]
[YouTube] Avoid crash in author extraction
dirkf [Thu, 22 Jun 2023 16:24:48 +0000 (17:24 +0100)]
[workflows/ci.yml] Replace actions/setup-python for legacy Pythons
Thanks MatteoH2O1999: https://github.com/MatteoH2O1999/setup-python
pukkandan [Thu, 22 Jun 2023 07:53:31 +0000 (13:23 +0530)]
[YouTube] Improve nsig function name extraction
Fixes player
b7910ca8, using `,` vs `;`
See https://github.com/ytdl-org/youtube-dl/issues/32292#issuecomment-
1602231170
Co-authored-by: dirkf
dirkf [Sat, 17 Jun 2023 23:52:18 +0000 (00:52 +0100)]
[YouTube] Improve fix for
ae8ba2c
Thx: https://github.com/yt-dlp/yt-dlp/commit/
01aba25
dirkf [Sat, 17 Jun 2023 14:36:39 +0000 (15:36 +0100)]
[YouTube] Fix `KeyError QV` in signature extraction failed
* temporarily force missing global definition into sig JS
* improve test: thanks https://github.com/yt-dlp/yt-dlp/issues/7327#issuecomment-
1595274615
* resolves #32314
dirkf [Sat, 17 Jun 2023 14:43:10 +0000 (15:43 +0100)]
[jsinterp] Don't find unrelated objects
dirkf [Sat, 17 Jun 2023 14:34:11 +0000 (15:34 +0100)]
[jsinterp] Fix test for failed match in extract_object()
dirkf [Sun, 11 Jun 2023 12:33:50 +0000 (13:33 +0100)]
[jsinterp] Fix div bug breaking player
8c7583ff
Thx bashonly: https://github.com/ytdl-org/youtube-dl/issues/32292#issuecomment-
1585639223
Fixes #32292
dirkf [Fri, 26 May 2023 19:25:25 +0000 (20:25 +0100)]
[core] Revise
1f7c6f8 to help downstream merger (possibly)
dirkf [Tue, 23 May 2023 16:11:22 +0000 (17:11 +0100)]
[compat] Fix casefold import __all__ syntax in
a19855f
dirkf [Tue, 23 May 2023 15:19:55 +0000 (16:19 +0100)]
[ITV] Fix UA capitalisation in
384f632
dirkf [Fri, 19 May 2023 12:12:59 +0000 (13:12 +0100)]
[core] Further improve platform debug log
* see
d1c6c5c
dirkf [Fri, 19 May 2023 12:09:18 +0000 (13:09 +0100)]
[jsinterp] Small updates for
a85a875
* update signature tests
* clarify NaN handling
dirkf [Thu, 11 May 2023 20:17:31 +0000 (21:17 +0100)]
[core] Improve platform debug log, based on yt-dlp
dirkf [Thu, 11 May 2023 20:02:01 +0000 (21:02 +0100)]
[jsinterp] Add short-cut evaluation for common expression
* special handling for (d%e.length+e.length)%e.length speeds up ~6%
dirkf [Thu, 11 May 2023 19:59:30 +0000 (20:59 +0100)]
[jsinterp] Handle NaN in bitwise operators
* also add _NaN
* also pull function naming from yt-dlp
dirkf [Thu, 11 May 2023 19:53:07 +0000 (20:53 +0100)]
[utils] Fix `compiled_regex_type` in
249f2b6
dirkf [Sun, 23 Apr 2023 21:58:35 +0000 (22:58 +0100)]
[YouTube] Support Releases tab
dirkf [Fri, 21 Apr 2023 13:04:30 +0000 (14:04 +0100)]
[jsinterp] Minimally handle arithmetic operator precedence
Resolves #32066
dirkf [Wed, 12 Apr 2023 23:15:07 +0000 (00:15 +0100)]
[DashSegmentsFD] Correctly detect errors when `fragment_retries` == 0
* use the success flag instead of the retry count
* establish the fragment_url outside the retry loop
* only report skipping a fragment once.
* resolves #32033
dirkf [Tue, 11 Apr 2023 16:36:27 +0000 (17:36 +0100)]
[YouTube] Simplify signature patterns
Gabriel Nagy [Wed, 12 Apr 2023 22:40:38 +0000 (01:40 +0300)]
[core] Sanitize info dict before dumping JSON (fixes
fe7e130) (#32032)
* follow up to
fe7e130 which didn't fix everything.
Co-authored-by: dirkf <fieldhouse@gmx.net>
dirkf [Mon, 10 Apr 2023 16:12:31 +0000 (17:12 +0100)]
[core] Add and use sanitize_info() method from yt-dlp
dirkf [Thu, 6 Apr 2023 18:49:46 +0000 (19:49 +0100)]
[core] No longer importing copy
dirkf [Thu, 6 Apr 2023 18:42:36 +0000 (19:42 +0100)]
[core] Avoid deepcopy of ctx dict (fix
f35b757) (Pt 2)
dirkf [Thu, 6 Apr 2023 13:11:18 +0000 (14:11 +0100)]
[core] Avoid deepcopy of ctx dict (fix
f35b757)
* may now contain `LazyList`s
* resolves #31999
dirkf [Wed, 5 Apr 2023 17:50:25 +0000 (18:50 +0100)]
[options] Add --mtime option, unsets default --no-mtime
* resolves #1709 (!)
dirkf [Wed, 5 Apr 2023 17:47:49 +0000 (18:47 +0100)]
[devscripts] Improve hack to convert command-line options to API options
* define equality for DateRange
* don't show default DateRange
dirkf [Wed, 5 Apr 2023 17:39:54 +0000 (18:39 +0100)]
[compat] Add and use `compat_open()` like Py3 `open()`
* resolves FIXME: ytdl-org/youtube-dl/commit/
dfe5fa4
dirkf [Wed, 5 Apr 2023 17:29:24 +0000 (18:29 +0100)]
[FragmentFD] Fix iteration with infinite limit
* fixes ytdl-org/youtube-dl/
baa6c5e
* resolves #31885
dirkf [Mon, 3 Apr 2023 20:07:10 +0000 (21:07 +0100)]
[test] Update tests for Ubuntu 20.04
* 18.04 test runner was withdrawn
* for now, disable Py 3.3/3.4 tests
pukkandan [Thu, 9 Mar 2023 16:39:23 +0000 (22:09 +0530)]
[extractor/youtube] Bypass throttling for `-f17`
and related cleanup
Thanks @AudricV for the finding
Ref: yt-dlp/yt-dlp/commit/
c9abebb
pukkandan [Sun, 12 Mar 2023 09:16:09 +0000 (14:46 +0530)]
[extractor/youtube] Construct fragment list lazily
Ref: yt-dlp/yt-dlp/commit/
e389d17
See: yt-dlp/yt-dlp#6517
dirkf [Sun, 19 Mar 2023 02:27:46 +0000 (02:27 +0000)]
[utils] Ensure `allow_types` for `variadic()` is a tuple
dirkf [Sun, 19 Mar 2023 00:51:44 +0000 (00:51 +0000)]
[downloader/http] Only check for resumability when actually resuming
dirkf [Tue, 14 Mar 2023 13:01:32 +0000 (13:01 +0000)]
[AENetworksBaseIE] Report missing show data instead of crash
dirkf [Tue, 14 Mar 2023 00:58:59 +0000 (00:58 +0000)]
[devscripts] Add a hack to convert command-line options to API options
dirkf [Mon, 13 Mar 2023 19:45:54 +0000 (19:45 +0000)]
[core] Handle `/../` sequences in HTTP URLs
* use Python's RFC implementation for embedded sequences
* hack: strip unbalanced leading `../` from path, like eg Firefox
See https://github.com/yt-dlp/yt-dlp/issues/3355
dirkf [Sat, 11 Mar 2023 12:17:00 +0000 (12:17 +0000)]
[FragmentFD] Respect `--no-continue`
* discard partial fragment on `--no-continue`
* continue with correct progress display otherwise
Resolves #21467