dirkf [Tue, 28 May 2024 16:16:58 +0000 (17:16 +0100)]
[utils] Split out traversal.py dummy and traversal tests
dirkf [Tue, 28 May 2024 15:38:20 +0000 (16:38 +0100)]
[compat] Improve compat_etree_iterfind for Py2.6
Adapted from https://raw.githubusercontent.com/python/cpython/2.7/Lib/xml/etree/ElementPath.py
dirkf [Sun, 21 Apr 2024 22:42:08 +0000 (23:42 +0100)]
[utils] Update traverse_obj() from yt-dlp
* remove `is_user_input` option per https://github.com/yt-dlp/yt-dlp/pull/8673
* support traversal of compat_xml_etree_ElementTree_Element per https://github.com/yt-dlp/yt-dlp/pull/8911
* allow un/branching using all and any per https://github.com/yt-dlp/yt-dlp/pull/9571
* support traversal of compat_cookies.Morsel and multiple types in `set()` keys per https://github.com/yt-dlp/yt-dlp/pull/9577
thx Grub4k for these
* also, move traversal tests to a separate class
* allow for unordered dicts in tests for Py<3.7
dirkf [Wed, 15 May 2024 15:57:59 +0000 (16:57 +0100)]
[workflows/ci.yml] Temporary workaround for Python 3.5 _pip_ failures
https://github.com/actions/setup-python/issues/866
dirkf [Thu, 11 Apr 2024 12:06:25 +0000 (13:06 +0100)]
[YouPorn] Add playlist extractors
* YouPornCategoryIE
* YouPornChannelIE
* YouPornCollectionIE
* YouPornStarIE
* YouPornTagIE
* YouPornVideosIE,
dirkf [Fri, 1 Mar 2024 15:07:30 +0000 (15:07 +0000)]
[YouPorn] Improve extraction
* detect unwatchable videos
* improve duration extraction
* fix count extraction and support large values
* detect and remove SEO spam boilerplate description
dirkf [Thu, 11 Apr 2024 11:47:05 +0000 (12:47 +0100)]
[test/test_download] Support 'playlist_maxcount:count' expected value
* parallel to `playlist_mincount'
* specify both for a range of playlist lengths
* if max < min the test will always fail!
dirkf [Fri, 1 Mar 2024 14:48:30 +0000 (14:48 +0000)]
[YouPorn] Incorporate yt-dlp PR 8827
* from https://github.com/yt-dlp/yt-dlp/pull/8827
* extract from webpage instead of broken API URL
* thx The-MAGI
dirkf [Sun, 7 Apr 2024 14:26:12 +0000 (15:26 +0100)]
[postprocessor/ffmpeg] Fix finding ffprobe (bug in
21792b8)
Fixes https://github.com/ytdl-org/youtube-dl/commit/
21792b88b791b16e3ab0a0fb2e26e5bb8a4e2ff3#commitcomment-
140705274, thx: vonProteus
Ori Avtalion [Fri, 5 Apr 2024 14:25:29 +0000 (17:25 +0300)]
[utils] Fix crash in _report_ignoring_subs from
c58b655 (#32762)
Align `utils.bug_reports_message()` with yt-dlp https://github.com/yt-dlp/yt-dlp/commit/
5873d4ccdd, thanks fstirlitz
---------
Co-authored-by: dirkf <fieldhouse@gmx.net>
dirkf [Sun, 3 Mar 2024 12:38:00 +0000 (12:38 +0000)]
[external/FFmpeg] Fix and improve --ffmpeg-location handling
* pass YoutubeDL (FileDownloader) to FFmpegPostProcessor constructor
* consolidate path search in FFmpegPostProcessor
* make availability of FFmpegFD depend on existence of FFmpegPostProcessor
* detect ffmpeg executable on instantiation of FFmpegFD
* resolves #32735
dirkf [Sat, 2 Mar 2024 15:17:09 +0000 (15:17 +0000)]
[downloader/external] Fix "Resource Warning" in downloader test
* add compat_subprocess_Popen context manager
* apply context manager in FFmpegFD._call_downloader()
dirkf [Sun, 10 Mar 2024 19:06:57 +0000 (19:06 +0000)]
[compat] Simplify/fix compat_html_parser_HTMLParseError
dirkf [Fri, 1 Mar 2024 19:53:58 +0000 (19:53 +0000)]
[utils] Apply compat_contextlib_suppress
dirkf [Fri, 1 Mar 2024 15:25:44 +0000 (15:25 +0000)]
[compat] Add compat_contextlib_suppress
with compat_contextlib_suppress(*Exceptions):
# code that fails silently for any of Exceptions
gy-chen [Sat, 23 Mar 2024 15:30:13 +0000 (23:30 +0800)]
[Youtube] Fix unwanted private method __ie_msg in
f8b0135850
Fixes `AttributeError no attribute '_YoutubeIE__ie_msg'` if unable to decode n-parameter
Zizheng Guo [Tue, 12 Mar 2024 11:44:13 +0000 (19:44 +0800)]
[Vimeo] Improve `config` extraction (#32742)
* update for more robust json parsing
hatsomatt [Sat, 16 Sep 2023 14:02:37 +0000 (16:02 +0200)]
[Videa] Fix extraction
* update API URL
* from https://github.com/yt-dlp/yt-dlp/pull/8003
* thanks to the authors!
Closes yt-dlp/7427
Authored by: hatsomatt, aky-01
dirkf [Thu, 29 Feb 2024 14:12:37 +0000 (14:12 +0000)]
[Videa] Align with yt-dlp
dirkf [Wed, 21 Feb 2024 00:18:17 +0000 (00:18 +0000)]
[XFileShare] Update extractor for 2024
* simplify aa_decode()
* review and update supported sites and tests
* in above, include FileMoon.sx, and remove separate module
* incorporate changes from yt-dlp
* allow for decoding multiple scripts (eg, FileMoon)
* use new JWPlayer extraction
dirkf [Wed, 21 Feb 2024 00:09:48 +0000 (00:09 +0000)]
[InfoExtractor] Rework and improve JWPlayer extraction
* use traverse_obj() and _search_json()
* support playlist `.load({**video1},{**video2}, ...)`
* support transform_source=... for _extract_jwplayer_data()
dirkf [Wed, 21 Feb 2024 00:03:17 +0000 (00:03 +0000)]
[InfoExtractor] Add `_search_json()`
* uses the error diagnostic to truncate the JSON string
* may be confused by non-C-Pythons
dirkf [Mon, 4 Mar 2024 20:52:58 +0000 (20:52 +0000)]
[utils] Let int_or_none() accept a base, like int()
Hubert Hirtz [Mon, 4 Mar 2024 01:27:55 +0000 (01:27 +0000)]
[utils] Handle user:pass in URLs (#28801)
* Handle user:pass in URLs
Fixes "nonnumeric port" errors when youtube-dl is given URLs with
usernames and passwords such as:
http://username:password@example.com/myvideo.mp4
Refs:
- https://en.wikipedia.org/wiki/Basic_access_authentication
- https://tools.ietf.org/html/rfc1738#section-3.1
- https://docs.python.org/3.8/library/urllib.parse.html#urllib.parse.urlsplit
Fixes #18276 (point 4)
Fixes #20258
Fixes #26211 (see comment)
* Align code with yt-dlp
---------
Co-authored-by: dirkf <fieldhouse@gmx.net>
Aaron Tan [Thu, 22 Feb 2024 12:54:07 +0000 (23:54 +1100)]
[caffeine.tv] Add new extractor (#32514)
* Add CaffeineTVIE info extractor to support site caffeine.tv
---------
Co-authored-by: dirkf <fieldhouse@gmx.net>
dirkf [Thu, 22 Feb 2024 12:44:00 +0000 (12:44 +0000)]
[GBNews]Add new extractor for GB News TV channel (#29432)
* Add extractor for GB News TV channel
* Support more GBNews URL formats
Allow alphanumeric and _ in place of `shows`, which redirect to site's preferred URL
* Update for 2024
dirkf [Wed, 21 Feb 2024 16:29:08 +0000 (16:29 +0000)]
[downloader/external] Fix WgetFD proxy (rev 2)
From PR (defunct source), closes #29343.
Matches https://github.com/yt-dlp/yt-dlp/pull/3152
Thx former user kikuyan.
dirkf [Sun, 18 Feb 2024 20:55:01 +0000 (20:55 +0000)]
[Vbox7] Improve extraction, adding features from yt-dlp PR #9100
* changes from https://github.com/yt-dlp/yt-dlp/pull/9100 (thx
seproDev):
- attempt HLS extraction
- re-enable XFF
- test `view_count`, `duration` extraction
* improve commenting, error checks
dirkf [Sat, 27 Jan 2024 18:17:09 +0000 (18:17 +0000)]
[Vbox7IE] Sanitise ld+json containing unexpected characters
* based on PR #29680
* added hack to force invoking `transform_source`
* fixes #26218
dirkf [Sat, 27 Jan 2024 15:57:25 +0000 (15:57 +0000)]
[Vbox7IE] Improve extraction
* DASH extraction no longer fails with new range support
* but always find combined formats if available
* suppress ineffective XFF geo-bypass (causes time-outs)
* adapted from https://github.com/ytdl-org/youtube-dl/pull/29680
* thx former GH user kikuyan
dirkf [Sat, 27 Jan 2024 15:45:43 +0000 (15:45 +0000)]
[InfoExtractor] Correctly resolve BaseURL in DASH manifest
Specs:
* ISO/IEC 23009-1:2012 section 5.6
* RFC 3986 section 5.
dirkf [Sat, 27 Jan 2024 15:37:08 +0000 (15:37 +0000)]
[InfoExtractor] Support byte range for DASH
* adapted from https://github.com/ytdl-org/youtube-dl/pull/30279
* thx former GH user kikuyan
dirkf [Sat, 27 Jan 2024 00:07:14 +0000 (00:07 +0000)]
[downloader/dash] Support `range` in fragment (format f'{start}-{end}')
* adapted from https://github.com/ytdl-org/youtube-dl/pull/30279
* thx former GH user kikuyan
dirkf [Sat, 27 Jan 2024 15:29:25 +0000 (15:29 +0000)]
[InfoExtractor] Support DASH subtitle extraction (yt-dlp back-port)
dirkf [Mon, 15 Jan 2024 18:38:43 +0000 (18:38 +0000)]
[YouTube] Fix `like_count` extraction using `likeButtonViewModel`
* also fix various tests
* TODO: check against yt-dlp tests
dirkf [Mon, 15 Jan 2024 18:34:21 +0000 (18:34 +0000)]
[YouTube] Rework n-sig processing, realigning with yt-dlp
* apply n-sig before chunked fragments, fixes #32692
dirkf [Mon, 15 Jan 2024 18:32:06 +0000 (18:32 +0000)]
[InfoExtractor] Support some warning and `._downloader` shortcut methods from yt-dlp
dirkf [Sat, 20 Jan 2024 18:28:52 +0000 (18:28 +0000)]
[compat] Rework compat for `method` parameter of `compat_urllib_request.Request` constructor
* fixes #32573
* does not break `utils.HEADrequest` (eg)
mk-pmb [Wed, 13 Sep 2023 18:57:05 +0000 (20:57 +0200)]
[core] Fix format string injection for metadata JSON filename message.
Robotix [Wed, 6 Dec 2023 01:17:57 +0000 (02:17 +0100)]
[Epidemic Sound] Add new extractor (#32628)
* Add simple extractor
* Support separate tracks
* Use index as id instead of slug
---------
Co-authored-by: dirkf <fieldhouse@gmx.net>
dirkf [Tue, 5 Dec 2023 20:02:30 +0000 (20:02 +0000)]
[Imgur] Overhaul extractor module (#32612)
Revise extractors for new API and page formats
mimvahedi [Sat, 2 Dec 2023 15:25:09 +0000 (18:55 +0330)]
[telewebion] Fix extraction (#32634)
* [telewebion] fix extraction
Resolves https://github.com/ytdl-org/youtube-dl/issues/5135#issuecomment-
932952119
---------
Co-authored-by: dirkf <fieldhouse@gmx.net>
dirkf [Tue, 28 Nov 2023 17:26:37 +0000 (17:26 +0000)]
[utils] Make restricted filenames ignore characters in Unicode categories Mark, Other
Resolves #32629
dirkf [Tue, 28 Nov 2023 18:06:40 +0000 (18:06 +0000)]
[workflows/ci.yml] Use setup-python for now released Python 3.12
dirkf [Tue, 28 Nov 2023 16:17:07 +0000 (16:17 +0000)]
[workflows/ci.yml] Actually use default values for push and pull_request
dirkf [Wed, 18 Oct 2023 13:28:10 +0000 (14:28 +0100)]
[workflows/ci.yml] Run apt-get update before installing
ReenigneArcher [Wed, 15 Nov 2023 23:54:31 +0000 (18:54 -0500)]
[Youtube] Update consent cookie handling to match site
Apologies for force push!
[skip ci]
dirkf [Sun, 24 Sep 2023 21:00:13 +0000 (22:00 +0100)]
[downloader] Fix
baa6c5e: show ETA of http download as ETA instead of total d/l time
dirkf [Sun, 3 Sep 2023 22:15:19 +0000 (23:15 +0100)]
[utils] Revert
bbd3e7e, updating docstring, test instead
dirkf [Sun, 3 Sep 2023 00:18:22 +0000 (01:18 +0100)]
[utils] Properly handle list values in update_url()
An actual list value in a query update could have been treated
as a list of values because of the key:list parse_qs format.
dirkf [Sun, 3 Sep 2023 00:13:40 +0000 (01:13 +0100)]
[test] Remove redundancy from lambda expected value regex
dirkf [Sun, 27 Aug 2023 18:08:28 +0000 (19:08 +0100)]
[S4C] Add thumbnail extraction, extract series as playlist
Based on https://github.com/yt-dlp/yt-dlp/pull/7776: thx ifan-t, bashonly
dirkf [Thu, 31 Aug 2023 16:16:47 +0000 (17:16 +0100)]
[ci.yml] Improve conditions for nosetest installations
dirkf [Fri, 4 Aug 2023 21:54:12 +0000 (22:54 +0100)]
[S4C] Add extractor for Sianel Pedwar Cymru
* from https://github.com/yt-dlp/yt-dlp/pull/7730, thx ifan-t, bashonly
dirkf [Sun, 30 Jul 2023 20:50:52 +0000 (21:50 +0100)]
[utils] Advertise optional supported `Content-Encoding`s
dirkf [Sun, 30 Jul 2023 20:49:58 +0000 (21:49 +0100)]
[utils] Add and use `filter_dict()` from yt-dlp
dirkf [Sun, 30 Jul 2023 20:47:48 +0000 (21:47 +0100)]
[utils] Revise `isinstance()` tests (especially for str/unicode/bytes) to complete Linter fix
dirkf [Sun, 30 Jul 2023 20:45:57 +0000 (21:45 +0100)]
[utils] Avoid comparing `type(var)`, etc, to pass new Linter rules
dirkf [Fri, 28 Jul 2023 05:19:15 +0000 (06:19 +0100)]
[utils] Rework URL path munging for ., .. components
* move processing to YoutubeDLHandler
* also process `Location` header for redirect
* use tests from https://github.com/yt-dlp/yt-dlp/pull/7662
dirkf [Fri, 28 Jul 2023 05:03:14 +0000 (06:03 +0100)]
[utils] Rework decoding of `Content-Encoding`s
* support nested encodings
* support optional `br` encoding, if brotli package is installed
* support optional 'compress' encoding, if ncompress package is installed
* response `Content-Encoding` has only unprocessed encodings, or removed
* response `Content-Length` is decoded length (usable for filesize metadata)
* use zlib for both deflate and gzip decompression
* some elements taken from yt-dlp: thx especially coletdjnz
dirkf [Fri, 28 Jul 2023 09:52:04 +0000 (10:52 +0100)]
[workflows/ci.yml] Update to setup-java@v3
* avoid Node 12 deprecation
dirkf [Tue, 25 Jul 2023 14:11:15 +0000 (15:11 +0100)]
[build] Fix typo in `devscripts/fish-completion.py` (fix
2285605)
dirkf [Mon, 24 Jul 2023 23:22:54 +0000 (00:22 +0100)]
[build] Extend use of `devscripts/utils`
dirkf [Mon, 24 Jul 2023 23:17:15 +0000 (00:17 +0100)]
[compat] Use `compat_open()`
dirkf [Mon, 24 Jul 2023 22:43:36 +0000 (23:43 +0100)]
[build] Add and use `devscripts/utils`
dirkf [Mon, 24 Jul 2023 02:30:28 +0000 (03:30 +0100)]
[utils] Fix update_Request() with empty data (not None)
dirkf [Thu, 20 Jul 2023 18:32:29 +0000 (19:32 +0100)]
[test/test_execution.py] Use `compat_subprocess_get_DEVNULL()`
dirkf [Thu, 20 Jul 2023 17:49:48 +0000 (18:49 +0100)]
[build] Fix various Jython CI and test issues
dirkf [Thu, 20 Jul 2023 15:36:54 +0000 (16:36 +0100)]
[workflows/ci.yml] Build 3.12 with pyenv
dirkf [Thu, 20 Jul 2023 14:49:46 +0000 (15:49 +0100)]
[doc] Update developer guidance
* mention pynose
* mention traverse_obj and add/revise examples
[skip ci]
dirkf [Thu, 20 Jul 2023 11:42:46 +0000 (12:42 +0100)]
[workflows/ci.yml] Extend Python versions
* add 3.10 - 3.12
* use https://pypi.org/project/pynose/ for Py >= 3.9
* test Windows with 3.4
* set defaults (main, both) except push: (all, core)
dirkf [Thu, 20 Jul 2023 04:29:59 +0000 (05:29 +0100)]
[utils] Remove stray undocumented Host header in redirect (fix
46fde7c)
dirkf [Wed, 19 Jul 2023 13:14:50 +0000 (14:14 +0100)]
[InfoExtractor] Add `_match_valid_url()` class method and refactor
* API compatible with yt-dlp
* also support Sequence of patterns in _VALID_URL
* one place to compile _VALID_URL
* TODO: remove existing extractor shims
dirkf [Wed, 19 Jul 2023 12:01:02 +0000 (13:01 +0100)]
[utils] Fix broken Py 3.11+ compat in `traverse_obj()`
* inspect.getargspec is missing despite doc claiming backward compat
* replace with emulation of `Signature.bind()`
dirkf [Fri, 5 May 2023 18:25:42 +0000 (19:25 +0100)]
[InfoExtractor] Add search methods for Next/Nuxt.js from yt-dlp
* add _search_nextjs_data(), from https://github.com/yt-dlp/yt-dlp/pull/1386
thanks selfisekai
* add _search_nuxt_data(), from https://github.com/yt-dlp/yt-dlp/pull/1921,
thanks Lesmiscore, pukkandan
* add tests for the above
* also fix HTML5 type recognition and tests, from
https://github.com/yt-dlp/yt-dlp/commit/
222a230871fe4fe63f35c49590379c9a77116819,
thanks Lesmiscore
* update extractors in PR using above, fix tests.
dirkf [Wed, 3 May 2023 23:08:26 +0000 (00:08 +0100)]
[Clipchamp] Add new extractor back-ported from yt-dlp
dirkf [Wed, 3 May 2023 09:07:35 +0000 (10:07 +0100)]
[DLF] Add site extractors back-ported from yt-dlp
* from https://github.com/yt-dlp/yt-dlp/pull/6697, thanks nick-cd
dirkf [Wed, 3 May 2023 09:04:46 +0000 (10:04 +0100)]
[Whyp] Add extractor back-ported from yt-dlp
* from https://github.com/yt-dlp/yt-dlp/pull/6803, thanks CoryTibbettsDev
dirkf [Wed, 3 May 2023 09:02:25 +0000 (10:02 +0100)]
[GlobalPlayer] Add site extractors back-ported from yt-dlp
* from https://github.com/yt-dlp/yt-dlp/pull/6903, thanks garret1317
dirkf [Wed, 3 May 2023 12:08:58 +0000 (13:08 +0100)]
[InfoExtractor] Support groups in _`search_regex()`, etc
dirkf [Wed, 3 May 2023 09:54:52 +0000 (10:54 +0100)]
[utils] Minor updates (merge_dicts, T)
A couple of mods to ease yt-dlp back-ports:
* add kwargs to merge_dicts:
`unblank=True` (disallow empty string), `rev=False` (reverse the merge list)
* add `T(x)` shortcut for `{x}`, unsupported in Py2.6
dirkf [Wed, 3 May 2023 11:06:34 +0000 (12:06 +0100)]
[utils] Improve js_to_json, align with yt-dlp
* support variable substitution, from https://github.com/yt-dlp/yt-dlp/pull/#521 etc,
thanks ChillingPepper, Grub4k, pukkandan
* improve escape handling, from https://github.com/yt-dlp/yt-dlp/pull/#521
thanks Grub4k
* support template strings from https://github.com/yt-dlp/yt-dlp/pull/6623
thanks Grub4k
* add limited `!` evaluation (eg, !!0 -> false, see tests)
dirkf [Wed, 3 May 2023 11:40:09 +0000 (12:40 +0100)]
[utils] Align traverse_obj() with yt-dlp
Thanks Grub4k for these:
* traverse `Iterable`s, from https://github.com/yt-dlp/yt-dlp/pull/6902, etc
* traverse `set` key for transformations/filters, `re.Match` group names, from
https://github.com/yt-dlp/yt-dlp/commit/
776995bc109c5cd1aa56b684fada2ce718a386ec, etc
* traverse `re.Match`es, from https://github.com/yt-dlp/yt-dlp/pull/5174
* always return list when branching, from https://github.com/yt-dlp/yt-dlp/pull/5170
dirkf [Mon, 17 Jul 2023 19:39:11 +0000 (20:39 +0100)]
[compat] Fix old Pythons broken loading of valueless cookie attributes
Cookie string parsing in Py 2.6.9, probably earlier, requires `=`.
Also 3.2, though the CPython code appears to be OK: 3.1 was also wrong.
dirkf [Thu, 13 Jul 2023 19:14:50 +0000 (20:14 +0100)]
[test] Fixes for old Pythons
dirkf [Tue, 11 Jul 2023 20:51:32 +0000 (21:51 +0100)]
[doc] Warn against setting cookies with --add-header
bashonly [Tue, 4 Jul 2023 21:40:56 +0000 (16:40 -0500)]
[downloader/external] Fix cookie support
Simon Sawicki [Tue, 4 Jul 2023 19:41:04 +0000 (21:41 +0200)]
[core] Process header cookies on loading
bashonly [Tue, 4 Jul 2023 19:03:39 +0000 (14:03 -0500)]
[utils] `YoutubeDLCookieJar`: Add `get_cookie_header` and `get_cookies_for_url` methods
dirkf [Wed, 7 Jun 2023 18:38:54 +0000 (19:38 +0100)]
[core] Remove `Cookie` header on redirect to prevent leaks
Adated from yt-dlp/yt-dlp-ghsa-v8mc-9377-rwjj/pull/1/commits/
101caac
Thx coletdjnz
dirkf [Wed, 7 Jun 2023 13:51:50 +0000 (14:51 +0100)]
[core] Update redirect handling from yt-dlp
* Thx coletdjnz: https://github.com/yt-dlp/yt-dlp/pull/7094
* add test that redirected `POST` loses its `Content-Type`
dirkf [Fri, 7 Jul 2023 17:51:38 +0000 (18:51 +0100)]
[compat] Add Request and HTTPClient compat for redirect
* support `method` parameter of `Request.__init__` (Py 2 and old Py 3)
* support `getcode` method of compat_http_client.HTTPResponse (Py 2)
dirkf [Mon, 17 Jul 2023 19:47:58 +0000 (20:47 +0100)]
[test] Make skipped tests in test_execution work with Py 2.6
dirkf [Fri, 7 Jul 2023 17:45:31 +0000 (18:45 +0100)]
[core] Align error reporting methods with yt-dlp
dirkf [Thu, 6 Jul 2023 14:46:22 +0000 (15:46 +0100)]
[utils] Add {expected_type} and Iterable support to traverse_obj()
dirkf [Fri, 7 Jul 2023 17:41:32 +0000 (18:41 +0100)]
[core] Revert version display from
b8a86dc
dirkf [Tue, 4 Jul 2023 15:06:21 +0000 (16:06 +0100)]
[Misc] Fixes for 2.6 compatibility
dirkf [Fri, 30 Jun 2023 02:52:39 +0000 (03:52 +0100)]
[workflows/ci.yml] Fix test support for Py 2.6
dirkf [Thu, 29 Jun 2023 14:27:12 +0000 (15:27 +0100)]
[workflows/ci.yml] Restore test support for Py 3.2
dirkf [Thu, 22 Jun 2023 23:15:06 +0000 (00:15 +0100)]
[workflows/ci.yml] Restore test support for Py 3.3, 3.4, and add 2.6