NewPipeExtractor

Commit Graph

Author	SHA1	Message	Date
Stypox	c57016b79b	Make getCommentText @Nonnull	2024-03-27 15:26:06 +01:00
TobiGr	aaccfecda8	[YouTube] Detect new account termination messages	2024-03-20 14:57:41 +01:00
petlyh	4408e2d0ac	[YouTube] Add Albums channel tab	2023-12-30 14:01:30 +01:00
Tobi	1e93b1dc20	Merge pull request #1135 from Stypox/yt-emergency-info [YouTube] Implement emergency meta info	2023-12-29 12:01:40 +01:00
Stypox	5b59a1a8c5	[YouTube] Move meta info extraction to separate file YoutubeParsingHelper was longer than 2000 lines which caused checkstyle issues	2023-12-21 21:19:08 +01:00
Stypox	b8e12dd76c	[YouTube] Implement emergency meta info YouTube provides that meta info panel when users search for really sensitive content like suicide (e.g. "blue whale"). It contains: - an encouragement as title (e.g. "We are with you") - a phone number as action - details about how to call the phone number (e.g. availability) - an url pointing to the website of an association Also add a test that just checks if a meta info is properly extracted	2023-12-21 21:19:08 +01:00
Stypox	2938067c2c	[YouTube] Shorts don't provide a duration anymore	2023-12-21 20:41:01 +01:00
AudricV	56ab35423e	[YouTube] Fix potential NullPointerException in YoutubeSearchExtractor.getSearchSuggestion	2023-12-08 21:46:48 +01:00
AudricV	6ba8251be1	[YouTube] Bypass crisis resources blocking search results These crisis resources prevent search results from being returned. See https://support.google.com/youtube/answer/10726080?hl=en for more info on them. This commit changes search parameters to include the property that allows search results to be shown.	2023-12-08 21:46:47 +01:00
AudricV	7dea2d0d27	[YouTube] Remove Channels channel tab support This tab has been removed by YouTube.	2023-12-08 21:46:47 +01:00
AudricV	3782d9a02a	[YouTube] Support new A/B tested like data and avoid like count conversion from integer to long Also make minor improvements to current like data extraction and remove previous like count data support, as it is not returned anymore.	2023-12-08 21:46:46 +01:00
AudricV	b71ce1123f	[YouTube] Extract only search results corresponding to a search type YouTube sometimes returns videos inside channel search results. As we only want results corresponding to the type we requested, this commit makes YoutubeSearchExtractor ignore non-requested search results we get, using the extractor LinkHandler's first content filter value. Also remove an unneeded exception throwing declaration in YoutubeSearchExtractor.	2023-12-08 21:46:46 +01:00
AudricV	ff8ed7247f	[YouTube] Switch to new consent cookie Also move the documentation of the consent in its setter method in order to be accessible publicly and improve it.	2023-12-08 21:46:46 +01:00
AudricV	2c941794c0	[YouTube] Add utcOffsetMinutes to all InnerTube payloads This should make returned dates consistent between timezones and countries on which the extractor is run. It was previously only set on YouTube Music search continuations.	2023-12-08 21:46:46 +01:00
AudricV	d97c9e0db1	[YouTube] Improve payloads and URLs of InnerTube requests For every InnerTube request: - Always add a `request` object with the following properties: - "internalExperimentFlags" set to an empty array; - "useSsl" set to "true"; - "lockedSafetyMode" set to "false". - Use proper TODO comment to provide a way to enable restricted mode on every request and add it on requests on which it wasn't present. For YouTube Music: - Remove alt query parameter, as it is not used anymore by the website; - Add prettyPrint query parameter with false value on YouTube Music search continuations.	2023-12-08 21:46:45 +01:00
AudricV	8a9ebcc373	[YouTube] Update InnerTube clients' version and devices' OS version and model	2023-12-08 21:46:45 +01:00
FineFindus	34b05a0dda	feat(youtube/comments): support creator replies	2023-10-09 16:33:43 +02:00
FineFindus	c1784a4bdb	[YouTube] Add channel owner to comments	2023-10-09 16:33:43 +02:00
FineFindus	dd7b2d9798	feat(youtube/comments): support creator replies	2023-09-25 10:40:45 +02:00
Youssif Shaaban Alsager	917554acc4	[YouTube] Add support for ultralow audio formats (#1063)	2023-09-24 19:04:34 +02:00
Christian	fc67d49f59	Update copyright notices Update copyright notices to comply to GPLv3 and change NewPipe to NewPipe Extractor on some notices that were not updated.	2023-09-22 19:10:15 -03:00
AudricV	714b141ecb	[YouTube] Catch any exception when extracting something from JavaScript's base player	2023-09-21 21:59:33 +02:00
AudricV	588c6a8422	[YouTube] Quote signature deobfuscation function name and add semicolon only where needed	2023-09-21 21:59:33 +02:00
AudricV	a04bc320de	[YouTube] Convert signature timestamp to integer The signature timestamp is used as a number by HTML5 clients, so it should be used in the same way by the extractor too instead of being a string. As the timestamp doesn't seem to exceed 5 digits, an integer is used to store its value.	2023-09-21 21:59:32 +02:00
AudricV	7de3753a81	[YouTube] Refactor JavaScript player management API This commit is introducing breaking changes. For clients, everything is managed in a new class called YoutubeJavaScriptPlayerManager: - caching JavaScript base player code and its extracted code (functions and variables); - getting player signature timestamp; - getting deobfuscated signatures of streaming URLs; - getting streaming URLs with a throttling parameter deobfuscated, if applicable. The class delegates the extraction parts to external package-private classes: - YoutubeJavaScriptExtractor, to extract and download YouTube's JavaScript base player code: it was already present before and has been edited to mainly remove the previous caching system and made it package-private; - YoutubeSignatureUtils, for player signature timestamp and signature deobfuscation function of streaming URLs, added in a recent commit; - YoutubeThrottlingParameterUtils, which was originally YoutubeThrottlingDecrypter, for throttling parameter of streaming URLs deobfuscation function and checking whether this parameter is in a streaming URL. YoutubeJavaScriptPlayerManager caches and then runs the extracted code if it has been executed successfully. The cache system of throttling parameters deobfuscated values has been kept, its size can be obtained using the getThrottlingParametersCacheSize method and can be cleared independently using the clearThrottlingParametersCache method. If an exception occurs during the extraction or the parsing of a function property which is not related to JavaScript base player code fetching, it is stored until caches are cleared, making subsequent failing extraction calls of the requested function or property faster and consuming less resources, as the result should be the same until the base player code changes. All caches can be reset using the clearAllCaches method of YoutubeJavaScriptPlayerManager. Classes using JavaScript base player code and utilities directly (in the code and its tests) have been also updated in this commit.	2023-09-21 21:59:32 +02:00
AudricV	6884d191cd	[YouTube] Add utility class around signatures and fix signature deobfuscation function extraction The goal of this class is to decouple the extraction of signature timestamp and signature deobfuscation function from YoutubeStreamExtractor. The extraction of the signature deobfuscation function has been also adapted to support the latest YouTube player versions. This new class, YoutubeSignatureUtils, doesn't store anything temporary such as a copy of the player code, which has to be passed where required. It is not public, as it will be used by a JavaScript player manager class in the future, in order to handle in a better way fetching, caching and resetting cache of the player code.	2023-09-21 21:59:26 +02:00
AudricV	266cd1f76b	[YouTube] Apply changes in YoutubeMusicSearchExtractor and split its InfoItemExtractors into separate classes Splitting YoutubeMusicSearchExtractor's InfoItemExtractors into separate classes (YoutubeMusicSongOrVideoInfoItemExtractor, YoutubeMusicAlbumOrPlaylistInfoItemExtractor and YoutubeMusicArtistInfoItemExtractor) simplifies YoutubeMusicSearchExtractor and makes it easier to read and apply changes to InfoItems (no more losing at least a quarter of a line due to indentation). These InfoItems, in which the image changes have been applied, don't extend the YouTube ones anymore, as most methods were overridden and the few ones that are not don't apply in YouTube Music items responses, so it was useless to extend them. The code of YoutubeMusicSearchExtractor has also been improved a bit.	2023-08-12 22:56:27 +02:00
AudricV	c1981ed54f	[YouTube] Apply changes in Extractors except YoutubeMusicSearchExtractor Also improve a bit some code related to the changes.	2023-08-12 22:56:27 +02:00
AudricV	4cc99f9ce1	[YouTube] Apply changes in InfoItemExtractors except YouTube Music ones	2023-08-12 22:56:27 +02:00
AudricV	adfad086ac	[YouTube] Add utility methods to get images from InfoItems and thumbnails arrays Unmodifiable lists of Images are returned, parsed from a given YouTube "thumbnails" JSON array. These methods will be used in all YouTube extractors and InfoItems, as the structures between content types (videos, channels, playlists, ...) are common.	2023-08-12 22:56:27 +02:00
Stypox	7294675aea	Merge pull request #1093 from AudricV/yt_support-shorts-ui-playlists [YouTube] Support Shorts UI in playlists	2023-08-12 11:11:36 +02:00
Stypox	44b664af15	[YouTube] Simplify Optional chains in channel	2023-08-12 11:02:51 +02:00
AudricV	1852031a0b	[YouTube] Support pageHeaderRenderer and interactiveTabbedHeaderRenderer channel headers The addition of this support required to turn the isCarouselHeader boolean into an enum containing all supported channel headers named HeaderType. Also assert that the page has been fetched where needed to avoid NullPointerExceptions when the channel page has been not fetched and remove the getChannelHeaderJson method in YoutubeChannelExtractor, method for which its code has been moved to its sole usage after the new headers support changes.	2023-08-08 19:12:27 +02:00
AudricV	e6f371fb94	[YouTube] Support Shorts UI in playlists Also remove an outdated A/B test comment.	2023-08-07 19:01:08 +02:00
Stypox	9d3761a371	[YouTube] Directly use playlist collector in channel tabs wrapper Note that this introduces a "Raw use of parameterized class 'InfoItemsPage'" warning, but it can be ignored since the type missing would be <InfoItem>, and StreamInfoItem extends InfoItem	2023-08-06 21:13:25 +02:00
Stypox	e34b4f1978	[YouTube] Avoid using Consumer	2023-08-06 13:02:31 +02:00
Stypox	ef67c7cd74	[YouTube] Simplify usage of channel header json	2023-08-06 13:02:31 +02:00
Stypox	a104cf3227	[YouTube] Fix docs in channel helper	2023-08-06 13:02:31 +02:00
AudricV	7366eab156	[YouTube] Add support for channel tabs and tags and age-restricted channels Support of tags and videos, shorts, live, playlists and channels tabs has been added for non-age restricted channels. Age-restricted channels are now also supported and always return the videos, shorts and live tabs, accessible using system playlists. These tabs are the only ones which can be accessed using YouTube's desktop website without being logged-in. The videos channel tab parameter has been updated to the one used by the desktop website and when a channel extraction is fetched, this tab is returned in the list of tabs as a cached one in the corresponding link handler. Visitor data support per request has been added, as a valid visitor data is required to fetch continuations with contents on the shorts tab. It is only used in this case to enhance privacy. A dedicated shorts UI elements (reelItemRenderers) extractor has been added, YoutubeReelInfoItemExtractor. These elements do not provide the exact view count, any uploader info (name, URL, avatar, verified status) and the upload date. All service's LinkHandlers are now using the singleton pattern and some code has been also improved on the files changed. Co-authored-by: ThetaDev <t.testboy@gmail.com> Co-authored-by: Stypox <stypox@pm.me>	2023-08-06 12:15:04 +02:00
Stypox	3faaf4301c	Merge pull request #1087 from AudricV/yt_js-extractor-improvements-and-fixes [YouTube] Improve and fix YoutubeJavaScriptExtractor	2023-08-06 12:01:00 +02:00
Stypox	7c70fef197	Merge pull request #1089 from TeamNewPipe/ccc [media.ccc.de] Only extract kiosk live stream rooms if they are streaming	2023-08-06 10:12:04 +02:00
TobiGr	340095515d	Make Kiosk IDs accessible if possible	2023-08-05 03:18:40 +02:00
Kavin	25082d78b0	Replace SecureRandom with Random	2023-08-03 23:00:02 +01:00
AudricV	a3d160edab	[YouTube] Improve and fix YoutubeJavaScriptExtractor - Enhance documentation; - Fix the regular expression fallback on HTML embed watch page; - Use HTML scripts tag search first instead of the regular expression approach, now used as a last resort; - Compile regular expressions only once, in order to improve the performance of subsequent extraction calls when clearing the cache; - Provide original exceptions when fetching or parsing pages on which the base JavaScript's player could be found failed, allowing clients to detect network errors when they are the cause of the failures for instance; - Remove delegate method which was not taking a video ID and hardcoding one, as we can provide the video ID in all cases or do not provide a video ID at worst; - Rename and make extraction methods package-private, as they are not intended to be used publicly. These breaking internal changes have been applied where needed, in YoutubeJavaScriptExtractorTest and YoutubeStreamExtractor (in which an unneeded initStsFromPlayerJsIfNeeded call has been removed).	2023-08-02 23:05:08 +02:00
AudricV	f1fa84b4e3	[YouTube] Don't throw an exception when there is no banner available on a channel Channels may not have a banner, so no exception should be thrown if no banner is found.	2023-08-01 12:40:20 +02:00
Tobi	39a911db9f	Merge pull request #1084 from AudricV/yt_android-403s-workaround-and-streams-tests-fixes [YouTube] Workaround again 403 HTTP issues on the ANDROID InnerTube client and fix stream tests	2023-07-31 23:51:10 +02:00
AudricV	164c8e3abb	[YouTube] Workaround again 403 HTTP issues on the Android client by using new player parameters These parameters are the only ones currently known to bypass 403 HTTP issues related to failure of passing Android client integrity checks, as the ones of stories (and the base of the shorts ones) do not work anymore, which may be related to end of this format on the service.	2023-07-22 20:22:16 +02:00
FireMasterK	6db0d116fe	Add support for AV1 itags.	2023-07-22 13:23:44 +02:00
AudricV	4e22c5ee87	[YouTube] Support multiple declarations for throttling parameter function name array Also moved the corresponding regex parts in static constants for easier future modifications	2023-06-26 15:25:53 +02:00
Kavin	d961d349c3	[YouTube] Check whether player responses are valid for all InnerTube clients used (#1070) Co-authored-by: Audric V <74829229+AudricV@users.noreply.github.com>	2023-06-18 21:54:52 +02:00

929 Commits