NewPipeExtractor

Commit Graph

Author	SHA1	Message	Date
AudricV	3760bd70a8	[YouTube] Add base implementation for show InfoItems As there are multiple show UI elements which share a lot of common data, a base implementation, an abstract class named YoutubeBaseShowInfoItemExtractor, has been created to handle common cases.	2024-04-10 21:30:47 +02:00
AudricV	65dfc26f13	[YouTube] Move channel header's verified status code to YoutubeChannelHelper This code will be used by YoutubeChannelTabExtractor to return whether the channel is verified in the corresponding property of all InfoItems from this channel. Also throw an exception when we cannot get the verified status of a channel in YoutubeChannelExtractor due to a missing channelHeader, if the channel has no channelAgeGateRenderer.	2024-04-10 21:30:47 +02:00
AudricV	5f0faf34d7	[YouTube] Support playlists as URL navigation endpoints	2024-04-10 21:30:47 +02:00
AudricV	944d3723cd	[YouTube] Do not get twice runs array in YoutubeParsingHelper The runs object was computed twice in getTextFromObject and getUrlFromObject methods, leading to unneeded search costs. This has been avoided by storing the array in method variables.	2024-04-10 21:30:46 +02:00
Stypox	02274d5395	[YouTube] Avoid XSS attacks in description or comments	2024-04-08 11:21:31 +02:00
Stypox	a90237816a	[YouTube] Cleanup description helper Remove unneeded isClose field, and make constants private	2024-04-08 11:21:31 +02:00
Stypox	b80c3f5d51	[YouTube] Replace link text with accessibility label	2024-04-08 00:14:28 +02:00
Stypox	09732d6785	[YouTube] Add support for styles in attributed descriptions Also refactor descriptions parsing.	2024-04-04 21:14:27 +02:00
AudricV	293c3e9e47	[YouTube] Support new A/B tested comments data Also improve current comments code by removing outdated comment renderer data.	2024-04-04 21:14:26 +02:00
Stypox	e5b30ae8c3	Merge pull request #1151 from Profpatsch/localization-return-optional LocaleCompat.forLanguageTag: return Optional if parsing fails	2024-03-29 13:50:48 +01:00
Stypox	23fc7aa209	Throw ParsingException instead of IllegalArg	2024-03-29 13:44:42 +01:00
Stypox	fb468a23f4	Merge pull request #1142 from TeamNewPipe/peertube-v6 [PeerTube] Add support for PeerTube v6 features	2024-03-29 12:25:38 +01:00
Stypox	6589e2c15d	Merge pull request #1148 from Stypox/mediaccc-channel-tab-handler [MediaCCC] Allow obtaining channel tab link handler	2024-03-28 13:45:05 +01:00
Stypox	c57016b79b	Make getCommentText @Nonnull	2024-03-27 15:26:06 +01:00
TobiGr	aaccfecda8	[YouTube] Detect new account termination messages	2024-03-20 14:57:41 +01:00
TobiGr	15e0e74b48	[PeerTube] Add support for stream frames/storyboards extraction Implement PeerTubeStreamExtractor.getFrames()	2024-01-29 10:22:06 +01:00
Profpatsch	7408173246	LocaleCompat.forLanguageTag: return Optional if parsing fails It’s not obvious that the function will fail in some cases and throw an `IllegalArgumentException`. So instead of just failing if parsing fails, return an Optional that all callers have to decide what to do (e.g. the YoutubeExtractor can just ignore the locale in that case, like it does with most other fields in the json if they are unexpected).	2024-01-07 14:31:34 +01:00
Stypox	aaf3231fc7	[MediaCCC] Fix lambda link handler keeping reference to extractor This caused problems in NewPipe, because extractors are not serializable, and well, keeping references to them is a bad idea anyway.	2023-12-30 23:23:19 +01:00
Stypox	cc9ade962e	[MediaCCC] Allow obtaining channel tab extractor from scratch i.e. without needing to pass through the conference/channel extractor This was needed because clients (like NewPipe) might rely on link handlers to hold as little data as possible, since they might be kept around for long or passed around in system transactions, so this commit allows obtaining a standalone link handler that does not hold a JsonObject within itself.	2023-12-30 22:53:27 +01:00
petlyh	4408e2d0ac	[YouTube] Add Albums channel tab	2023-12-30 14:01:30 +01:00
petlyh	2b2c1546d1	Avoid PeerTube accepting non-URLs	2023-12-29 12:27:39 +01:00
Tobi	1e93b1dc20	Merge pull request #1135 from Stypox/yt-emergency-info [YouTube] Implement emergency meta info	2023-12-29 12:01:40 +01:00
dragfyre	65e7bc5b95	Update PeertubeTrendingLinkHandlerFactory.java correcting Peertube local trending api URL (per #10685 in main NewPipe repo); see https://docs.joinpeertube.org/api-rest-reference.html#tag/Video/operation/getVideos	2023-12-28 14:50:31 +07:00
Stypox	fc54fb2fdb	Merge pull request #1140 from Stypox/yt-shorts-no-duration [YouTube] Always return -1 as duration of Shorts returned inside reel items	2023-12-21 21:52:40 +01:00
Stypox	0518487d26	Fix SearchInfo's non-null MetaInfo being null when initialized or when an extraction error occurs (#1141 ) The meta info might have been null either when SearchInfo is first initialized, or when extractor.getMetaInfo() throws an exception in getInfo(). This caused NewPipe to crash instead of showing a nice error in https://www.reddit.com/r/youtube/comments/184ttmw/what_exactly_about_blue_whales_has_youtube_so/.	2023-12-21 21:39:25 +01:00
Stypox	5b59a1a8c5	[YouTube] Move meta info extraction to separate file YoutubeParsingHelper was longer than 2000 lines which caused checkstyle issues	2023-12-21 21:19:08 +01:00
Stypox	b8e12dd76c	[YouTube] Implement emergency meta info YouTube provides that meta info panel when users search for really sensitive content like suicide (e.g. "blue whale"). It contains: - an encouragement as title (e.g. "We are with you") - a phone number as action - details about how to call the phone number (e.g. availability) - an url pointing to the website of an association Also add a test that just checks if a meta info is properly extracted	2023-12-21 21:19:08 +01:00
Stypox	2938067c2c	[YouTube] Shorts don't provide a duration anymore	2023-12-21 20:41:01 +01:00
AudricV	56ab35423e	[YouTube] Fix potential NullPointerException in YoutubeSearchExtractor.getSearchSuggestion	2023-12-08 21:46:48 +01:00
AudricV	6ba8251be1	[YouTube] Bypass crisis resources blocking search results These crisis resources are preventing search results to be returned. See https://support.google.com/youtube/answer/10726080?hl=en for more info on them. This commit changes search parameters to include the property allowing to show search results.	2023-12-08 21:46:47 +01:00
AudricV	7dea2d0d27	[YouTube] Remove Channels channel tab support This tab has been removed by YouTube.	2023-12-08 21:46:47 +01:00
AudricV	3782d9a02a	[YouTube] Support new A/B tested like data and avoid like count conversion from integer to long Also make minor improvements to current like data extraction and remove previous like count data support, as it is not returned anymore.	2023-12-08 21:46:46 +01:00
AudricV	b71ce1123f	[YouTube] Extract only search results corresponding to a search type YouTube returns sometimes videos inside channel search results. As we only want results corresponding to the type we requested, this commits makes YoutubeSearchExtractor ignoring non-requested search results we get, using the extractor LinkHandler's first content filter value. Also remove an unneeded exception throwing declaration in YoutubeSearchExtractor.	2023-12-08 21:46:46 +01:00
AudricV	ff8ed7247f	[YouTube] Switch to new consent cookie Also move the documentation of the consent in its setter method in order to be accessible publicly and improve it.	2023-12-08 21:46:46 +01:00
AudricV	2c941794c0	[YouTube] Add utcOffsetMinutes to all InnerTube payloads This should make returned dates consistent between timezones and countries on which the extractor is ran. It was previously only set on YouTube Music search continuations.	2023-12-08 21:46:46 +01:00
AudricV	d97c9e0db1	[YouTube] Improve payloads and URLs of InnerTube requests For every InnerTube request: - Always add a `request` object with the following properties: - "internalExperimentFlags" set to an empty array; - "useSsl" set to "true"; - "lockedSafetyMode" set to "false". - Use proper TODO comment to provide a way to enable restricted mode on every request and add it on requests on which it wasn't present. For YouTube Music: - Remove alt query parameter, as it is not used anymore by the website; - Add prettyPrint query parameter with false value on YouTube Music search continuations.	2023-12-08 21:46:45 +01:00
AudricV	8a9ebcc373	[YouTube] Update InnerTube clients' version and devices' OS version and model	2023-12-08 21:46:45 +01:00
FineFindus	34b05a0dda	feat(youtube/comments): support creator replies	2023-10-09 16:33:43 +02:00
FineFindus	c1784a4bdb	[YouTube] Add channel owner to comments	2023-10-09 16:33:43 +02:00
TobiGr	f9846352ea	Fix wrong `@Nullable` annotation	2023-10-09 16:02:57 +02:00
Tobi	d6f5cba6e2	Merge pull request #1111 from FineFindus/feat/creator-reply Add `hasCreatorReply()` to CommentsInfoItem	2023-10-09 12:45:56 +02:00
TobiGr	d49f8411d7	[PeerTube] Implement CommentsInfoItemExtractor.hasCreatorReply()	2023-10-09 02:47:12 +02:00
AudricV	c98695fcea	[SoundCloud] Fix extraction of non-JPG images Default image qualities were removed in image URLs with the jpg extension, causing the addition of the image suffix to full non-JPG images URLs and so to invalid image URLs. Only the image quality name with its leading "-" character and the "." character after the name is now removed and replaced by a string format replaced itself with the image quality name for each quality. As the image suffixes do not contain the image extension, the name of image qualities lists has been adapted with these changes and some related comments have been also improved.	2023-10-01 20:33:25 +02:00
AudricV	ac00459c1a	Change requirement of image extensions in ImageSuffix class' Javadoc to a possibility Some services may provide different image formats using the same suffix, without we know what format the service provide. Enforcing an image extension could so lead to provide invalid image URLs, like for SoundCloud PNG images currently. With this documentation change, it is now clear that users of this class decide of whether they want to include image extensions in the suffix. The previous behavior described in the Javadoc was not enforced.	2023-09-30 21:11:09 +02:00
FineFindus	dd7b2d9798	feat(youtube/comments): support creator replies	2023-09-25 10:40:45 +02:00
Youssif Shaaban Alsager	917554acc4	[YouTube] Add support for ultralow audio formats (#1063 )	2023-09-24 19:04:34 +02:00
Christian	fc67d49f59	Update copyright notices Update copyright notices to comply to GPLv3 and change NewPipe to NewPipe Extractor on some notices that were not updated.	2023-09-22 19:10:15 -03:00
AudricV	714b141ecb	[YouTube] Catch any exception when extracting something from JavaScript's base player	2023-09-21 21:59:33 +02:00
AudricV	588c6a8422	[YouTube] Quote signature deobfuscation function name and add semicolon only where needed	2023-09-21 21:59:33 +02:00
AudricV	a04bc320de	[YouTube] Convert signature timestamp to integer The signature timestamp is used as a number by HTML5 clients, so it should be used in the same way by the extractor too instead of being a string. As the timestamp doesn't seem to exceed 5 digits, an integer is used to store its value.	2023-09-21 21:59:32 +02:00

1 2 3 4 5 ...

1524 Commits