Compare commits

1 Commits

2017.02.17...2016.07.26

	| Author | SHA1 | Date | |
|---|---|---|---|
|   | 3c519ad54d | 
							
								
								
									
										10
									
								
								.github/ISSUE_TEMPLATE.md
									
									
									
									
										vendored
									
									
								
							
							
						
						
									
									
									
								
							| @@ -6,8 +6,8 @@ | ||||
|  | ||||
| --- | ||||
|  | ||||
| ### Make sure you are using the *latest* version: run `youtube-dl --version` and ensure your version is *2017.02.17*. If it's not read [this FAQ entry](https://github.com/rg3/youtube-dl/blob/master/README.md#how-do-i-update-youtube-dl) and update. Issues with outdated version will be rejected. | ||||
| - [ ] I've **verified** and **I assure** that I'm running youtube-dl **2017.02.17** | ||||
| ### Make sure you are using the *latest* version: run `youtube-dl --version` and ensure your version is *2016.07.26*. If it's not read [this FAQ entry](https://github.com/rg3/youtube-dl/blob/master/README.md#how-do-i-update-youtube-dl) and update. Issues with outdated version will be rejected. | ||||
| - [ ] I've **verified** and **I assure** that I'm running youtube-dl **2016.07.26** | ||||
|  | ||||
| ### Before submitting an *issue* make sure you have: | ||||
| - [ ] At least skimmed through [README](https://github.com/rg3/youtube-dl/blob/master/README.md) and **most notably** [FAQ](https://github.com/rg3/youtube-dl#faq) and [BUGS](https://github.com/rg3/youtube-dl#bugs) sections | ||||
| @@ -35,7 +35,7 @@ $ youtube-dl -v <your command line> | ||||
| [debug] User config: [] | ||||
| [debug] Command-line args: [u'-v', u'http://www.youtube.com/watch?v=BaW_jenozKcj'] | ||||
| [debug] Encodings: locale cp1251, fs mbcs, out cp866, pref cp1251 | ||||
| [debug] youtube-dl version 2017.02.17 | ||||
| [debug] youtube-dl version 2016.07.26 | ||||
| [debug] Python version 2.7.11 - Windows-2003Server-5.2.3790-SP2 | ||||
| [debug] exe versions: ffmpeg N-75573-g1d0487f, ffprobe N-75573-g1d0487f, rtmpdump 2.4 | ||||
| [debug] Proxy map: {} | ||||
| @@ -50,11 +50,9 @@ $ youtube-dl -v <your command line> | ||||
| - Single video: https://youtu.be/BaW_jenozKc | ||||
| - Playlist: https://www.youtube.com/playlist?list=PL4lCao7KL_QFVb7Iudeipvc2BCavECqzc | ||||
|  | ||||
| Note that **youtube-dl does not support sites dedicated to [copyright infringement](https://github.com/rg3/youtube-dl#can-you-add-support-for-this-anime-video-site-or-site-which-shows-current-movies-for-free)**. In order for site support request to be accepted all provided example URLs should not violate any copyrights. | ||||
|  | ||||
| --- | ||||
|  | ||||
| ### Description of your *issue*, suggested solution and other information | ||||
|  | ||||
| Explanation of your *issue* in arbitrary form goes here. Please make sure the [description is worded well enough to be understood](https://github.com/rg3/youtube-dl#is-the-description-of-the-issue-itself-sufficient). Provide as much context and examples as possible. | ||||
| If work on your *issue* requires account credentials please provide them or explain how one can obtain them. | ||||
| If work on your *issue* required an account credentials please provide them or explain how one can obtain them. | ||||
|   | ||||
							
								
								
									
										4
									
								
								.github/ISSUE_TEMPLATE_tmpl.md
									
									
									
									
										vendored
									
									
								
							
							
						
						
									
									
									
								
							| @@ -50,11 +50,9 @@ $ youtube-dl -v <your command line> | ||||
| - Single video: https://youtu.be/BaW_jenozKc | ||||
| - Playlist: https://www.youtube.com/playlist?list=PL4lCao7KL_QFVb7Iudeipvc2BCavECqzc | ||||
|  | ||||
| Note that **youtube-dl does not support sites dedicated to [copyright infringement](https://github.com/rg3/youtube-dl#can-you-add-support-for-this-anime-video-site-or-site-which-shows-current-movies-for-free)**. In order for site support request to be accepted all provided example URLs should not violate any copyrights. | ||||
|  | ||||
| --- | ||||
|  | ||||
| ### Description of your *issue*, suggested solution and other information | ||||
|  | ||||
| Explanation of your *issue* in arbitrary form goes here. Please make sure the [description is worded well enough to be understood](https://github.com/rg3/youtube-dl#is-the-description-of-the-issue-itself-sufficient). Provide as much context and examples as possible. | ||||
| If work on your *issue* requires account credentials please provide them or explain how one can obtain them. | ||||
| If work on your *issue* required an account credentials please provide them or explain how one can obtain them. | ||||
|   | ||||
							
								
								
									
										5
									
								
								.github/PULL_REQUEST_TEMPLATE.md
									
									
									
									
										vendored
									
									
								
							
							
						
						
									
									
									
								
							| @@ -10,13 +10,8 @@ | ||||
| - [ ] At least skimmed through [adding new extractor tutorial](https://github.com/rg3/youtube-dl#adding-support-for-a-new-site) and [youtube-dl coding conventions](https://github.com/rg3/youtube-dl#youtube-dl-coding-conventions) sections | ||||
| - [ ] [Searched](https://github.com/rg3/youtube-dl/search?q=is%3Apr&type=Issues) the bugtracker for similar pull requests | ||||
|  | ||||
| ### In order to be accepted and merged into youtube-dl each piece of code must be in public domain or released under [Unlicense](http://unlicense.org/). Check one of the following options: | ||||
| - [ ] I am the original author of this code and I am willing to release it under [Unlicense](http://unlicense.org/) | ||||
| - [ ] I am not the original author of this code but it is in public domain or released under [Unlicense](http://unlicense.org/) (provide reliable evidence) | ||||
|  | ||||
| ### What is the purpose of your *pull request*? | ||||
| - [ ] Bug fix | ||||
| - [ ] Improvement | ||||
| - [ ] New extractor | ||||
| - [ ] New feature | ||||
|  | ||||
|   | ||||
							
								
								
									
										5
									
								
								.gitignore
									
									
									
									
										vendored
									
									
								
							
							
						
						
									
									
									
								
							| @@ -29,11 +29,6 @@ updates_key.pem | ||||
| *.m4a | ||||
| *.m4v | ||||
| *.mp3 | ||||
| *.3gp | ||||
| *.wav | ||||
| *.ape | ||||
| *.mkv | ||||
| *.swf | ||||
| *.part | ||||
| *.swp | ||||
| test/testdata | ||||
|   | ||||
| @@ -6,12 +6,8 @@ python: | ||||
|   - "3.3" | ||||
|   - "3.4" | ||||
|   - "3.5" | ||||
|   - "3.6" | ||||
| sudo: false | ||||
| env: | ||||
|   - YTDL_TEST_SET=core | ||||
|   - YTDL_TEST_SET=download | ||||
| script: ./devscripts/run_tests.sh | ||||
| script: nosetests test --verbose | ||||
| notifications: | ||||
|   email: | ||||
|     - filippo.valsorda@gmail.com | ||||
|   | ||||
							
								
								
									
										25
									
								
								AUTHORS
									
									
									
									
									
								
							
							
						
						
									
									
									
									
									
									
								
							| @@ -26,7 +26,7 @@ Albert Kim | ||||
| Pierre Rudloff | ||||
| Huarong Huo | ||||
| Ismael Mejía | ||||
| Steffan Donal | ||||
| Steffan 'Ruirize' James | ||||
| Andras Elso | ||||
| Jelle van der Waa | ||||
| Marcin Cieślak | ||||
| @@ -179,26 +179,3 @@ Jakub Adam Wieczorek | ||||
| Aleksandar Topuzović | ||||
| Nehal Patel | ||||
| Rob van Bekkum | ||||
| Petr Zvoníček | ||||
| Pratyush Singh | ||||
| Aleksander Nitecki | ||||
| Sebastian Blunt | ||||
| Matěj Cepl | ||||
| Xie Yanbo | ||||
| Philip Xu | ||||
| John Hawkinson | ||||
| Rich Leeper | ||||
| Zhong Jianxin | ||||
| Thor77 | ||||
| Mattias Wadman | ||||
| Arjan Verwer | ||||
| Costy Petrisor | ||||
| Logan B | ||||
| Alex Seiler | ||||
| Vijay Singh | ||||
| Paul Hartmann | ||||
| Stephen Chen | ||||
| Fabian Stahl | ||||
| Bagira | ||||
| Odd Stråbø | ||||
| Philip Herzog | ||||
|   | ||||
| @@ -12,7 +12,7 @@ $ youtube-dl -v <your command line> | ||||
| [debug] Proxy map: {} | ||||
| ... | ||||
| ``` | ||||
| **Do not post screenshots of verbose logs; only plain text is acceptable.** | ||||
| **Do not post screenshots of verbose log only plain text is acceptable.** | ||||
|  | ||||
| The output (including the first lines) contains important debugging information. Issues without the full output are often not reproducible and therefore do not get solved in short order, if ever. | ||||
|  | ||||
| @@ -46,7 +46,7 @@ Make sure that someone has not already opened the issue you're trying to open. S | ||||
|  | ||||
| ###  Why are existing options not enough? | ||||
|  | ||||
| Before requesting a new feature, please have a quick peek at [the list of supported options](https://github.com/rg3/youtube-dl/blob/master/README.md#options). Many feature requests are for features that actually exist already! Please, absolutely do show off your work in the issue report and detail how the existing similar options do *not* solve your problem. | ||||
| Before requesting a new feature, please have a quick peek at [the list of supported options](https://github.com/rg3/youtube-dl/blob/master/README.md#synopsis). Many feature requests are for features that actually exist already! Please, absolutely do show off your work in the issue report and detail how the existing similar options do *not* solve your problem. | ||||
|  | ||||
| ###  Is there enough context in your bug report? | ||||
|  | ||||
| @@ -58,7 +58,7 @@ We are then presented with a very complicated request when the original problem | ||||
|  | ||||
| Some of our users seem to think there is a limit of issues they can or should open. There is no limit of issues they can or should open. While it may seem appealing to be able to dump all your issues into one ticket, that means that someone who solves one of your issues cannot mark the issue as closed. Typically, reporting a bunch of issues leads to the ticket lingering since nobody wants to attack that behemoth, until someone mercifully splits the issue into multiple ones. | ||||
|  | ||||
| In particular, every site support request issue should only pertain to services at one site (generally under a common domain, but always using the same backend technology). Do not request support for vimeo user videos, White house podcasts, and Google Plus pages in the same issue. Also, make sure that you don't post bug reports alongside feature requests. As a rule of thumb, a feature request does not include outputs of youtube-dl that are not immediately related to the feature at hand. Do not post reports of a network error alongside the request for a new video service. | ||||
| In particular, every site support request issue should only pertain to services at one site (generally under a common domain, but always using the same backend technology). Do not request support for vimeo user videos, Whitehouse podcasts, and Google Plus pages in the same issue. Also, make sure that you don't post bug reports alongside feature requests. As a rule of thumb, a feature request does not include outputs of youtube-dl that are not immediately related to the feature at hand. Do not post reports of a network error alongside the request for a new video service. | ||||
|  | ||||
| ###  Is anyone going to need the feature? | ||||
|  | ||||
| @@ -66,7 +66,7 @@ Only post features that you (or an incapacitated friend you can personally talk | ||||
|  | ||||
| ###  Is your question about youtube-dl? | ||||
|  | ||||
| It may sound strange, but some bug reports we receive are completely unrelated to youtube-dl and relate to a different, or even the reporter's own, application. Please make sure that you are actually using youtube-dl. If you are using a UI for youtube-dl, report the bug to the maintainer of the actual application providing the UI. On the other hand, if your UI for youtube-dl fails in some way you believe is related to youtube-dl, by all means, go ahead and report the bug. | ||||
| It may sound strange, but some bug reports we receive are completely unrelated to youtube-dl and relate to a different or even the reporter's own application. Please make sure that you are actually using youtube-dl. If you are using a UI for youtube-dl, report the bug to the maintainer of the actual application providing the UI. On the other hand, if your UI for youtube-dl fails in some way you believe is related to youtube-dl, by all means, go ahead and report the bug. | ||||
|  | ||||
| # DEVELOPER INSTRUCTIONS | ||||
|  | ||||
| @@ -85,16 +85,16 @@ To run the test, simply invoke your favorite test runner, or execute a test file | ||||
| If you want to create a build of youtube-dl yourself, you'll need | ||||
|  | ||||
| * python | ||||
| * make (only GNU make is supported) | ||||
| * make (both GNU make and BSD make are supported) | ||||
| * pandoc | ||||
| * zip | ||||
| * nosetests | ||||
|  | ||||
| ### Adding support for a new site | ||||
|  | ||||
| If you want to add support for a new site, first of all **make sure** this site is **not dedicated to [copyright infringement](README.md#can-you-add-support-for-this-anime-video-site-or-site-which-shows-current-movies-for-free)**. youtube-dl does **not support** such sites thus pull requests adding support for them **will be rejected**. | ||||
| If you want to add support for a new site, first of all **make sure** this site is **not dedicated to [copyright infringement](#can-you-add-support-for-this-anime-video-site-or-site-which-shows-current-movies-for-free)**. youtube-dl does **not support** such sites thus pull requests adding support for them **will be rejected**. | ||||
|  | ||||
| After you have ensured this site is distributing its content legally, you can follow this quick list (assuming your service is called `yourextractor`): | ||||
| After you have ensured this site is distributing it's content legally, you can follow this quick list (assuming your service is called `yourextractor`): | ||||
|  | ||||
| 1. [Fork this repository](https://github.com/rg3/youtube-dl/fork) | ||||
| 2. Check out the source code with: | ||||
| @@ -124,7 +124,7 @@ After you have ensured this site is distributing its content legally, you can fo | ||||
|                 'id': '42', | ||||
|                 'ext': 'mp4', | ||||
|                 'title': 'Video title goes here', | ||||
|                 'thumbnail': r're:^https?://.*\.jpg$', | ||||
|                 'thumbnail': 're:^https?://.*\.jpg$', | ||||
|                 # TODO more properties, either as: | ||||
|                 # * A value | ||||
|                 # * MD5 checksum; start the string with md5: | ||||
| @@ -167,19 +167,19 @@ In any case, thank you very much for your contributions! | ||||
|  | ||||
| This section introduces a guide lines for writing idiomatic, robust and future-proof extractor code. | ||||
|  | ||||
| Extractors are very fragile by nature since they depend on the layout of the source data provided by 3rd party media hosters out of your control and this layout tends to change. As an extractor implementer your task is not only to write code that will extract media links and metadata correctly but also to minimize dependency on the source's layout and even to make the code foresee potential future changes and be ready for that. This is important because it will allow the extractor not to break on minor layout changes thus keeping old youtube-dl versions working. Even though this breakage issue is easily fixed by emitting a new version of youtube-dl with a fix incorporated, all the previous versions become broken in all repositories and distros' packages that may not be so prompt in fetching the update from us. Needless to say, some non rolling release distros may never receive an update at all. | ||||
| Extractors are very fragile by nature since they depend on the layout of the source data provided by 3rd party media hoster out of your control and this layout tend to change. As an extractor implementer your task is not only to write code that will extract media links and metadata correctly but also to minimize code dependency on source's layout changes and even to make the code foresee potential future changes and be ready for that. This is important because it will allow extractor not to break on minor layout changes thus keeping old youtube-dl versions working. Even though this breakage issue is easily fixed by emitting a new version of youtube-dl with fix incorporated all the previous version become broken in all repositories and distros' packages that may not be so prompt in fetching the update from us. Needless to say some may never receive an update at all that is possible for non rolling release distros. | ||||
|  | ||||
| ### Mandatory and optional metafields | ||||
|  | ||||
| For extraction to work youtube-dl relies on metadata your extractor extracts and provides to youtube-dl expressed by an [information dictionary](https://github.com/rg3/youtube-dl/blob/master/youtube_dl/extractor/common.py#L75-L257) or simply *info dict*. Only the following meta fields in the *info dict* are considered mandatory for a successful extraction process by youtube-dl: | ||||
| For extraction to work youtube-dl relies on metadata your extractor extracts and provides to youtube-dl expressed by [information dictionary](https://github.com/rg3/youtube-dl/blob/master/youtube_dl/extractor/common.py#L75-L257) or simply *info dict*. Only the following meta fields in *info dict* are considered mandatory for successful extraction process by youtube-dl: | ||||
|  | ||||
|  - `id` (media identifier) | ||||
|  - `title` (media title) | ||||
|  - `url` (media download URL) or `formats` | ||||
|  | ||||
| In fact only the last option is technically mandatory (i.e. if you can't figure out the download location of the media the extraction does not make any sense). But by convention youtube-dl also treats `id` and `title` as mandatory. Thus the aforementioned metafields are the critical data that the extraction does not make any sense without and if any of them fail to be extracted then the extractor is considered completely broken. | ||||
| In fact only the last option is technically mandatory (i.e. if you can't figure out the download location of the media the extraction does not make any sense). But by convention youtube-dl also treats `id` and `title` to be mandatory. Thus aforementioned metafields are the critical data the extraction does not make any sense without and if any of them fail to be extracted then extractor is considered completely broken. | ||||
|  | ||||
| [Any field](https://github.com/rg3/youtube-dl/blob/master/youtube_dl/extractor/common.py#L149-L257) apart from the aforementioned ones are considered **optional**. That means that extraction should be **tolerant** to situations when sources for these fields can potentially be unavailable (even if they are always available at the moment) and **future-proof** in order not to break the extraction of general purpose mandatory fields. | ||||
| [Any field](https://github.com/rg3/youtube-dl/blob/master/youtube_dl/extractor/common.py#L149-L257) apart from the aforementioned ones are considered **optional**. That means that extraction should be **tolerate** to situations when sources for these fields can potentially be unavailable (even if they are always available at the moment) and **future-proof** in order not to break the extraction of general purpose mandatory fields. | ||||
|  | ||||
| #### Example | ||||
|  | ||||
| @@ -199,7 +199,7 @@ Assume at this point `meta`'s layout is: | ||||
| } | ||||
| ``` | ||||
|  | ||||
| Assume you want to extract `summary` and put it into the resulting info dict as `description`. Since `description` is an optional meta field you should be ready that this key may be missing from the `meta` dict, so that you should extract it like: | ||||
| Assume you want to extract `summary` and put into resulting info dict as `description`. Since `description` is optional metafield you should be ready that this key may be missing from the `meta` dict, so that you should extract it like: | ||||
|  | ||||
| ```python | ||||
| description = meta.get('summary')  # correct | ||||
| @@ -211,7 +211,7 @@ and not like: | ||||
| description = meta['summary']  # incorrect | ||||
| ``` | ||||
|  | ||||
| The latter will break extraction process with `KeyError` if `summary` disappears from `meta` at some later time but with the former approach extraction will just go ahead with `description` set to `None` which is perfectly fine (remember `None` is equivalent to the absence of data). | ||||
| The latter will break extraction process with `KeyError` if `summary` disappears from `meta` at some time later but with former approach extraction will just go ahead with `description` set to `None` that is perfectly fine (remember `None` is equivalent for absence of data).  | ||||
|  | ||||
| Similarly, you should pass `fatal=False` when extracting optional data from a webpage with `_search_regex`, `_html_search_regex` or similar methods, for instance: | ||||
|  | ||||
| @@ -231,21 +231,21 @@ description = self._search_regex( | ||||
|     webpage, 'description', default=None) | ||||
| ``` | ||||
|  | ||||
| On failure this code will silently continue the extraction with `description` set to `None`. That is useful for metafields that may or may not be present. | ||||
| On failure this code will silently continue the extraction with `description` set to `None`. That is useful for metafields that are known to may or may not be present. | ||||
|   | ||||
| ### Provide fallbacks | ||||
|  | ||||
| When extracting metadata try to do so from multiple sources. For example if `title` is present in several places, try extracting from at least some of them. This makes it more future-proof in case some of the sources become unavailable. | ||||
| When extracting metadata try to provide several scenarios for that. For example if `title` is present in several places/sources try extracting from at least some of them. This would make it more future-proof in case some of the sources became unavailable. | ||||
|  | ||||
| #### Example | ||||
|  | ||||
| Say `meta` from the previous example has a `title` and you are about to extract it. Since `title` is a mandatory meta field you should end up with something like: | ||||
| Say `meta` from previous example has a `title` and you are about to extract it. Since `title` is mandatory meta field you should end up with something like: | ||||
|  | ||||
| ```python | ||||
| title = meta['title'] | ||||
| ``` | ||||
|  | ||||
| If `title` disappears from `meta` in future due to some changes on the hoster's side the extraction would fail since `title` is mandatory. That's expected. | ||||
| If `title` disappeares from `meta` in future due to some changes on hoster's side the extraction would fail since `title` is mandatory. That's expected. | ||||
|  | ||||
| Assume that you have some another source you can extract `title` from, for example `og:title` HTML meta of a `webpage`. In this case you can provide a fallback scenario: | ||||
|  | ||||
| @@ -282,7 +282,7 @@ title = self._search_regex( | ||||
|     webpage, 'title', group='title') | ||||
| ``` | ||||
|  | ||||
| Note how you tolerate potential changes in the `style` attribute's value or switch from using double quotes to single for `class` attribute:  | ||||
| Note how you tolerate potential changes in `style` attribute's value or switch from using double quotes to single for `class` attribute:  | ||||
|  | ||||
| The code definitely should not look like: | ||||
|  | ||||
|   | ||||
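
The extractor guidelines touched by the hunks above (tolerant handling of optional fields, and fallbacks for mandatory ones) can be sketched outside youtube-dl as follows. This is only an illustration under stated assumptions: `extract_info`, the shape of `meta`, and the `og:title` regex are hypothetical, and plain `re` stands in for youtube-dl's own `_search_regex` helpers.

```python
import re


def extract_info(meta, webpage):
    """Hypothetical helper: build a tiny info dict from `meta` and `webpage`."""
    # Optional field: .get() yields None instead of raising KeyError
    # when 'summary' is missing from `meta`.
    description = meta.get('summary')

    # Mandatory field with a fallback: try `meta` first, then og:title.
    title = meta.get('title')
    if title is None:
        match = re.search(
            r'<meta[^>]+property=(["\'])og:title\1[^>]+content=(["\'])(?P<title>.+?)\2',
            webpage)
        if match:
            title = match.group('title')
    if title is None:
        # Every source for a mandatory field failed: failing loudly is expected.
        raise ValueError('unable to extract title')

    return {'title': title, 'description': description}
```

With this shape, a missing `summary` leaves `description` as `None`, while a missing `title` is fatal only after both sources have been tried.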
							
								
								
									
										10
									
								
								Makefile
									
									
									
									
									
								
							
							
						
						
									
									
									
									
									
									
								
							| @@ -1,7 +1,7 @@ | ||||
| all: youtube-dl README.md CONTRIBUTING.md README.txt youtube-dl.1 youtube-dl.bash-completion youtube-dl.zsh youtube-dl.fish supportedsites | ||||
|  | ||||
| clean: | ||||
| 	rm -rf youtube-dl.1.temp.md youtube-dl.1 youtube-dl.bash-completion README.txt MANIFEST build/ dist/ .coverage cover/ youtube-dl.tar.gz youtube-dl.zsh youtube-dl.fish youtube_dl/extractor/lazy_extractors.py *.dump *.part* *.info.json *.mp4 *.m4a *.flv *.mp3 *.avi *.mkv *.webm *.3gp *.wav *.ape *.swf *.jpg *.png CONTRIBUTING.md.tmp ISSUE_TEMPLATE.md.tmp youtube-dl youtube-dl.exe | ||||
| 	rm -rf youtube-dl.1.temp.md youtube-dl.1 youtube-dl.bash-completion README.txt MANIFEST build/ dist/ .coverage cover/ youtube-dl.tar.gz youtube-dl.zsh youtube-dl.fish youtube_dl/extractor/lazy_extractors.py *.dump *.part *.info.json *.mp4 *.m4a *.flv *.mp3 *.avi *.mkv *.webm *.jpg *.png CONTRIBUTING.md.tmp ISSUE_TEMPLATE.md.tmp youtube-dl youtube-dl.exe | ||||
| 	find . -name "*.pyc" -delete | ||||
| 	find . -name "*.class" -delete | ||||
|  | ||||
| @@ -12,7 +12,7 @@ SHAREDIR ?= $(PREFIX)/share | ||||
| PYTHON ?= /usr/bin/env python | ||||
|  | ||||
| # set SYSCONFDIR to /etc if PREFIX=/usr or PREFIX=/usr/local | ||||
| SYSCONFDIR = $(shell if [ $(PREFIX) = /usr -o $(PREFIX) = /usr/local ]; then echo /etc; else echo $(PREFIX)/etc; fi) | ||||
| SYSCONFDIR != if [ $(PREFIX) = /usr -o $(PREFIX) = /usr/local ]; then echo /etc; else echo $(PREFIX)/etc; fi | ||||
|  | ||||
| install: youtube-dl youtube-dl.1 youtube-dl.bash-completion youtube-dl.zsh youtube-dl.fish | ||||
| 	install -d $(DESTDIR)$(BINDIR) | ||||
| @@ -90,11 +90,11 @@ fish-completion: youtube-dl.fish | ||||
|  | ||||
| lazy-extractors: youtube_dl/extractor/lazy_extractors.py | ||||
|  | ||||
| _EXTRACTOR_FILES = $(shell find youtube_dl/extractor -iname '*.py' -and -not -iname 'lazy_extractors.py') | ||||
| _EXTRACTOR_FILES != find youtube_dl/extractor -iname '*.py' -and -not -iname 'lazy_extractors.py' | ||||
| youtube_dl/extractor/lazy_extractors.py: devscripts/make_lazy_extractors.py devscripts/lazy_load_template.py $(_EXTRACTOR_FILES) | ||||
| 	$(PYTHON) devscripts/make_lazy_extractors.py $@ | ||||
|  | ||||
| youtube-dl.tar.gz: youtube-dl README.md README.txt youtube-dl.1 youtube-dl.bash-completion youtube-dl.zsh youtube-dl.fish ChangeLog | ||||
| youtube-dl.tar.gz: youtube-dl README.md README.txt youtube-dl.1 youtube-dl.bash-completion youtube-dl.zsh youtube-dl.fish | ||||
| 	@tar -czf youtube-dl.tar.gz --transform "s|^|youtube-dl/|" --owner 0 --group 0 \ | ||||
| 		--exclude '*.DS_Store' \ | ||||
| 		--exclude '*.kate-swp' \ | ||||
| @@ -107,7 +107,7 @@ youtube-dl.tar.gz: youtube-dl README.md README.txt youtube-dl.1 youtube-dl.bash- | ||||
| 		--exclude 'docs/_build' \ | ||||
| 		-- \ | ||||
| 		bin devscripts test youtube_dl docs \ | ||||
| 		ChangeLog LICENSE README.md README.txt \ | ||||
| 		LICENSE README.md README.txt \ | ||||
| 		Makefile MANIFEST.in youtube-dl.1 youtube-dl.bash-completion \ | ||||
| 		youtube-dl.zsh youtube-dl.fish setup.py \ | ||||
| 		youtube-dl | ||||
|   | ||||
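
Both spellings of the `SYSCONFDIR` assignment in the Makefile hunk above (`$(shell ...)` and `!=`) run the same shell test. As a rough sketch, the rule it encodes can be written in Python; `sysconfdir` is a hypothetical name used only for this illustration and is not part of the build.

```python
def sysconfdir(prefix):
    """Mirror the Makefile rule: /etc for /usr and /usr/local, else PREFIX/etc."""
    if prefix in ('/usr', '/usr/local'):
        return '/etc'
    return prefix + '/etc'
```

For example, a prefix of `/usr/local` maps to `/etc`, while any other prefix keeps its own `etc` subdirectory.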
							
								
								
									
										267
									
								
								README.md
									
									
									
									
									
								
							
							
						
						
									
									
									
									
									
									
								
							| @@ -29,7 +29,7 @@ Windows users can [download an .exe file](https://yt-dl.org/latest/youtube-dl.ex | ||||
|  | ||||
| You can also use pip: | ||||
|  | ||||
|     sudo -H pip install --upgrade youtube-dl | ||||
|     sudo pip install --upgrade youtube-dl | ||||
|      | ||||
| This command will update youtube-dl if you have already installed it. See the [pypi page](https://pypi.python.org/pypi/youtube_dl) for more information. | ||||
|  | ||||
| @@ -44,7 +44,11 @@ Or with [MacPorts](https://www.macports.org/): | ||||
| Alternatively, refer to the [developer instructions](#developer-instructions) for how to check out and work with the git repository. For further options, including PGP signatures, see the [youtube-dl Download Page](https://rg3.github.io/youtube-dl/download.html). | ||||
|  | ||||
| # DESCRIPTION | ||||
| **youtube-dl** is a command-line program to download videos from YouTube.com and a few more sites. It requires the Python interpreter, version 2.6, 2.7, or 3.2+, and it is not platform specific. It should work on your Unix box, on Windows or on Mac OS X. It is released to the public domain, which means you can modify it, redistribute it or use it however you like. | ||||
| **youtube-dl** is a command-line program to download videos from | ||||
| YouTube.com and a few more sites. It requires the Python interpreter, version | ||||
| 2.6, 2.7, or 3.2+, and it is not platform specific. It should work on | ||||
| your Unix box, on Windows or on Mac OS X. It is released to the public domain, | ||||
| which means you can modify it, redistribute it or use it however you like. | ||||
|  | ||||
|     youtube-dl [OPTIONS] URL [URL...] | ||||
|  | ||||
| @@ -80,9 +84,6 @@ Alternatively, refer to the [developer instructions](#developer-instructions) fo | ||||
|                                      configuration in ~/.config/youtube- | ||||
|                                      dl/config (%APPDATA%/youtube-dl/config.txt | ||||
|                                      on Windows) | ||||
|     --config-location PATH           Location of the configuration file; either | ||||
|                                      the path to the config or its containing | ||||
|                                      directory. | ||||
|     --flat-playlist                  Do not extract the videos of a playlist, | ||||
|                                      only list them. | ||||
|     --mark-watched                   Mark videos watched (YouTube only) | ||||
| @@ -97,13 +98,16 @@ Alternatively, refer to the [developer instructions](#developer-instructions) fo | ||||
|                                      string (--proxy "") for direct connection | ||||
|     --socket-timeout SECONDS         Time to wait before giving up, in seconds | ||||
|     --source-address IP              Client-side IP address to bind to | ||||
|                                      (experimental) | ||||
|     -4, --force-ipv4                 Make all connections via IPv4 | ||||
|                                      (experimental) | ||||
|     -6, --force-ipv6                 Make all connections via IPv6 | ||||
|                                      (experimental) | ||||
|     --geo-verification-proxy URL     Use this proxy to verify the IP address for | ||||
|                                      some geo-restricted sites. The default | ||||
|                                      proxy specified by --proxy (or none, if the | ||||
|                                      options is not present) is used for the | ||||
|                                      actual downloading. | ||||
|                                      actual downloading. (experimental) | ||||
|  | ||||
| ## Video Selection: | ||||
|     --playlist-start NUMBER          Playlist video to start at (default is 1) | ||||
| @@ -134,23 +138,23 @@ Alternatively, refer to the [developer instructions](#developer-instructions) fo | ||||
|                                      COUNT views | ||||
|     --max-views COUNT                Do not download any videos with more than | ||||
|                                      COUNT views | ||||
|     --match-filter FILTER            Generic video filter. Specify any key (see | ||||
|                                      help for -o for a list of available keys) | ||||
|                                      to match if the key is present, !key to | ||||
|                                      check if the key is not present, key > | ||||
|                                      NUMBER (like "comment_count > 12", also | ||||
|                                      works with >=, <, <=, !=, =) to compare | ||||
|                                      against a number, and & to require multiple | ||||
|                                      matches. Values which are not known are | ||||
|                                      excluded unless you put a question mark (?) | ||||
|                                      after the operator. For example, to only | ||||
|                                      match videos that have been liked more than | ||||
|                                      100 times and disliked less than 50 times | ||||
|                                      (or the dislike functionality is not | ||||
|                                      available at the given service), but who | ||||
|                                      also have a description, use --match-filter | ||||
|                                      "like_count > 100 & dislike_count <? 50 & | ||||
|                                      description" . | ||||
|     --match-filter FILTER            Generic video filter (experimental). | ||||
|                                      Specify any key (see help for -o for a list | ||||
|                                      of available keys) to match if the key is | ||||
|                                      present, !key to check if the key is not | ||||
|                                      present,key > NUMBER (like "comment_count > | ||||
|                                      12", also works with >=, <, <=, !=, =) to | ||||
|                                      compare against a number, and & to require | ||||
|                                      multiple matches. Values which are not | ||||
|                                      known are excluded unless you put a | ||||
|                                      question mark (?) after the operator.For | ||||
|                                      example, to only match videos that have | ||||
|                                      been liked more than 100 times and disliked | ||||
|                                      less than 50 times (or the dislike | ||||
|                                      functionality is not available at the given | ||||
|                                      service), but who also have a description, | ||||
|                                      use --match-filter "like_count > 100 & | ||||
|                                      dislike_count <? 50 & description" . | ||||
|     --no-playlist                    Download only the video, if the URL refers | ||||
|                                      to a video and a playlist. | ||||
|     --yes-playlist                   Download the playlist, if the URL refers to | ||||
| @@ -169,12 +173,7 @@ Alternatively, refer to the [developer instructions](#developer-instructions) fo | ||||
|     -R, --retries RETRIES            Number of retries (default is 10), or | ||||
|                                      "infinite". | ||||
|     --fragment-retries RETRIES       Number of retries for a fragment (default | ||||
|                                      is 10), or "infinite" (DASH and hlsnative | ||||
|                                      only) | ||||
|     --skip-unavailable-fragments     Skip unavailable fragments (DASH and | ||||
|                                      hlsnative only) | ||||
|     --abort-on-unavailable-fragment  Abort downloading when some fragment is not | ||||
|                                      available | ||||
|                                      is 10), or "infinite" (DASH only) | ||||
|     --buffer-size SIZE               Size of download buffer (e.g. 1024 or 16K) | ||||
|                                      (default is 1024) | ||||
|     --no-resize-buffer               Do not automatically adjust the buffer | ||||
| @@ -182,9 +181,8 @@ Alternatively, refer to the [developer instructions](#developer-instructions) fo | ||||
|                                      automatically resized from an initial value | ||||
|                                      of SIZE. | ||||
|     --playlist-reverse               Download playlist videos in reverse order | ||||
|     --playlist-random                Download playlist videos in random order | ||||
|     --xattr-set-filesize             Set file xattribute ytdl.filesize with | ||||
|                                      expected file size (experimental) | ||||
|                                      expected filesize (experimental) | ||||
|     --hls-prefer-native              Use the native HLS downloader instead of | ||||
|                                      ffmpeg | ||||
|     --hls-prefer-ffmpeg              Use ffmpeg instead of the native HLS | ||||
| @@ -203,14 +201,36 @@ Alternatively, refer to the [developer instructions](#developer-instructions) fo | ||||
|     -a, --batch-file FILE            File containing URLs to download ('-' for | ||||
|                                      stdin) | ||||
|     --id                             Use only video ID in file name | ||||
|     -o, --output TEMPLATE            Output filename template, see the "OUTPUT | ||||
|                                      TEMPLATE" for all the info | ||||
|     -o, --output TEMPLATE            Output filename template. Use %(title)s to | ||||
|                                      get the title, %(uploader)s for the | ||||
|                                      uploader name, %(uploader_id)s for the | ||||
|                                      uploader nickname if different, | ||||
|                                      %(autonumber)s to get an automatically | ||||
|                                      incremented number, %(ext)s for the | ||||
|                                      filename extension, %(format)s for the | ||||
|                                      format description (like "22 - 1280x720" or | ||||
|                                      "HD"), %(format_id)s for the unique id of | ||||
|                                      the format (like YouTube's itags: "137"), | ||||
|                                      %(upload_date)s for the upload date | ||||
|                                      (YYYYMMDD), %(extractor)s for the provider | ||||
|                                      (youtube, metacafe, etc), %(id)s for the | ||||
|                                      video id, %(playlist_title)s, | ||||
|                                      %(playlist_id)s, or %(playlist)s (=title if | ||||
|                                      present, ID otherwise) for the playlist the | ||||
|                                      video is in, %(playlist_index)s for the | ||||
|                                      position in the playlist. %(height)s and | ||||
|                                      %(width)s for the width and height of the | ||||
|                                      video format. %(resolution)s for a textual | ||||
|                                      description of the resolution of the video | ||||
|                                      format. %% for a literal percent. Use - to | ||||
|                                      output to stdout. Can also be used to | ||||
|                                      download to a different directory, for | ||||
|                                      example with -o '/my/downloads/%(uploader)s | ||||
|                                      /%(title)s-%(id)s.%(ext)s' . | ||||
|     --autonumber-size NUMBER         Specify the number of digits in | ||||
|                                      %(autonumber)s when it is present in output | ||||
|                                      filename template or --auto-number option | ||||
|                                      is given (default is 5) | ||||
|     --autonumber-start NUMBER        Specify the start value for %(autonumber)s | ||||
|                                      (default is 1) | ||||
|                                      is given | ||||
|     --restrict-filenames             Restrict filenames to only ASCII | ||||
|                                      characters, and avoid "&" and spaces in | ||||
|                                      filenames | ||||
| @@ -310,15 +330,7 @@ Alternatively, refer to the [developer instructions](#developer-instructions) fo | ||||
|                                      bidirectional text support. Requires bidiv | ||||
|                                      or fribidi executable in PATH | ||||
|     --sleep-interval SECONDS         Number of seconds to sleep before each | ||||
|                                      download when used alone or a lower bound | ||||
|                                      of a range for randomized sleep before each | ||||
|                                      download (minimum possible number of | ||||
|                                      seconds to sleep) when used along with | ||||
|                                      --max-sleep-interval. | ||||
|     --max-sleep-interval SECONDS     Upper bound of a range for randomized sleep | ||||
|                                      before each download (maximum possible | ||||
|                                      number of seconds to sleep). Must only be | ||||
|                                      used along with --min-sleep-interval. | ||||
|                                      download. | ||||
|  | ||||
| ## Video Format Options: | ||||
|     -f, --format FORMAT              Video format code, see the "FORMAT | ||||
| @@ -353,28 +365,17 @@ Alternatively, refer to the [developer instructions](#developer-instructions) fo | ||||
|     -u, --username USERNAME          Login with this account ID | ||||
|     -p, --password PASSWORD          Account password. If this option is left | ||||
|                                      out, youtube-dl will ask interactively. | ||||
|     -2, --twofactor TWOFACTOR        Two-factor authentication code | ||||
|     -2, --twofactor TWOFACTOR        Two-factor auth code | ||||
|     -n, --netrc                      Use .netrc authentication data | ||||
|     --video-password PASSWORD        Video password (vimeo, smotri, youku) | ||||
|  | ||||
| ## Adobe Pass Options: | ||||
|     --ap-mso MSO                     Adobe Pass multiple-system operator (TV | ||||
|                                      provider) identifier, use --ap-list-mso for | ||||
|                                      a list of available MSOs | ||||
|     --ap-username USERNAME           Multiple-system operator account login | ||||
|     --ap-password PASSWORD           Multiple-system operator account password. | ||||
|                                      If this option is left out, youtube-dl will | ||||
|                                      ask interactively. | ||||
|     --ap-list-mso                    List all supported multiple-system | ||||
|                                      operators | ||||
|  | ||||
| ## Post-processing Options: | ||||
|     -x, --extract-audio              Convert video files to audio-only files | ||||
|                                      (requires ffmpeg or avconv and ffprobe or | ||||
|                                      avprobe) | ||||
|     --audio-format FORMAT            Specify audio format: "best", "aac", | ||||
|                                      "vorbis", "mp3", "m4a", "opus", or "wav"; | ||||
|                                      "best" by default; No effect without -x | ||||
|                                      "best" by default | ||||
|     --audio-quality QUALITY          Specify ffmpeg/avconv audio quality, insert | ||||
|                                      a value between 0 (better) and 9 (worse) | ||||
|                                      for VBR or a specific bitrate like 128K | ||||
| @@ -427,35 +428,25 @@ You can configure youtube-dl by placing any supported command line option to a c | ||||
|  | ||||
| For example, with the following configuration file youtube-dl will always extract the audio, not copy the mtime, use a proxy and save all videos under `Movies` directory in your home directory: | ||||
| ``` | ||||
| # Lines starting with # are comments | ||||
|  | ||||
| # Always extract audio | ||||
| -x | ||||
|  | ||||
| # Do not copy the mtime | ||||
| --no-mtime | ||||
|  | ||||
| # Use this proxy | ||||
| --proxy 127.0.0.1:3128 | ||||
|  | ||||
| # Save all videos under Movies directory in your home directory | ||||
| -o ~/Movies/%(title)s.%(ext)s | ||||
| # Lines starting with # are comments | ||||
| ``` | ||||
|  | ||||
| Note that options in configuration file are just the same options aka switches used in regular command line calls thus there **must be no whitespace** after `-` or `--`, e.g. `-o` or `--proxy` but not `- o` or `-- proxy`. | ||||
|  | ||||
| You can use `--ignore-config` if you want to disable the configuration file for a particular youtube-dl run. | ||||
|  | ||||
| You can also use `--config-location` if you want to use custom configuration file for a particular youtube-dl run. | ||||
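As a rough illustration of how a configuration file like the one above maps onto ordinary command-line arguments, the sketch below splits the example shell-style with Python's `shlex` module. Treating the file this way is an assumption made for illustration, and `parse_config` is a hypothetical helper, not youtube-dl's own parser.

```python
import shlex


def parse_config(text):
    """Split configuration text into an argument list, dropping # comments."""
    # comments=True makes shlex discard everything from '#' to the end of the line.
    return shlex.split(text, comments=True)


EXAMPLE = """
# Always extract audio
-x

# Do not copy the mtime
--no-mtime

# Use this proxy
--proxy 127.0.0.1:3128

# Save all videos under Movies directory in your home directory
-o ~/Movies/%(title)s.%(ext)s
"""

print(parse_config(EXAMPLE))
```

Each option and its value come out as separate list items, exactly as they would on a command line. A line such as `- o` would yield the two unrelated tokens `-` and `o`, which is why no whitespace may follow `-` or `--`.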
|  | ||||
| ### Authentication with `.netrc` file | ||||
|  | ||||
| You may also want to configure automatic credentials storage for extractors that support authentication (by providing login and password with `--username` and `--password`) in order not to pass credentials as command line arguments on every youtube-dl execution and prevent tracking plain text passwords in the shell command history. You can achieve this using a [`.netrc` file](http://stackoverflow.com/tags/.netrc/info) on a per extractor basis. For that you will need to create a `.netrc` file in your `$HOME` and restrict permissions to read/write by only you: | ||||
| You may also want to configure automatic credentials storage for extractors that support authentication (by providing login and password with `--username` and `--password`) in order not to pass credentials as command line arguments on every youtube-dl execution and prevent tracking plain text passwords in the shell command history. You can achieve this using a [`.netrc` file](http://stackoverflow.com/tags/.netrc/info) on per extractor basis. For that you will need to create a `.netrc` file in your `$HOME` and restrict permissions to read/write by you only: | ||||
| ``` | ||||
| touch $HOME/.netrc | ||||
| chmod a-rwx,u+rw $HOME/.netrc | ||||
| ``` | ||||
| After that you can add credentials for an extractor in the following format, where *extractor* is the name of the extractor in lowercase: | ||||
| After that you can add credentials for extractor in the following format, where *extractor* is the name of extractor in lowercase: | ||||
| ``` | ||||
| machine <extractor> login <login> password <password> | ||||
| ``` | ||||
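To check that an entry in this format can be read back, one option is Python's standard `netrc` module. The sketch below is only an illustration: `read_credentials` is a hypothetical helper, the login and password are placeholder values, and a throwaway file is used instead of the real `$HOME/.netrc`.

```python
import netrc
import os
import tempfile


def read_credentials(netrc_path, extractor):
    """Return (login, password) for an extractor name, or None if there is no entry."""
    entry = netrc.netrc(netrc_path).authenticators(extractor)
    if entry is None:
        return None
    login, _account, password = entry
    return login, password


# Demonstrate with a temporary file holding one placeholder entry.
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "netrc")
    with open(path, "w") as handle:
        handle.write("machine youtube login example_login password example_password\n")
    print(read_credentials(path, "youtube"))
    print(read_credentials(path, "vimeo"))
```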
| @@ -551,13 +542,13 @@ Available for the media that is a track or a part of a music album: | ||||
|  - `disc_number`: Number of the disc or other physical medium the track belongs to | ||||
|  - `release_year`: Year (YYYY) when the album was released | ||||
|  | ||||
| Each aforementioned sequence when referenced in an output template will be replaced by the actual value corresponding to the sequence name. Note that some of the sequences are not guaranteed to be present since they depend on the metadata obtained by a particular extractor. Such sequences will be replaced with `NA`. | ||||
| Each aforementioned sequence when referenced in output template will be replaced by the actual value corresponding to the sequence name. Note that some of the sequences are not guaranteed to be present since they depend on the metadata obtained by particular extractor, such sequences will be replaced with `NA`. | ||||
|  | ||||
| For example for `-o %(title)s-%(id)s.%(ext)s` and an mp4 video with title `youtube-dl test video` and id `BaW_jenozKcj`, this will result in a `youtube-dl test video-BaW_jenozKcj.mp4` file created in the current directory. | ||||
| For example for `-o %(title)s-%(id)s.%(ext)s` and mp4 video with title `youtube-dl test video` and id `BaW_jenozKcj` this will result in a `youtube-dl test video-BaW_jenozKcj.mp4` file created in the current directory. | ||||
|  | ||||
| Output templates can also contain arbitrary hierarchical path, e.g. `-o '%(playlist)s/%(playlist_index)s - %(title)s.%(ext)s'` which will result in downloading each video in a directory corresponding to this path template. Any missing directory will be automatically created for you. | ||||
| Output template can also contain arbitrary hierarchical path, e.g. `-o '%(playlist)s/%(playlist_index)s - %(title)s.%(ext)s'` that will result in downloading each video in a directory corresponding to this path template. Any missing directory will be automatically created for you. | ||||
|  | ||||
| To use percent literals in an output template use `%%`. To output to stdout use `-o -`. | ||||
| To specify percent literal in output template use `%%`. To output to stdout use `-o -`. | ||||
|  | ||||
| The current default template is `%(title)s-%(id)s.%(ext)s`. | ||||
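The substitution described above can be approximated with Python's percent-formatting over a dictionary. This is a minimal sketch under that assumption: `render_template` is a hypothetical name, and the `info` dictionary stands in for the metadata an extractor would return.

```python
import collections


def render_template(template, info):
    """Expand %(name)s sequences; fields missing from info become 'NA'."""
    fields = collections.defaultdict(lambda: "NA", info)
    return template % fields


info = {"title": "youtube-dl test video", "id": "BaW_jenozKcj", "ext": "mp4"}

# The default template from the text.
print(render_template("%(title)s-%(id)s.%(ext)s", info))

# A missing field (uploader) and a literal percent written as %%.
print(render_template("%(uploader)s - %(title)s (100%%).%(ext)s", info))
```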
|  | ||||
| @@ -565,7 +556,7 @@ In some cases, you don't want special characters such as 中, spaces, or &, such | ||||
|  | ||||
| #### Output template and Windows batch files | ||||
|  | ||||
| If you are using an output template inside a Windows batch file then you must escape plain percent characters (`%`) by doubling, so that `-o "%(title)s-%(id)s.%(ext)s"` should become `-o "%%(title)s-%%(id)s.%%(ext)s"`. However you should not touch `%`'s that are not plain characters, e.g. environment variables for expansion should stay intact: `-o "C:\%HOMEPATH%\Desktop\%%(title)s.%%(ext)s"`. | ||||
| If you are using output template inside a Windows batch file then you must escape plain percent characters (`%`) by doubling, so that `-o "%(title)s-%(id)s.%(ext)s"` should become `-o "%%(title)s-%%(id)s.%%(ext)s"`. However you should not touch `%`'s that are not plain characters, e.g. environment variables for expansion should stay intact: `-o "C:\%HOMEPATH%\Desktop\%%(title)s.%%(ext)s"`. | ||||
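The doubling rule can be sketched as a small helper that escapes only the percent signs that open a `%(name)s` sequence and leaves `%VAR%` environment references alone. `escape_for_batch` is a hypothetical name, and the sketch deliberately ignores other plain percent characters (for example a literal `%%`), which would need separate handling.

```python
import re


def escape_for_batch(template):
    """Double each % that starts a %(name)s sequence; leave %VAR% untouched."""
    # The lookahead matches a percent sign only when it is followed by "(".
    return re.sub(r"%(?=\()", "%%", template)


print(escape_for_batch("%(title)s-%(id)s.%(ext)s"))
print(escape_for_batch(r"C:\%HOMEPATH%\Desktop\%(title)s.%(ext)s"))
```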
|  | ||||
| #### Output template examples | ||||
|  | ||||
| @@ -598,7 +589,7 @@ $ youtube-dl -o - BaW_jenozKc | ||||
|  | ||||
| By default youtube-dl tries to download the best available quality, i.e. if you want the best quality you **don't need** to pass any special options, youtube-dl will guess it for you by **default**. | ||||
|  | ||||
| But sometimes you may want to download in a different format, for example when you are on a slow or intermittent connection. The key mechanism for achieving this is so-called *format selection* based on which you can explicitly specify desired format, select formats based on some criterion or criteria, setup precedence and much more. | ||||
| But sometimes you may want to download in a different format, for example when you are on a slow or intermittent connection. The key mechanism for achieving this is so called *format selection* based on which you can explicitly specify desired format, select formats based on some criterion or criteria, setup precedence and much more. | ||||
|  | ||||
| The general syntax for format selection is `--format FORMAT` or shorter `-f FORMAT` where `FORMAT` is a *selector expression*, i.e. an expression that describes format or formats you would like to download. | ||||
|  | ||||
| @@ -606,21 +597,21 @@ The general syntax for format selection is `--format FORMAT` or shorter `-f FORM | ||||
|  | ||||
| The simplest case is requesting a specific format, for example with `-f 22` you can download the format with format code equal to 22. You can get the list of available format codes for a particular video using `--list-formats` or `-F`. Note that these format codes are extractor-specific. | ||||
|  | ||||
| You can also use a file extension (currently `3gp`, `aac`, `flv`, `m4a`, `mp3`, `mp4`, `ogg`, `wav`, `webm` are supported) to download the best quality format of a particular file extension served as a single file, e.g. `-f webm` will download the best quality format with the `webm` extension served as a single file. | ||||
| You can also use a file extension (currently `3gp`, `aac`, `flv`, `m4a`, `mp3`, `mp4`, `ogg`, `wav`, `webm` are supported) to download best quality format of particular file extension served as a single file, e.g. `-f webm` will download best quality format with `webm` extension served as a single file. | ||||
|  | ||||
| You can also use special names to select particular edge case formats: | ||||
|  - `best`: Select the best quality format represented by a single file with video and audio. | ||||
|  - `worst`: Select the worst quality format represented by a single file with video and audio. | ||||
|  - `bestvideo`: Select the best quality video-only format (e.g. DASH video). May not be available. | ||||
|  - `worstvideo`: Select the worst quality video-only format. May not be available. | ||||
|  - `bestaudio`: Select the best quality audio only-format. May not be available. | ||||
|  - `worstaudio`: Select the worst quality audio only-format. May not be available. | ||||
| You can also use special names to select particular edge case format: | ||||
|  - `best`: Select best quality format represented by single file with video and audio | ||||
|  - `worst`: Select worst quality format represented by single file with video and audio | ||||
|  - `bestvideo`: Select best quality video only format (e.g. DASH video), may not be available | ||||
|  - `worstvideo`: Select worst quality video only format, may not be available | ||||
|  - `bestaudio`: Select best quality audio only format, may not be available | ||||
|  - `worstaudio`: Select worst quality audio only format, may not be available | ||||
|  | ||||
| For example, to download the worst quality video-only format you can use `-f worstvideo`. | ||||
| For example, to download worst quality video only format you can use `-f worstvideo`. | ||||
|  | ||||
| If you want to download multiple videos and they don't have the same formats available, you can specify the order of preference using slashes. Note that slash is left-associative, i.e. formats on the left hand side are preferred, for example `-f 22/17/18` will download format 22 if it's available, otherwise it will download format 17 if it's available, otherwise it will download format 18 if it's available, otherwise it will complain that no suitable formats are available for download. | ||||
|  | ||||
| If you want to download several formats of the same video use a comma as a separator, e.g. `-f 22,17,18` will download all these three formats, of course if they are available. Or a more sophisticated example combined with the precedence feature: `-f 136/137/mp4/bestvideo,140/m4a/bestaudio`. | ||||
| If you want to download several formats of the same video use comma as a separator, e.g. `-f 22,17,18` will download all these three formats, of course if they are available. Or more sophisticated example combined with precedence feature `-f 136/137/mp4/bestvideo,140/m4a/bestaudio`. | ||||
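The interplay of `/` (preference) and `,` (several downloads) can be pictured with a simplified resolver. This is only a sketch: `select_formats` is a hypothetical function that understands plain format codes, not names such as `mp4` or `bestvideo`, and skipping a comma-separated group with no available alternative is an assumption about the behaviour, not something the text states.

```python
def select_formats(spec, available):
    """Resolve a selector made only of format codes, '/' and ','."""
    chosen = []
    for group in spec.split(","):
        # Within a group, the leftmost available alternative wins.
        for code in group.split("/"):
            if code in available:
                chosen.append(code)
                break
    if not chosen:
        raise ValueError("no suitable formats are available")
    return chosen


print(select_formats("22/17/18", {"17", "18"}))
print(select_formats("22,17,18", {"22", "17", "18"}))
print(select_formats("136/137,140/141", {"137", "140"}))
```

With only formats 17 and 18 on offer, `22/17/18` settles on 17 alone, while `22,17,18` asks for every listed code that exists.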
|  | ||||
| You can also filter the video formats by putting a condition in brackets, as in `-f "best[height=720]"` (or `-f "[filesize>10M]"`). | ||||
|  | ||||
| @@ -639,18 +630,18 @@ Also filtering work for comparisons `=` (equals), `!=` (not equals), `^=` (begin | ||||
|  - `acodec`: Name of the audio codec in use | ||||
|  - `vcodec`: Name of the video codec in use | ||||
|  - `container`: Name of the container format | ||||
|  - `protocol`: The protocol that will be used for the actual download, lower-case (`http`, `https`, `rtsp`, `rtmp`, `rtmpe`, `mms`, `f4m`, `ism`, `m3u8`, or `m3u8_native`) | ||||
|  - `protocol`: The protocol that will be used for the actual download, lower-case. `http`, `https`, `rtsp`, `rtmp`, `rtmpe`, `m3u8`, or `m3u8_native` | ||||
|  - `format_id`: A short description of the format | ||||
|  | ||||
| Note that none of the aforementioned meta fields are guaranteed to be present since this solely depends on the metadata obtained by particular extractor, i.e. the metadata offered by the video hoster. | ||||
| Note that none of the aforementioned meta fields are guaranteed to be present since this solely depends on the metadata obtained by particular extractor, i.e. the metadata offered by video hoster. | ||||
|  | ||||
| Formats for which the value is not known are excluded unless you put a question mark (`?`) after the operator. You can combine format filters, so `-f "[height <=? 720][tbr>500]"` selects up to 720p videos (or videos where the height is not known) with a bitrate of at least 500 KBit/s. | ||||
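The effect of the question mark is easiest to see on concrete data. The sketch below models `-f "[height <=? 720][tbr>500]"` over a few made-up format dictionaries; `passes`, `matches` and the sample formats are hypothetical and only illustrate the rule that unknown values are excluded unless `?` follows the operator.

```python
import operator

OPERATORS = {
    "<": operator.lt, "<=": operator.le, ">": operator.gt,
    ">=": operator.ge, "=": operator.eq, "!=": operator.ne,
}


def passes(fmt, key, op, value, allow_unknown=False):
    """Apply one numeric condition; allow_unknown models the trailing '?'."""
    actual = fmt.get(key)
    if actual is None:
        return allow_unknown
    return OPERATORS[op](actual, value)


def matches(fmt):
    """Combine both conditions of "[height <=? 720][tbr>500]"."""
    return (passes(fmt, "height", "<=", 720, allow_unknown=True)
            and passes(fmt, "tbr", ">", 500))


formats = [
    {"format_id": "a", "height": 1080, "tbr": 2500},  # too tall
    {"format_id": "b", "height": 720, "tbr": 1200},   # satisfies both
    {"format_id": "c", "tbr": 800},                   # height unknown, allowed by '?'
    {"format_id": "d", "height": 480},                # tbr unknown, excluded
]

print([f["format_id"] for f in formats if matches(f)])
```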
|  | ||||
| You can merge the video and audio of two formats into a single file using `-f <video-format>+<audio-format>` (requires ffmpeg or avconv installed), for example `-f bestvideo+bestaudio` will download the best video-only format, the best audio-only format and mux them together with ffmpeg/avconv. | ||||
| You can merge the video and audio of two formats into a single file using `-f <video-format>+<audio-format>` (requires ffmpeg or avconv installed), for example `-f bestvideo+bestaudio` will download best video only format, best audio only format and mux them together with ffmpeg/avconv. | ||||
|  | ||||
| Format selectors can also be grouped using parentheses, for example if you want to download the best mp4 and webm formats with a height lower than 480 you can use `-f '(mp4,webm)[height<480]'`. | ||||
|  | ||||
| Since the end of April 2015 and version 2015.04.26, youtube-dl uses `-f bestvideo+bestaudio/best` as the default format selection (see [#5447](https://github.com/rg3/youtube-dl/issues/5447), [#5456](https://github.com/rg3/youtube-dl/issues/5456)). If ffmpeg or avconv are installed this results in downloading `bestvideo` and `bestaudio` separately and muxing them together into a single file giving the best overall quality available. Otherwise it falls back to `best` and results in downloading the best available quality served as a single file. `best` is also needed for videos that don't come from YouTube because they don't provide the audio and video in two different files. If you want to only download some DASH formats (for example if you are not interested in getting videos with a resolution higher than 1080p), you can add `-f bestvideo[height<=?1080]+bestaudio/best` to your configuration file. Note that if you use youtube-dl to stream to `stdout` (and most likely to pipe it to your media player then), i.e. you explicitly specify output template as `-o -`, youtube-dl still uses `-f best` format selection in order to start content delivery immediately to your player and not to wait until `bestvideo` and `bestaudio` are downloaded and muxed. | ||||
| Since the end of April 2015 and version 2015.04.26 youtube-dl uses `-f bestvideo+bestaudio/best` as default format selection (see [#5447](https://github.com/rg3/youtube-dl/issues/5447), [#5456](https://github.com/rg3/youtube-dl/issues/5456)). If ffmpeg or avconv are installed this results in downloading `bestvideo` and `bestaudio` separately and muxing them together into a single file giving the best overall quality available. Otherwise it falls back to `best` and results in downloading the best available quality served as a single file. `best` is also needed for videos that don't come from YouTube because they don't provide the audio and video in two different files. If you want to only download some DASH formats (for example if you are not interested in getting videos with a resolution higher than 1080p), you can add `-f bestvideo[height<=?1080]+bestaudio/best` to your configuration file. Note that if you use youtube-dl to stream to `stdout` (and most likely to pipe it to your media player then), i.e. you explicitly specify output template as `-o -`, youtube-dl still uses `-f best` format selection in order to start content delivery immediately to your player and not to wait until `bestvideo` and `bestaudio` are downloaded and muxed. | ||||
|  | ||||
| If you want to preserve the old format selection behavior (prior to youtube-dl 2015.04.26), i.e. you want to download the best available quality media served as a single file, you should explicitly specify your choice with `-f best`. You may want to add it to the [configuration file](#configuration) in order not to type it every time you run youtube-dl. | ||||
|  | ||||
| @@ -665,16 +656,12 @@ $ youtube-dl -f 'bestvideo[ext=mp4]+bestaudio[ext=m4a]/best[ext=mp4]/best' | ||||
| # Download best format available but not better than 480p | ||||
| $ youtube-dl -f 'bestvideo[height<=480]+bestaudio/best[height<=480]' | ||||
|  | ||||
| # Download best video only format but no bigger than 50 MB | ||||
| # Download best video only format but no bigger that 50 MB | ||||
| $ youtube-dl -f 'best[filesize<50M]' | ||||
|  | ||||
| # Download best format available via direct link over HTTP/HTTPS protocol | ||||
| $ youtube-dl -f '(bestvideo+bestaudio/best)[protocol^=http]' | ||||
|  | ||||
| # Download the best video format and the best audio format without merging them | ||||
| $ youtube-dl -f 'bestvideo,bestaudio' -o '%(title)s.f%(format_id)s.%(ext)s' | ||||
| ``` | ||||
| Note that in the last example, an output template is recommended as bestvideo and bestaudio may have the same file name. | ||||
|  | ||||
|  | ||||
| # VIDEO SELECTION | ||||
| @@ -729,7 +716,7 @@ Add a file exclusion for `youtube-dl.exe` in Windows Defender settings. | ||||
|  | ||||
| YouTube changed their playlist format in March 2014 and later on, so you'll need at least youtube-dl 2014.07.25 to download all YouTube videos. | ||||
|  | ||||
| If you have installed youtube-dl with a package manager, pip, setup.py or a tarball, please use that to update. Note that Ubuntu packages do not seem to get updated anymore. Since we are not affiliated with Ubuntu, there is little we can do. Feel free to [report bugs](https://bugs.launchpad.net/ubuntu/+source/youtube-dl/+filebug) to the [Ubuntu packaging people](mailto:ubuntu-motu@lists.ubuntu.com?subject=outdated%20version%20of%20youtube-dl) - all they have to do is update the package to a somewhat recent version. See above for a way to update. | ||||
| If you have installed youtube-dl with a package manager, pip, setup.py or a tarball, please use that to update. Note that Ubuntu packages do not seem to get updated anymore. Since we are not affiliated with Ubuntu, there is little we can do. Feel free to [report bugs](https://bugs.launchpad.net/ubuntu/+source/youtube-dl/+filebug) to the [Ubuntu packaging guys](mailto:ubuntu-motu@lists.ubuntu.com?subject=outdated%20version%20of%20youtube-dl) - all they have to do is update the package to a somewhat recent version. See above for a way to update. | ||||
|  | ||||
| ### I'm getting an error when trying to use output template: `error: using output template conflicts with using title, video ID or auto number` | ||||
|  | ||||
| @@ -745,7 +732,7 @@ Most people asking this question are not aware that youtube-dl now defaults to d | ||||
|  | ||||
| ### I get HTTP error 402 when trying to download a video. What's this? | ||||
|  | ||||
| Apparently YouTube requires you to pass a CAPTCHA test if you download too much. We're [considering to provide a way to let you solve the CAPTCHA](https://github.com/rg3/youtube-dl/issues/154), but at the moment, your best course of action is pointing a web browser to the youtube URL, solving the CAPTCHA, and restart youtube-dl. | ||||
| Apparently YouTube requires you to pass a CAPTCHA test if you download too much. We're [considering to provide a way to let you solve the CAPTCHA](https://github.com/rg3/youtube-dl/issues/154), but at the moment, your best course of action is pointing a webbrowser to the youtube URL, solving the CAPTCHA, and restart youtube-dl. | ||||
|  | ||||
| ### Do I need any other programs? | ||||
|  | ||||
| @@ -755,11 +742,11 @@ Videos or video formats streamed via RTMP protocol can only be downloaded when [ | ||||
|  | ||||
| ### I have downloaded a video but how can I play it? | ||||
|  | ||||
| Once the video is fully downloaded, use any video player, such as [mpv](https://mpv.io/), [vlc](http://www.videolan.org/) or [mplayer](http://www.mplayerhq.hu/). | ||||
| Once the video is fully downloaded, use any video player, such as [mpv](https://mpv.io/), [vlc](http://www.videolan.org) or [mplayer](http://www.mplayerhq.hu/). | ||||
|  | ||||
| ### I extracted a video URL with `-g`, but it does not play on another machine / in my web browser. | ||||
| ### I extracted a video URL with `-g`, but it does not play on another machine / in my webbrowser. | ||||
|  | ||||
| It depends a lot on the service. In many cases, requests for the video (to download/play it) must come from the same IP address and with the same cookies and/or HTTP headers. Use the `--cookies` option to write the required cookies into a file, and advise your downloader to read cookies from that file. Some sites also require a common user agent to be used, use `--dump-user-agent` to see the one in use by youtube-dl. You can also get necessary cookies and HTTP headers from JSON output obtained with `--dump-json`. | ||||
| It depends a lot on the service. In many cases, requests for the video (to download/play it) must come from the same IP address and with the same cookies.  Use the `--cookies` option to write the required cookies into a file, and advise your downloader to read cookies from that file. Some sites also require a common user agent to be used, use `--dump-user-agent` to see the one in use by youtube-dl. | ||||
|  | ||||
| It may be beneficial to use IPv6; in some cases, the restrictions are only applied to IPv4. Some services (sometimes only for a subset of videos) do not restrict the video URL by IP address, cookie, or user-agent, but these are the exception rather than the rule. | ||||
|  | ||||
| @@ -837,42 +824,10 @@ Either prepend `http://www.youtube.com/watch?v=` or separate the ID from the opt | ||||
|  | ||||
| ### How do I pass cookies to youtube-dl? | ||||
|  | ||||
| Use the `--cookies` option, for example `--cookies /path/to/cookies/file.txt`. | ||||
|  | ||||
| In order to extract cookies from browser use any conforming browser extension for exporting cookies. For example, [cookies.txt](https://chrome.google.com/webstore/detail/cookiestxt/njabckikapfpffapmjgojcnbfjonfjfg) (for Chrome) or [Export Cookies](https://addons.mozilla.org/en-US/firefox/addon/export-cookies/) (for Firefox). | ||||
|  | ||||
| Note that the cookies file must be in Mozilla/Netscape format and the first line of the cookies file must be either `# HTTP Cookie File` or `# Netscape HTTP Cookie File`. Make sure you have correct [newline format](https://en.wikipedia.org/wiki/Newline) in the cookies file and convert newlines if necessary to correspond with your OS, namely `CRLF` (`\r\n`) for Windows and `LF` (`\n`) for Unix and Unix-like systems (Linux, Mac OS, etc.). `HTTP Error 400: Bad Request` when using `--cookies` is a good sign of invalid newline format. | ||||
| Use the `--cookies` option, for example `--cookies /path/to/cookies/file.txt`. Note that the cookies file must be in Mozilla/Netscape format and the first line of the cookies file must be either `# HTTP Cookie File` or `# Netscape HTTP Cookie File`. Make sure you have correct [newline format](https://en.wikipedia.org/wiki/Newline) in the cookies file and convert newlines if necessary to correspond with your OS, namely `CRLF` (`\r\n`) for Windows, `LF` (`\n`) for Linux and `CR` (`\r`) for Mac OS. `HTTP Error 400: Bad Request` when using `--cookies` is a good sign of invalid newline format. | ||||
|  | ||||
| Passing cookies to youtube-dl is a good way to workaround login when a particular extractor does not implement it explicitly. Another use case is working around [CAPTCHA](https://en.wikipedia.org/wiki/CAPTCHA) some websites require you to solve in particular cases in order to get access (e.g. YouTube, CloudFlare). | ||||
|  | ||||
| ### How do I stream directly to media player? | ||||
|  | ||||
| You will first need to tell youtube-dl to stream media to stdout with `-o -`, and also tell your media player to read from stdin (it must be capable of this for streaming) and then pipe former to latter. For example, streaming to [vlc](http://www.videolan.org/) can be achieved with: | ||||
|  | ||||
|     youtube-dl -o - "http://www.youtube.com/watch?v=BaW_jenozKcj" | vlc - | ||||
|  | ||||
| ### How do I download only new videos from a playlist? | ||||
|  | ||||
| Use download-archive feature. With this feature you should initially download the complete playlist with `--download-archive /path/to/download/archive/file.txt` that will record identifiers of all the videos in a special file. Each subsequent run with the same `--download-archive` will download only new videos and skip all videos that have been downloaded before. Note that only successful downloads are recorded in the file. | ||||
|  | ||||
| For example, at first, | ||||
|  | ||||
|     youtube-dl --download-archive archive.txt "https://www.youtube.com/playlist?list=PLwiyx1dc3P2JR9N8gQaQN_BCvlSlap7re" | ||||
|  | ||||
| will download the complete `PLwiyx1dc3P2JR9N8gQaQN_BCvlSlap7re` playlist and create a file `archive.txt`. Each subsequent run will only download new videos if any: | ||||
|  | ||||
|     youtube-dl --download-archive archive.txt "https://www.youtube.com/playlist?list=PLwiyx1dc3P2JR9N8gQaQN_BCvlSlap7re" | ||||
|  | ||||
| ### Should I add `--hls-prefer-native` into my config? | ||||
|  | ||||
| When youtube-dl detects an HLS video, it can download it either with the built-in downloader or ffmpeg. Since many HLS streams are slightly invalid and ffmpeg/youtube-dl each handle some invalid cases better than the other, there is an option to switch the downloader if needed. | ||||
|  | ||||
| When youtube-dl knows that one particular downloader works better for a given website, that downloader will be picked. Otherwise, youtube-dl will pick the best downloader for general compatibility, which at the moment happens to be ffmpeg. This choice may change in future versions of youtube-dl, with improvements of the built-in downloader and/or ffmpeg. | ||||
|  | ||||
| In particular, the generic extractor (used when your website is not in the [list of supported sites by youtube-dl](http://rg3.github.io/youtube-dl/supportedsites.html)) cannot mandate one specific downloader. | ||||
|  | ||||
| If you put either `--hls-prefer-native` or `--hls-prefer-ffmpeg` into your configuration, a different subset of videos will fail to download correctly. Instead, it is much better to [file an issue](https://yt-dl.org/bug) or a pull request which details why the native or the ffmpeg HLS downloader is a better choice for your use case. | ||||
|  | ||||
| ### Can you add support for this anime video site, or site which shows current movies for free? | ||||
|  | ||||
| As a matter of policy (as well as legality), youtube-dl does not include support for services that specialize in infringing copyright. As a rule of thumb, if you cannot easily find a video that the service is quite obviously allowed to distribute (i.e. that has been uploaded by the creator, the creator's distributor, or is published under a free license), the service is probably unfit for inclusion to youtube-dl. | ||||
| @@ -903,7 +858,7 @@ If you want to find out whether a given URL is supported, simply call youtube-dl | ||||
|  | ||||
| # Why do I need to go through that much red tape when filing bugs? | ||||
|  | ||||
| Before we had the issue template, despite our extensive [bug reporting instructions](#bugs), about 80% of the issue reports we got were useless, for instance because people used ancient versions hundreds of releases old, because of simple syntactic errors (not in youtube-dl but in general shell usage), because the problem was already reported multiple times before, because people did not actually read an error message, even if it said "please install ffmpeg", because people did not mention the URL they were trying to download and many more simple, easy-to-avoid problems, many of whom were totally unrelated to youtube-dl. | ||||
| Before we had the issue template, despite our extensive [bug reporting instructions](#bugs), about 80% of the issue reports we got were useless, for instance because people used ancient versions hundreds of releases old, because of simple syntactic errors (not in youtube-dl but in general shell usage), because the problem was alrady reported multiple times before, because people did not actually read an error message, even if it said "please install ffmpeg", because people did not mention the URL they were trying to download and many more simple, easy-to-avoid problems, many of whom were totally unrelated to youtube-dl. | ||||
|  | ||||
| youtube-dl is an open-source project manned by too few volunteers, so we'd rather spend time fixing bugs where we are certain none of those simple problems apply, and where we can be reasonably confident to be able to reproduce the issue without asking the reporter repeatedly. As such, the output of `youtube-dl -v YOUR_URL_HERE` is really all that's required to file an issue. The issue template also guides you through some basic steps you can do, such as checking that your version of youtube-dl is current. | ||||
|  | ||||
| @@ -924,16 +879,16 @@ To run the test, simply invoke your favorite test runner, or execute a test file | ||||
| If you want to create a build of youtube-dl yourself, you'll need | ||||
|  | ||||
| * python | ||||
| * make (only GNU make is supported) | ||||
| * make (both GNU make and BSD make are supported) | ||||
| * pandoc | ||||
| * zip | ||||
| * nosetests | ||||
|  | ||||
| ### Adding support for a new site | ||||
|  | ||||
| If you want to add support for a new site, first of all **make sure** this site is **not dedicated to [copyright infringement](README.md#can-you-add-support-for-this-anime-video-site-or-site-which-shows-current-movies-for-free)**. youtube-dl does **not support** such sites thus pull requests adding support for them **will be rejected**. | ||||
| If you want to add support for a new site, first of all **make sure** this site is **not dedicated to [copyright infringement](#can-you-add-support-for-this-anime-video-site-or-site-which-shows-current-movies-for-free)**. youtube-dl does **not support** such sites thus pull requests adding support for them **will be rejected**. | ||||
|  | ||||
| After you have ensured this site is distributing its content legally, you can follow this quick list (assuming your service is called `yourextractor`): | ||||
| After you have ensured this site is distributing it's content legally, you can follow this quick list (assuming your service is called `yourextractor`): | ||||
|  | ||||
| 1. [Fork this repository](https://github.com/rg3/youtube-dl/fork) | ||||
| 2. Check out the source code with: | ||||
| @@ -963,7 +918,7 @@ After you have ensured this site is distributing its content legally, you can fo | ||||
|                 'id': '42', | ||||
|                 'ext': 'mp4', | ||||
|                 'title': 'Video title goes here', | ||||
|                 'thumbnail': r're:^https?://.*\.jpg$', | ||||
|                 'thumbnail': 're:^https?://.*\.jpg$', | ||||
|                 # TODO more properties, either as: | ||||
|                 # * A value | ||||
|                 # * MD5 checksum; start the string with md5: | ||||
| @@ -1006,19 +961,19 @@ In any case, thank you very much for your contributions! | ||||
|  | ||||
| This section introduces guidelines for writing idiomatic, robust and future-proof extractor code. | ||||
|  | ||||
| Extractors are very fragile by nature since they depend on the layout of the source data provided by 3rd party media hosters out of your control and this layout tends to change. As an extractor implementer your task is not only to write code that will extract media links and metadata correctly but also to minimize dependency on the source's layout and even to make the code foresee potential future changes and be ready for that. This is important because it will allow the extractor not to break on minor layout changes thus keeping old youtube-dl versions working. Even though this breakage issue is easily fixed by emitting a new version of youtube-dl with a fix incorporated, all the previous versions become broken in all repositories and distros' packages that may not be so prompt in fetching the update from us. Needless to say, some non rolling release distros may never receive an update at all. | ||||
| Extractors are very fragile by nature since they depend on the layout of the source data provided by 3rd party media hoster out of your control and this layout tend to change. As an extractor implementer your task is not only to write code that will extract media links and metadata correctly but also to minimize code dependency on source's layout changes and even to make the code foresee potential future changes and be ready for that. This is important because it will allow extractor not to break on minor layout changes thus keeping old youtube-dl versions working. Even though this breakage issue is easily fixed by emitting a new version of youtube-dl with fix incorporated all the previous version become broken in all repositories and distros' packages that may not be so prompt in fetching the update from us. Needless to say some may never receive an update at all that is possible for non rolling release distros. | ||||
|  | ||||
| ### Mandatory and optional metafields | ||||
|  | ||||
| For extraction to work youtube-dl relies on metadata your extractor extracts and provides to youtube-dl expressed by an [information dictionary](https://github.com/rg3/youtube-dl/blob/master/youtube_dl/extractor/common.py#L75-L257) or simply *info dict*. Only the following meta fields in the *info dict* are considered mandatory for a successful extraction process by youtube-dl: | ||||
| For extraction to work youtube-dl relies on metadata your extractor extracts and provides to youtube-dl expressed by [information dictionary](https://github.com/rg3/youtube-dl/blob/master/youtube_dl/extractor/common.py#L75-L257) or simply *info dict*. Only the following meta fields in *info dict* are considered mandatory for successful extraction process by youtube-dl: | ||||
|  | ||||
|  - `id` (media identifier) | ||||
|  - `title` (media title) | ||||
|  - `url` (media download URL) or `formats` | ||||
|  | ||||
| In fact only the last option is technically mandatory (i.e. if you can't figure out the download location of the media the extraction does not make any sense). But by convention youtube-dl also treats `id` and `title` as mandatory. Thus the aforementioned metafields are the critical data that the extraction does not make any sense without and if any of them fail to be extracted then the extractor is considered completely broken. | ||||
| In fact only the last option is technically mandatory (i.e. if you can't figure out the download location of the media the extraction does not make any sense). But by convention youtube-dl also treats `id` and `title` to be mandatory. Thus aforementioned metafields are the critical data the extraction does not make any sense without and if any of them fail to be extracted then extractor is considered completely broken. | ||||
|  | ||||
| [Any field](https://github.com/rg3/youtube-dl/blob/master/youtube_dl/extractor/common.py#L149-L257) apart from the aforementioned ones are considered **optional**. That means that extraction should be **tolerant** to situations when sources for these fields can potentially be unavailable (even if they are always available at the moment) and **future-proof** in order not to break the extraction of general purpose mandatory fields. | ||||
| [Any field](https://github.com/rg3/youtube-dl/blob/master/youtube_dl/extractor/common.py#L149-L257) apart from the aforementioned ones are considered **optional**. That means that extraction should be **tolerate** to situations when sources for these fields can potentially be unavailable (even if they are always available at the moment) and **future-proof** in order not to break the extraction of general purpose mandatory fields. | ||||
|  | ||||
| #### Example | ||||
|  | ||||
| @@ -1038,7 +993,7 @@ Assume at this point `meta`'s layout is: | ||||
| } | ||||
| ``` | ||||
|  | ||||
| Assume you want to extract `summary` and put it into the resulting info dict as `description`. Since `description` is an optional meta field you should be ready that this key may be missing from the `meta` dict, so that you should extract it like: | ||||
| Assume you want to extract `summary` and put into resulting info dict as `description`. Since `description` is optional metafield you should be ready that this key may be missing from the `meta` dict, so that you should extract it like: | ||||
|  | ||||
| ```python | ||||
| description = meta.get('summary')  # correct | ||||
| @@ -1050,7 +1005,7 @@ and not like: | ||||
| description = meta['summary']  # incorrect | ||||
| ``` | ||||
|  | ||||
| The latter will break extraction process with `KeyError` if `summary` disappears from `meta` at some later time but with the former approach extraction will just go ahead with `description` set to `None` which is perfectly fine (remember `None` is equivalent to the absence of data). | ||||
| The latter will break extraction process with `KeyError` if `summary` disappears from `meta` at some time later but with former approach extraction will just go ahead with `description` set to `None` that is perfectly fine (remember `None` is equivalent for absence of data).  | ||||
|  | ||||
| Similarly, you should pass `fatal=False` when extracting optional data from a webpage with `_search_regex`, `_html_search_regex` or similar methods, for instance: | ||||
|  | ||||
| @@ -1070,21 +1025,21 @@ description = self._search_regex( | ||||
|     webpage, 'description', default=None) | ||||
| ``` | ||||
|  | ||||
| On failure this code will silently continue the extraction with `description` set to `None`. That is useful for metafields that may or may not be present. | ||||
| On failure this code will silently continue the extraction with `description` set to `None`. That is useful for metafields that are known to may or may not be present. | ||||
|   | ||||
| ### Provide fallbacks | ||||
|  | ||||
| When extracting metadata try to do so from multiple sources. For example if `title` is present in several places, try extracting from at least some of them. This makes it more future-proof in case some of the sources become unavailable. | ||||
| When extracting metadata try to provide several scenarios for that. For example if `title` is present in several places/sources try extracting from at least some of them. This would make it more future-proof in case some of the sources became unavailable. | ||||
|  | ||||
| #### Example | ||||
|  | ||||
| Say `meta` from the previous example has a `title` and you are about to extract it. Since `title` is a mandatory meta field you should end up with something like: | ||||
| Say `meta` from previous example has a `title` and you are about to extract it. Since `title` is mandatory meta field you should end up with something like: | ||||
|  | ||||
| ```python | ||||
| title = meta['title'] | ||||
| ``` | ||||
|  | ||||
| If `title` disappears from `meta` in future due to some changes on the hoster's side the extraction would fail since `title` is mandatory. That's expected. | ||||
| If `title` disappeares from `meta` in future due to some changes on hoster's side the extraction would fail since `title` is mandatory. That's expected. | ||||
|  | ||||
| Assume that you have another source you can extract `title` from, for example the `og:title` HTML meta of a `webpage`. In this case you can provide a fallback scenario: | ||||
|  | ||||
| @@ -1121,7 +1076,7 @@ title = self._search_regex( | ||||
|     webpage, 'title', group='title') | ||||
| ``` | ||||
|  | ||||
| Note how you tolerate potential changes in the `style` attribute's value or switch from using double quotes to single for `class` attribute:  | ||||
| Note how you tolerate potential changes in `style` attribute's value or switch from using double quotes to single for `class` attribute:  | ||||
|  | ||||
| The code definitely should not look like: | ||||
|  | ||||
| @@ -1150,7 +1105,7 @@ with youtube_dl.YoutubeDL(ydl_opts) as ydl: | ||||
|     ydl.download(['http://www.youtube.com/watch?v=BaW_jenozKc']) | ||||
| ``` | ||||
|  | ||||
| Most likely, you'll want to use various options. For a list of options available, have a look at [`youtube_dl/YoutubeDL.py`](https://github.com/rg3/youtube-dl/blob/master/youtube_dl/YoutubeDL.py#L129-L279). For a start, if you want to intercept youtube-dl's output, set a `logger` object. | ||||
| Most likely, you'll want to use various options. For a list of options available, have a look at [`youtube_dl/YoutubeDL.py`](https://github.com/rg3/youtube-dl/blob/master/youtube_dl/YoutubeDL.py#L128-L278). For a start, if you want to intercept youtube-dl's output, set a `logger` object. | ||||
|  | ||||
| Here's a more complete example of a program that outputs only errors (and a short message after the download is finished), and downloads/converts the video to an mp3 file: | ||||
|  | ||||
| @@ -1191,7 +1146,7 @@ with youtube_dl.YoutubeDL(ydl_opts) as ydl: | ||||
|  | ||||
| # BUGS | ||||
|  | ||||
| Bugs and suggestions should be reported at: <https://github.com/rg3/youtube-dl/issues>. Unless you were prompted to or there is another pertinent reason (e.g. GitHub fails to accept the bug report), please do not send bug reports via personal email. For discussions, join us in the IRC channel [#youtube-dl](irc://chat.freenode.net/#youtube-dl) on freenode ([webchat](http://webchat.freenode.net/?randomnick=1&channels=youtube-dl)). | ||||
| Bugs and suggestions should be reported at: <https://github.com/rg3/youtube-dl/issues>. Unless you were prompted so or there is another pertinent reason (e.g. GitHub fails to accept the bug report), please do not send bug reports via personal email. For discussions, join us in the IRC channel [#youtube-dl](irc://chat.freenode.net/#youtube-dl) on freenode ([webchat](http://webchat.freenode.net/?randomnick=1&channels=youtube-dl)). | ||||
|  | ||||
| **Please include the full output of youtube-dl when run with `-v`**, i.e. **add** `-v` flag to **your command line**, copy the **whole** output and post it in the issue body wrapped in \`\`\` for better formatting. It should look similar to this: | ||||
| ``` | ||||
| @@ -1207,7 +1162,7 @@ $ youtube-dl -v <your command line> | ||||
| [debug] Proxy map: {} | ||||
| ... | ||||
| ``` | ||||
| **Do not post screenshots of verbose logs; only plain text is acceptable.** | ||||
| **Do not post screenshots of verbose log only plain text is acceptable.** | ||||
|  | ||||
| The output (including the first lines) contains important debugging information. Issues without the full output are often not reproducible and therefore do not get solved in short order, if ever. | ||||
|  | ||||
| @@ -1241,7 +1196,7 @@ Make sure that someone has not already opened the issue you're trying to open. S | ||||
|  | ||||
| ###  Why are existing options not enough? | ||||
|  | ||||
| Before requesting a new feature, please have a quick peek at [the list of supported options](https://github.com/rg3/youtube-dl/blob/master/README.md#options). Many feature requests are for features that actually exist already! Please, absolutely do show off your work in the issue report and detail how the existing similar options do *not* solve your problem. | ||||
| Before requesting a new feature, please have a quick peek at [the list of supported options](https://github.com/rg3/youtube-dl/blob/master/README.md#synopsis). Many feature requests are for features that actually exist already! Please, absolutely do show off your work in the issue report and detail how the existing similar options do *not* solve your problem. | ||||
|  | ||||
| ###  Is there enough context in your bug report? | ||||
|  | ||||
| @@ -1253,7 +1208,7 @@ We are then presented with a very complicated request when the original problem | ||||
|  | ||||
| Some of our users seem to think there is a limit of issues they can or should open. There is no limit of issues they can or should open. While it may seem appealing to be able to dump all your issues into one ticket, that means that someone who solves one of your issues cannot mark the issue as closed. Typically, reporting a bunch of issues leads to the ticket lingering since nobody wants to attack that behemoth, until someone mercifully splits the issue into multiple ones. | ||||
|  | ||||
| In particular, every site support request issue should only pertain to services at one site (generally under a common domain, but always using the same backend technology). Do not request support for vimeo user videos, White house podcasts, and Google Plus pages in the same issue. Also, make sure that you don't post bug reports alongside feature requests. As a rule of thumb, a feature request does not include outputs of youtube-dl that are not immediately related to the feature at hand. Do not post reports of a network error alongside the request for a new video service. | ||||
| In particular, every site support request issue should only pertain to services at one site (generally under a common domain, but always using the same backend technology). Do not request support for vimeo user videos, Whitehouse podcasts, and Google Plus pages in the same issue. Also, make sure that you don't post bug reports alongside feature requests. As a rule of thumb, a feature request does not include outputs of youtube-dl that are not immediately related to the feature at hand. Do not post reports of a network error alongside the request for a new video service. | ||||
|  | ||||
| ###  Is anyone going to need the feature? | ||||
|  | ||||
| @@ -1261,7 +1216,7 @@ Only post features that you (or an incapacitated friend you can personally talk | ||||
|  | ||||
| ###  Is your question about youtube-dl? | ||||
|  | ||||
| It may sound strange, but some bug reports we receive are completely unrelated to youtube-dl and relate to a different, or even the reporter's own, application. Please make sure that you are actually using youtube-dl. If you are using a UI for youtube-dl, report the bug to the maintainer of the actual application providing the UI. On the other hand, if your UI for youtube-dl fails in some way you believe is related to youtube-dl, by all means, go ahead and report the bug. | ||||
| It may sound strange, but some bug reports we receive are completely unrelated to youtube-dl and relate to a different or even the reporter's own application. Please make sure that you are actually using youtube-dl. If you are using a UI for youtube-dl, report the bug to the maintainer of the actual application providing the UI. On the other hand, if your UI for youtube-dl fails in some way you believe is related to youtube-dl, by all means, go ahead and report the bug. | ||||
|  | ||||
| # COPYRIGHT | ||||
|  | ||||
|   | ||||
| @@ -25,6 +25,5 @@ def build_completion(opt_parser): | ||||
|         filled_template = template.replace("{{flags}}", " ".join(opts_flag)) | ||||
|         f.write(filled_template) | ||||
|  | ||||
|  | ||||
| parser = youtube_dl.parseOpts()[0] | ||||
| build_completion(parser) | ||||
|   | ||||
| @@ -424,6 +424,8 @@ class BuildHTTPRequestHandler(compat_http_server.BaseHTTPRequestHandler): | ||||
|                     self.send_header('Content-Length', len(msg)) | ||||
|                     self.end_headers() | ||||
|                     self.wfile.write(msg) | ||||
|                 except HTTPError as e: | ||||
|                     self.send_response(e.code, str(e)) | ||||
|             else: | ||||
|                 self.send_response(500, 'Unknown build method "%s"' % action) | ||||
|         else: | ||||
|   | ||||
| @@ -2,13 +2,11 @@ | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| import base64 | ||||
| import io | ||||
| import json | ||||
| import mimetypes | ||||
| import netrc | ||||
| import optparse | ||||
| import os | ||||
| import re | ||||
| import sys | ||||
|  | ||||
| sys.path.insert(0, os.path.dirname(os.path.dirname(os.path.abspath(__file__)))) | ||||
| @@ -92,23 +90,16 @@ class GitHubReleaser(object): | ||||
|  | ||||
|  | ||||
| def main(): | ||||
|     parser = optparse.OptionParser(usage='%prog CHANGELOG VERSION BUILDPATH') | ||||
|     parser = optparse.OptionParser(usage='%prog VERSION BUILDPATH') | ||||
|     options, args = parser.parse_args() | ||||
|     if len(args) != 3: | ||||
|     if len(args) != 2: | ||||
|         parser.error('Expected a version and a build directory') | ||||
|  | ||||
|     changelog_file, version, build_path = args | ||||
|  | ||||
|     with io.open(changelog_file, encoding='utf-8') as inf: | ||||
|         changelog = inf.read() | ||||
|  | ||||
|     mobj = re.search(r'(?s)version %s\n{2}(.+?)\n{3}' % version, changelog) | ||||
|     body = mobj.group(1) if mobj else '' | ||||
|     version, build_path = args | ||||
|  | ||||
|     releaser = GitHubReleaser() | ||||
|  | ||||
|     new_release = releaser.create_release( | ||||
|         version, name='youtube-dl %s' % version, body=body) | ||||
|     new_release = releaser.create_release(version, name='youtube-dl %s' % version) | ||||
|     release_id = new_release['id'] | ||||
|  | ||||
|     for asset in os.listdir(build_path): | ||||
|   | ||||
| @@ -44,6 +44,5 @@ def build_completion(opt_parser): | ||||
|     with open(FISH_COMPLETION_FILE, 'w') as f: | ||||
|         f.write(filled_template) | ||||
|  | ||||
|  | ||||
| parser = youtube_dl.parseOpts()[0] | ||||
| build_completion(parser) | ||||
|   | ||||
| @@ -23,7 +23,6 @@ def openssl_encode(algo, key, iv): | ||||
|     out, _ = prog.communicate(secret_msg) | ||||
|     return out | ||||
|  | ||||
|  | ||||
| iv = key = [0x20, 0x15] + 14 * [0] | ||||
|  | ||||
| r = openssl_encode('aes-128-cbc', key, iv) | ||||
|   | ||||
| @@ -32,6 +32,5 @@ def main(): | ||||
|     with open('supportedsites.html', 'w', encoding='utf-8') as sitesf: | ||||
|         sitesf.write(template) | ||||
|  | ||||
|  | ||||
| if __name__ == '__main__': | ||||
|     main() | ||||
|   | ||||
| @@ -1,4 +1,4 @@ | ||||
| # coding: utf-8 | ||||
| # encoding: utf-8 | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| import re | ||||
|   | ||||
| @@ -28,6 +28,5 @@ def main(): | ||||
|     with io.open(outfile, 'w', encoding='utf-8') as outf: | ||||
|         outf.write(out) | ||||
|  | ||||
|  | ||||
| if __name__ == '__main__': | ||||
|     main() | ||||
|   | ||||
| @@ -59,7 +59,6 @@ def build_lazy_ie(ie, name): | ||||
|         s += make_valid_template.format(valid_url=ie._make_valid_url()) | ||||
|     return s | ||||
|  | ||||
|  | ||||
| # find the correct sorting and add the required base classes so that subclasses | ||||
| # can be correctly created | ||||
| classes = _ALL_CLASSES[:-1] | ||||
|   | ||||
| @@ -41,6 +41,5 @@ def main(): | ||||
|     with io.open(outfile, 'w', encoding='utf-8') as outf: | ||||
|         outf.write(out) | ||||
|  | ||||
|  | ||||
| if __name__ == '__main__': | ||||
|     main() | ||||
|   | ||||
| @@ -54,26 +54,21 @@ def filter_options(readme): | ||||
|  | ||||
|         if in_options: | ||||
|             if line.lstrip().startswith('-'): | ||||
|                 split = re.split(r'\s{2,}', line.lstrip()) | ||||
|                 # Description string may start with `-` as well. If there is | ||||
|                 # only one piece then it's a description bit not an option. | ||||
|                 if len(split) > 1: | ||||
|                     option, description = split | ||||
|                     split_option = option.split(' ') | ||||
|                 option, description = re.split(r'\s{2,}', line.lstrip()) | ||||
|                 split_option = option.split(' ') | ||||
|  | ||||
|                     if not split_option[-1].startswith('-'):  # metavar | ||||
|                         option = ' '.join(split_option[:-1] + ['*%s*' % split_option[-1]]) | ||||
|                 if not split_option[-1].startswith('-'):  # metavar | ||||
|                     option = ' '.join(split_option[:-1] + ['*%s*' % split_option[-1]]) | ||||
|  | ||||
|                     # Pandoc's definition_lists. See http://pandoc.org/README.html | ||||
|                     # for more information. | ||||
|                     ret += '\n%s\n:   %s\n' % (option, description) | ||||
|                     continue | ||||
|             ret += line.lstrip() + '\n' | ||||
|                 # Pandoc's definition_lists. See http://pandoc.org/README.html | ||||
|                 # for more information. | ||||
|                 ret += '\n%s\n:   %s\n' % (option, description) | ||||
|             else: | ||||
|                 ret += line.lstrip() + '\n' | ||||
|         else: | ||||
|             ret += line + '\n' | ||||
|  | ||||
|     return ret | ||||
|  | ||||
|  | ||||
| if __name__ == '__main__': | ||||
|     main() | ||||
|   | ||||
| @@ -60,9 +60,6 @@ if ! type pandoc >/dev/null 2>/dev/null; then echo 'ERROR: pandoc is missing'; e | ||||
| if ! python3 -c 'import rsa' 2>/dev/null; then echo 'ERROR: python3-rsa is missing'; exit 1; fi | ||||
| if ! python3 -c 'import wheel' 2>/dev/null; then echo 'ERROR: wheel is missing'; exit 1; fi | ||||
|  | ||||
| read -p "Is ChangeLog up to date? (y/n) " -n 1 | ||||
| if [[ ! $REPLY =~ ^[Yy]$ ]]; then exit 1; fi | ||||
|  | ||||
| /bin/echo -e "\n### First of all, testing..." | ||||
| make clean | ||||
| if $skip_tests ; then | ||||
| @@ -74,12 +71,9 @@ fi | ||||
| /bin/echo -e "\n### Changing version in version.py..." | ||||
| sed -i "s/__version__ = '.*'/__version__ = '$version'/" youtube_dl/version.py | ||||
|  | ||||
| /bin/echo -e "\n### Changing version in ChangeLog..." | ||||
| sed -i "s/<unreleased>/$version/" ChangeLog | ||||
|  | ||||
| /bin/echo -e "\n### Committing documentation, templates and youtube_dl/version.py..." | ||||
| make README.md CONTRIBUTING.md .github/ISSUE_TEMPLATE.md supportedsites | ||||
| git add README.md CONTRIBUTING.md .github/ISSUE_TEMPLATE.md docs/supportedsites.md youtube_dl/version.py ChangeLog | ||||
| git add README.md CONTRIBUTING.md .github/ISSUE_TEMPLATE.md docs/supportedsites.md youtube_dl/version.py | ||||
| git commit $gpg_sign_commits -m "release $version" | ||||
|  | ||||
| /bin/echo -e "\n### Now tagging, signing and pushing..." | ||||
| @@ -110,7 +104,7 @@ RELEASE_FILES="youtube-dl youtube-dl.exe youtube-dl-$version.tar.gz" | ||||
| for f in $RELEASE_FILES; do gpg --passphrase-repeat 5 --detach-sig "build/$version/$f"; done | ||||
|  | ||||
| ROOT=$(pwd) | ||||
| python devscripts/create-github-release.py ChangeLog $version "$ROOT/build/$version" | ||||
| python devscripts/create-github-release.py $version "$ROOT/build/$version" | ||||
|  | ||||
| ssh ytdl@yt-dl.org "sh html/update_latest.sh $version" | ||||
|  | ||||
|   | ||||
| @@ -1,19 +0,0 @@ | ||||
| #!/bin/bash | ||||
|  | ||||
| DOWNLOAD_TESTS="age_restriction|download|subtitles|write_annotations|iqiyi_sdk_interpreter" | ||||
|  | ||||
| test_set="" | ||||
|  | ||||
| case "$YTDL_TEST_SET" in | ||||
|     core) | ||||
|         test_set="-I test_($DOWNLOAD_TESTS)\.py" | ||||
|     ;; | ||||
|     download) | ||||
|         test_set="-I test_(?!$DOWNLOAD_TESTS).+\.py" | ||||
|     ;; | ||||
|     *) | ||||
|         break | ||||
|     ;; | ||||
| esac | ||||
|  | ||||
| nosetests test --verbose $test_set | ||||
| @@ -1,7 +1,6 @@ | ||||
| #!/usr/bin/env python | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| import itertools | ||||
| import json | ||||
| import os | ||||
| import re | ||||
| @@ -22,26 +21,21 @@ def format_size(bytes): | ||||
|  | ||||
| total_bytes = 0 | ||||
|  | ||||
| for page in itertools.count(1): | ||||
|     releases = json.loads(compat_urllib_request.urlopen( | ||||
|         'https://api.github.com/repos/rg3/youtube-dl/releases?page=%s' % page | ||||
|     ).read().decode('utf-8')) | ||||
| releases = json.loads(compat_urllib_request.urlopen( | ||||
|     'https://api.github.com/repos/rg3/youtube-dl/releases').read().decode('utf-8')) | ||||
|  | ||||
|     if not releases: | ||||
|         break | ||||
|  | ||||
|     for release in releases: | ||||
|         compat_print(release['name']) | ||||
|         for asset in release['assets']: | ||||
|             asset_name = asset['name'] | ||||
|             total_bytes += asset['download_count'] * asset['size'] | ||||
|             if all(not re.match(p, asset_name) for p in ( | ||||
|                     r'^youtube-dl$', | ||||
|                     r'^youtube-dl-\d{4}\.\d{2}\.\d{2}(?:\.\d+)?\.tar\.gz$', | ||||
|                     r'^youtube-dl\.exe$')): | ||||
|                 continue | ||||
|             compat_print( | ||||
|                 ' %s size: %s downloads: %d' | ||||
|                 % (asset_name, format_size(asset['size']), asset['download_count'])) | ||||
| for release in releases: | ||||
|     compat_print(release['name']) | ||||
|     for asset in release['assets']: | ||||
|         asset_name = asset['name'] | ||||
|         total_bytes += asset['download_count'] * asset['size'] | ||||
|         if all(not re.match(p, asset_name) for p in ( | ||||
|                 r'^youtube-dl$', | ||||
|                 r'^youtube-dl-\d{4}\.\d{2}\.\d{2}(?:\.\d+)?\.tar\.gz$', | ||||
|                 r'^youtube-dl\.exe$')): | ||||
|             continue | ||||
|         compat_print( | ||||
|             ' %s size: %s downloads: %d' | ||||
|             % (asset_name, format_size(asset['size']), asset['download_count'])) | ||||
|  | ||||
| compat_print('total downloads traffic: %s' % format_size(total_bytes)) | ||||
|   | ||||
| @@ -44,6 +44,5 @@ def build_completion(opt_parser): | ||||
|     with open(ZSH_COMPLETION_FILE, "w") as f: | ||||
|         f.write(template) | ||||
|  | ||||
|  | ||||
| parser = youtube_dl.parseOpts()[0] | ||||
| build_completion(parser) | ||||
|   | ||||
| @@ -1,4 +1,4 @@ | ||||
| # coding: utf-8 | ||||
| # -*- coding: utf-8 -*- | ||||
| # | ||||
| # youtube-dl documentation build configuration file, created by | ||||
| # sphinx-quickstart on Fri Mar 14 21:05:43 2014. | ||||
|   | ||||
| @@ -11,19 +11,14 @@ | ||||
|  - **4tube** | ||||
|  - **56.com** | ||||
|  - **5min** | ||||
|  - **6play** | ||||
|  - **8tracks** | ||||
|  - **91porn** | ||||
|  - **9c9media** | ||||
|  - **9c9media:stack** | ||||
|  - **9gag** | ||||
|  - **9now.com.au** | ||||
|  - **abc.net.au** | ||||
|  - **abc.net.au:iview** | ||||
|  - **Abc7News** | ||||
|  - **abcnews** | ||||
|  - **abcnews:video** | ||||
|  - **abcotvs**: ABC Owned Television Stations | ||||
|  - **abcotvs:clips** | ||||
|  - **AcademicEarth:Course** | ||||
|  - **acast** | ||||
|  - **acast:channel** | ||||
| @@ -34,14 +29,12 @@ | ||||
|  - **AdobeTVVideo** | ||||
|  - **AdultSwim** | ||||
|  - **aenetworks**: A+E Networks: A&E, Lifetime, History.com, FYI Network | ||||
|  - **afreecatv**: afreecatv.com | ||||
|  - **afreecatv:global**: afreecatv.com | ||||
|  - **AfreecaTV**: afreecatv.com | ||||
|  - **Aftonbladet** | ||||
|  - **AirMozilla** | ||||
|  - **AlJazeera** | ||||
|  - **Allocine** | ||||
|  - **AlphaPorno** | ||||
|  - **AMCNetworks** | ||||
|  - **anderetijden**: npo.nl and ntr.nl | ||||
|  - **AnimeOnDemand** | ||||
|  - **anitube.se** | ||||
|  - **AnySex** | ||||
| @@ -72,12 +65,6 @@ | ||||
|  - **audiomack** | ||||
|  - **audiomack:album** | ||||
|  - **auroravid**: AuroraVid | ||||
|  - **AWAAN** | ||||
|  - **awaan:live** | ||||
|  - **awaan:season** | ||||
|  - **awaan:video** | ||||
|  - **AZMedien**: AZ Medien videos | ||||
|  - **AZMedienPlaylist**: AZ Medien playlists | ||||
|  - **Azubu** | ||||
|  - **AzubuLive** | ||||
|  - **BaiduVideo**: 百度视频 | ||||
| @@ -85,17 +72,14 @@ | ||||
|  - **bambuser:channel** | ||||
|  - **Bandcamp** | ||||
|  - **Bandcamp:album** | ||||
|  - **bangumi.bilibili.com**: BiliBili番剧 | ||||
|  - **bbc**: BBC | ||||
|  - **bbc.co.uk**: BBC iPlayer | ||||
|  - **bbc.co.uk:article**: BBC articles | ||||
|  - **bbc.co.uk:iplayer:playlist** | ||||
|  - **bbc.co.uk:playlist** | ||||
|  - **Beam:live** | ||||
|  - **Beatport** | ||||
|  - **BeatportPro** | ||||
|  - **Beeg** | ||||
|  - **BehindKink** | ||||
|  - **BellMedia** | ||||
|  - **Bet** | ||||
|  - **Bigflix** | ||||
|  - **Bild**: Bild.de | ||||
| @@ -117,7 +101,6 @@ | ||||
|  - **bt:vestlendingen**: Bergens Tidende - Vestlendingen | ||||
|  - **BuzzFeed** | ||||
|  - **BYUtv** | ||||
|  - **BYUtvEvent** | ||||
|  - **Camdemy** | ||||
|  - **CamdemyFolder** | ||||
|  - **CamWithHer** | ||||
| @@ -126,23 +109,17 @@ | ||||
|  - **Canvas** | ||||
|  - **CarambaTV** | ||||
|  - **CarambaTVPage** | ||||
|  - **CartoonNetwork** | ||||
|  - **cbc.ca** | ||||
|  - **cbc.ca:player** | ||||
|  - **cbc.ca:watch** | ||||
|  - **cbc.ca:watch:video** | ||||
|  - **CBC** | ||||
|  - **CBCPlayer** | ||||
|  - **CBS** | ||||
|  - **CBSInteractive** | ||||
|  - **CBSLocal** | ||||
|  - **cbsnews**: CBS News | ||||
|  - **cbsnews:livevideo**: CBS News Live Videos | ||||
|  - **CBSNews**: CBS News | ||||
|  - **CBSNewsLiveVideo**: CBS News Live Videos | ||||
|  - **CBSSports** | ||||
|  - **CCMA** | ||||
|  - **CCTV**: 央视网 | ||||
|  - **CDA** | ||||
|  - **CeskaTelevize** | ||||
|  - **channel9**: Channel 9 | ||||
|  - **CharlieRose** | ||||
|  - **Chaturbate** | ||||
|  - **Chilloutzone** | ||||
|  - **chirbit** | ||||
| @@ -165,11 +142,8 @@ | ||||
|  - **CollegeRama** | ||||
|  - **ComCarCoff** | ||||
|  - **ComedyCentral** | ||||
|  - **ComedyCentralFullEpisodes** | ||||
|  - **ComedyCentralShortname** | ||||
|  - **ComedyCentralTV** | ||||
|  - **CondeNast**: Condé Nast media group: Allure, Architectural Digest, Ars Technica, Bon Appétit, Brides, Condé Nast, Condé Nast Traveler, Details, Epicurious, GQ, Glamour, Golf Digest, SELF, Teen Vogue, The New Yorker, Vanity Fair, Vogue, W Magazine, WIRED | ||||
|  - **Corus** | ||||
|  - **Coub** | ||||
|  - **Cracked** | ||||
|  - **Crackle** | ||||
| @@ -180,11 +154,10 @@ | ||||
|  - **CSNNE** | ||||
|  - **CSpan**: C-SPAN | ||||
|  - **CtsNews**: 華視新聞 | ||||
|  - **CTV** | ||||
|  - **CTVNews** | ||||
|  - **culturebox.francetvinfo.fr** | ||||
|  - **CultureUnplugged** | ||||
|  - **curiositystream** | ||||
|  - **curiositystream:collection** | ||||
|  - **CWTV** | ||||
|  - **DailyMail** | ||||
|  - **dailymotion** | ||||
| @@ -196,6 +169,10 @@ | ||||
|  - **daum.net:playlist** | ||||
|  - **daum.net:user** | ||||
|  - **DBTV** | ||||
|  - **DCN** | ||||
|  - **dcn:live** | ||||
|  - **dcn:season** | ||||
|  - **dcn:video** | ||||
|  - **DctpTv** | ||||
|  - **DeezerPlaylist** | ||||
|  - **defense.gouv.fr** | ||||
| @@ -204,8 +181,6 @@ | ||||
|  - **DigitallySpeaking** | ||||
|  - **Digiteka** | ||||
|  - **Discovery** | ||||
|  - **DiscoveryGo** | ||||
|  - **Disney** | ||||
|  - **Dotsub** | ||||
|  - **DouyuTV**: 斗鱼 | ||||
|  - **DPlay** | ||||
| @@ -214,8 +189,7 @@ | ||||
|  - **DRBonanza** | ||||
|  - **Dropbox** | ||||
|  - **DrTuber** | ||||
|  - **drtv** | ||||
|  - **drtv:live** | ||||
|  - **DRTV** | ||||
|  - **Dumpert** | ||||
|  - **dvtv**: http://video.aktualne.cz/ | ||||
|  - **dw** | ||||
| @@ -223,7 +197,6 @@ | ||||
|  - **EaglePlatform** | ||||
|  - **EbaumsWorld** | ||||
|  - **EchoMsk** | ||||
|  - **egghead:course**: egghead.io course | ||||
|  - **eHow** | ||||
|  - **Einthusan** | ||||
|  - **eitb.tv** | ||||
| @@ -237,22 +210,19 @@ | ||||
|  - **EroProfile** | ||||
|  - **Escapist** | ||||
|  - **ESPN** | ||||
|  - **ESPNArticle** | ||||
|  - **EsriVideo** | ||||
|  - **Europa** | ||||
|  - **EveryonesMixtape** | ||||
|  - **exfm**: ex.fm | ||||
|  - **ExpoTV** | ||||
|  - **ExtremeTube** | ||||
|  - **EyedoTV** | ||||
|  - **facebook** | ||||
|  - **FacebookPluginsVideo** | ||||
|  - **faz.net** | ||||
|  - **fc2** | ||||
|  - **fc2:embed** | ||||
|  - **Fczenit** | ||||
|  - **features.aol.com** | ||||
|  - **fernsehkritik.tv** | ||||
|  - **filmon** | ||||
|  - **filmon:channel** | ||||
|  - **Firstpost** | ||||
|  - **FiveTV** | ||||
|  - **Flickr** | ||||
| @@ -261,30 +231,28 @@ | ||||
|  - **FootyRoom** | ||||
|  - **Formula1** | ||||
|  - **FOX** | ||||
|  - **FOX9** | ||||
|  - **Foxgay** | ||||
|  - **foxnews**: Fox News and Fox Business Video | ||||
|  - **foxnews:article** | ||||
|  - **foxnews:insider** | ||||
|  - **FoxNews**: Fox News and Fox Business Video | ||||
|  - **FoxSports** | ||||
|  - **france2.fr:generation-quoi** | ||||
|  - **FranceCulture** | ||||
|  - **FranceCultureEmission** | ||||
|  - **FranceInter** | ||||
|  - **francetv**: France 2, 3, 4, 5 and Ô | ||||
|  - **francetvinfo.fr** | ||||
|  - **Freesound** | ||||
|  - **freespeech.org** | ||||
|  - **FreeVideo** | ||||
|  - **Funimation** | ||||
|  - **FunnyOrDie** | ||||
|  - **Fusion** | ||||
|  - **FXNetworks** | ||||
|  - **GameInformer** | ||||
|  - **Gamekings** | ||||
|  - **GameOne** | ||||
|  - **gameone:playlist** | ||||
|  - **Gamersyde** | ||||
|  - **GameSpot** | ||||
|  - **GameStar** | ||||
|  - **Gaskrank** | ||||
|  - **Gazeta** | ||||
|  - **GDCVault** | ||||
|  - **generic**: Generic downloader that works on some sites | ||||
| @@ -294,9 +262,9 @@ | ||||
|  - **Glide**: Glide mobile video messages (glide.me) | ||||
|  - **Globo** | ||||
|  - **GloboArticle** | ||||
|  - **Go** | ||||
|  - **GodTube** | ||||
|  - **GodTV** | ||||
|  - **GoldenMoustache** | ||||
|  - **Golem** | ||||
|  - **GoogleDrive** | ||||
|  - **Goshgay** | ||||
| @@ -304,18 +272,15 @@ | ||||
|  - **Groupon** | ||||
|  - **Hark** | ||||
|  - **HBO** | ||||
|  - **HBOEpisode** | ||||
|  - **HearThisAt** | ||||
|  - **Heise** | ||||
|  - **HellPorno** | ||||
|  - **Helsinki**: helsinki.fi | ||||
|  - **HentaiStigma** | ||||
|  - **hgtv.com:show** | ||||
|  - **HistoricFilms** | ||||
|  - **history:topic**: History.com Topic | ||||
|  - **hitbox** | ||||
|  - **hitbox:live** | ||||
|  - **HitRecord** | ||||
|  - **HornBunny** | ||||
|  - **HotNewHipHop** | ||||
|  - **HotStar** | ||||
| @@ -323,7 +288,6 @@ | ||||
|  - **HowStuffWorks** | ||||
|  - **HRTi** | ||||
|  - **HRTiPlaylist** | ||||
|  - **Huajiao**: 花椒直播 | ||||
|  - **HuffPost**: Huffington Post | ||||
|  - **Hypem** | ||||
|  - **Iconosquare** | ||||
| @@ -333,7 +297,6 @@ | ||||
|  - **Imgur** | ||||
|  - **ImgurAlbum** | ||||
|  - **Ina** | ||||
|  - **Inc** | ||||
|  - **Indavideo** | ||||
|  - **IndavideoEmbed** | ||||
|  - **InfoQ** | ||||
| @@ -343,14 +306,10 @@ | ||||
|  - **IPrima** | ||||
|  - **iqiyi**: 爱奇艺 | ||||
|  - **Ir90Tv** | ||||
|  - **ITV** | ||||
|  - **ivi**: ivi.ru | ||||
|  - **ivi:compilation**: ivi.ru compilations | ||||
|  - **ivideon**: Ivideon TV | ||||
|  - **Iwara** | ||||
|  - **Izlesene** | ||||
|  - **Jamendo** | ||||
|  - **JamendoAlbum** | ||||
|  - **JeuxVideo** | ||||
|  - **Jove** | ||||
|  - **jpopsuki.tv** | ||||
| @@ -363,7 +322,6 @@ | ||||
|  - **KarriereVideos** | ||||
|  - **keek** | ||||
|  - **KeezMovies** | ||||
|  - **Ketnet** | ||||
|  - **KhanAcademy** | ||||
|  - **KickStarter** | ||||
|  - **KonserthusetPlay** | ||||
| @@ -378,15 +336,12 @@ | ||||
|  - **kuwo:singer**: 酷我音乐 - 歌手 | ||||
|  - **kuwo:song**: 酷我音乐 | ||||
|  - **la7.it** | ||||
|  - **laola1tv** | ||||
|  - **laola1tv:embed** | ||||
|  - **LCI** | ||||
|  - **Laola1Tv** | ||||
|  - **Lcp** | ||||
|  - **LcpPlay** | ||||
|  - **Le**: 乐视网 | ||||
|  - **Learnr** | ||||
|  - **Lecture2Go** | ||||
|  - **LEGO** | ||||
|  - **Lemonde** | ||||
|  - **LePlaylist** | ||||
|  - **LetvCloud**: 乐视云 | ||||
| @@ -412,19 +367,14 @@ | ||||
|  - **mailru**: Видео@Mail.Ru | ||||
|  - **MakersChannel** | ||||
|  - **MakerTV** | ||||
|  - **mangomolo:live** | ||||
|  - **mangomolo:video** | ||||
|  - **MatchTV** | ||||
|  - **MDR**: MDR.DE and KiKA | ||||
|  - **media.ccc.de** | ||||
|  - **Meipai**: 美拍 | ||||
|  - **MelonVOD** | ||||
|  - **META** | ||||
|  - **metacafe** | ||||
|  - **Metacritic** | ||||
|  - **Mgoon** | ||||
|  - **MGTV**: 芒果TV | ||||
|  - **MiaoPai** | ||||
|  - **Minhateca** | ||||
|  - **MinistryGrid** | ||||
|  - **Minoto** | ||||
| @@ -446,14 +396,10 @@ | ||||
|  - **MovieClips** | ||||
|  - **MovieFap** | ||||
|  - **Moviezine** | ||||
|  - **MovingImage** | ||||
|  - **MPORA** | ||||
|  - **MSN** | ||||
|  - **mtg**: MTG services | ||||
|  - **mtv** | ||||
|  - **MTV** | ||||
|  - **mtv.de** | ||||
|  - **mtv81** | ||||
|  - **mtv:video** | ||||
|  - **mtvservices:embedded** | ||||
|  - **MuenchenTV**: münchen.tv | ||||
|  - **MusicPlayOn** | ||||
| @@ -469,13 +415,11 @@ | ||||
|  - **MyVidster** | ||||
|  - **n-tv.de** | ||||
|  - **natgeo** | ||||
|  - **natgeo:episodeguide** | ||||
|  - **natgeo:video** | ||||
|  - **natgeo:channel** | ||||
|  - **Naver** | ||||
|  - **NBA** | ||||
|  - **NBC** | ||||
|  - **NBCNews** | ||||
|  - **NBCOlympics** | ||||
|  - **NBCSports** | ||||
|  - **NBCSportsVPlayer** | ||||
|  - **ndr**: NDR.de - Norddeutscher Rundfunk | ||||
| @@ -495,23 +439,20 @@ | ||||
|  - **Newstube** | ||||
|  - **NextMedia**: 蘋果日報 | ||||
|  - **NextMediaActionNews**: 蘋果日報 - 動新聞 | ||||
|  - **NextTV**: 壹電視 | ||||
|  - **nfb**: National Film Board of Canada | ||||
|  - **nfl.com** | ||||
|  - **NhkVod** | ||||
|  - **nhl.com** | ||||
|  - **nhl.com:news**: NHL news | ||||
|  - **nhl.com:videocenter** | ||||
|  - **nhl.com:videocenter:category**: NHL videocenter category | ||||
|  - **nick.com** | ||||
|  - **nick.de** | ||||
|  - **nicknight** | ||||
|  - **niconico**: ニコニコ動画 | ||||
|  - **NiconicoPlaylist** | ||||
|  - **NineCNineMedia** | ||||
|  - **Nintendo** | ||||
|  - **njoy**: N-JOY | ||||
|  - **njoy:embed** | ||||
|  - **NobelPrize** | ||||
|  - **Noco** | ||||
|  - **Normalboots** | ||||
|  - **NosVideo** | ||||
| @@ -532,24 +473,17 @@ | ||||
|  - **NRKPlaylist** | ||||
|  - **NRKSkole**: NRK Skole | ||||
|  - **NRKTV**: NRK TV and NRK Radio | ||||
|  - **NRKTVDirekte**: NRK TV Direkte and NRK Radio Direkte | ||||
|  - **NRKTVEpisodes** | ||||
|  - **NRKTVSeries** | ||||
|  - **ntv.ru** | ||||
|  - **Nuvid** | ||||
|  - **NYTimes** | ||||
|  - **NYTimesArticle** | ||||
|  - **NZZ** | ||||
|  - **ocw.mit.edu** | ||||
|  - **OdaTV** | ||||
|  - **Odnoklassniki** | ||||
|  - **OktoberfestTV** | ||||
|  - **on.aol.com** | ||||
|  - **OnDemandKorea** | ||||
|  - **onet.pl** | ||||
|  - **onet.tv** | ||||
|  - **onet.tv:channel** | ||||
|  - **OnetMVP** | ||||
|  - **OnionStudios** | ||||
|  - **Ooyala** | ||||
|  - **OoyalaExternal** | ||||
| @@ -559,7 +493,6 @@ | ||||
|  - **orf:iptv**: iptv.ORF.at | ||||
|  - **orf:oe1**: Radio Österreich 1 | ||||
|  - **orf:tvthek**: ORF TVthek | ||||
|  - **PandaTV**: 熊猫TV | ||||
|  - **pandora.tv**: 판도라TV | ||||
|  - **parliamentlive.tv**: UK parliament videos | ||||
|  - **Patreon** | ||||
| @@ -571,10 +504,10 @@ | ||||
|  - **PhilharmonieDeParis**: Philharmonie de Paris | ||||
|  - **phoenix.de** | ||||
|  - **Photobucket** | ||||
|  - **Piksel** | ||||
|  - **Pinkbike** | ||||
|  - **Pladform** | ||||
|  - **play.fm** | ||||
|  - **played.to** | ||||
|  - **PlaysTV** | ||||
|  - **Playtvak**: Playtvak.cz, iDNES.cz and Lidovky.cz | ||||
|  - **Playvid** | ||||
| @@ -584,11 +517,7 @@ | ||||
|  - **plus.google**: Google Plus | ||||
|  - **pluzz.francetv.fr** | ||||
|  - **podomatic** | ||||
|  - **Pokemon** | ||||
|  - **PolskieRadio** | ||||
|  - **PolskieRadioCategory** | ||||
|  - **PornCom** | ||||
|  - **PornFlip** | ||||
|  - **PornHd** | ||||
|  - **PornHub**: PornHub and Thumbzilla | ||||
|  - **PornHubPlaylist** | ||||
| @@ -621,8 +550,6 @@ | ||||
|  - **RDS**: RDS.ca | ||||
|  - **RedTube** | ||||
|  - **RegioTV** | ||||
|  - **RENTV** | ||||
|  - **RENTVArticle** | ||||
|  - **Restudy** | ||||
|  - **Reuters** | ||||
|  - **ReverbNation** | ||||
| @@ -630,12 +557,10 @@ | ||||
|  - **revision3:embed** | ||||
|  - **RICE** | ||||
|  - **RingTV** | ||||
|  - **RMCDecouverte** | ||||
|  - **RockstarGames** | ||||
|  - **RoosterTeeth** | ||||
|  - **RottenTomatoes** | ||||
|  - **Roxwel** | ||||
|  - **Rozhlas** | ||||
|  - **RTBF** | ||||
|  - **rte**: Raidió Teilifís Éireann TV | ||||
|  - **rte:radio**: Raidió Teilifís Éireann radio | ||||
| @@ -670,16 +595,18 @@ | ||||
|  - **screen.yahoo:search**: Yahoo screen search | ||||
|  - **Screencast** | ||||
|  - **ScreencastOMatic** | ||||
|  - **scrippsnetworks:watch** | ||||
|  - **ScreenJunkies** | ||||
|  - **ScreenwaveMedia** | ||||
|  - **Seeker** | ||||
|  - **SenateISVP** | ||||
|  - **SendtoNews** | ||||
|  - **ServingSys** | ||||
|  - **Sexu** | ||||
|  - **Shahid** | ||||
|  - **Shared**: shared.sx | ||||
|  - **ShowRoomLive** | ||||
|  - **Shared**: shared.sx and vivo.sx | ||||
|  - **ShareSix** | ||||
|  - **Sina** | ||||
|  - **SixPlay** | ||||
|  - **skynewsarabia:article** | ||||
|  - **skynewsarabia:video** | ||||
|  - **SkySports** | ||||
| @@ -691,7 +618,6 @@ | ||||
|  - **smotri:user**: Smotri.com user videos | ||||
|  - **Snotr** | ||||
|  - **Sohu** | ||||
|  - **SonyLIV** | ||||
|  - **soundcloud** | ||||
|  - **soundcloud:playlist** | ||||
|  - **soundcloud:search**: Soundcloud search | ||||
| @@ -711,13 +637,14 @@ | ||||
|  - **Spiegeltv** | ||||
|  - **Spike** | ||||
|  - **Sport5** | ||||
|  - **SportBox** | ||||
|  - **SportBoxEmbed** | ||||
|  - **SportDeutschland** | ||||
|  - **Sportschau** | ||||
|  - **Sprout** | ||||
|  - **sr:mediathek**: Saarländischer Rundfunk | ||||
|  - **SRGSSR** | ||||
|  - **SRGSSRPlay**: srf.ch, rts.ch, rsi.ch, rtr.ch and swissinfo.ch play sites | ||||
|  - **SSA** | ||||
|  - **stanfordoc**: Stanford Open ClassRoom | ||||
|  - **Steam** | ||||
|  - **Stitcher** | ||||
| @@ -731,17 +658,16 @@ | ||||
|  - **SWRMediathek** | ||||
|  - **Syfy** | ||||
|  - **SztvHu** | ||||
|  - **t-online.de** | ||||
|  - **Tagesschau** | ||||
|  - **tagesschau:player** | ||||
|  - **Tapely** | ||||
|  - **Tass** | ||||
|  - **TBS** | ||||
|  - **TDSLifeway** | ||||
|  - **teachertube**: teachertube.com videos | ||||
|  - **teachertube:user:collection**: teachertube.com user and collection videos | ||||
|  - **TeachingChannel** | ||||
|  - **Teamcoco** | ||||
|  - **TeamFourStar** | ||||
|  - **TeamFour** | ||||
|  - **TechTalks** | ||||
|  - **techtv.mit.edu** | ||||
|  - **ted** | ||||
| @@ -750,22 +676,19 @@ | ||||
|  - **Telecinco**: telecinco.es, cuatro.com and mediaset.es | ||||
|  - **Telegraaf** | ||||
|  - **TeleMB** | ||||
|  - **TeleQuebec** | ||||
|  - **TeleTask** | ||||
|  - **Telewebion** | ||||
|  - **TF1** | ||||
|  - **TFO** | ||||
|  - **TheIntercept** | ||||
|  - **theoperaplatform** | ||||
|  - **ThePlatform** | ||||
|  - **ThePlatformFeed** | ||||
|  - **TheScene** | ||||
|  - **TheSixtyOne** | ||||
|  - **TheStar** | ||||
|  - **TheWeatherChannel** | ||||
|  - **ThisAmericanLife** | ||||
|  - **ThisAV** | ||||
|  - **ThisOldHouse** | ||||
|  - **THVideo** | ||||
|  - **THVideoPlaylist** | ||||
|  - **tinypic**: tinypic.com videos | ||||
|  - **tlc.de** | ||||
|  - **TMZ** | ||||
| @@ -779,7 +702,8 @@ | ||||
|  - **ToypicsUser**: Toypics user profile | ||||
|  - **TrailerAddict** (Currently broken) | ||||
|  - **Trilulilu** | ||||
|  - **TruTV** | ||||
|  - **trollvids** | ||||
|  - **TruTube** | ||||
|  - **Tube8** | ||||
|  - **TubiTv** | ||||
|  - **tudou** | ||||
| @@ -797,28 +721,20 @@ | ||||
|  - **TV2Article** | ||||
|  - **TV3** | ||||
|  - **TV4**: tv4.se and tv4play.se | ||||
|  - **TVA** | ||||
|  - **TVANouvelles** | ||||
|  - **TVANouvellesArticle** | ||||
|  - **TVC** | ||||
|  - **TVCArticle** | ||||
|  - **tvigle**: Интернет-телевидение Tvigle.ru | ||||
|  - **tvland.com** | ||||
|  - **TVNoe** | ||||
|  - **tvp**: Telewizja Polska | ||||
|  - **tvp:embed**: Telewizja Polska | ||||
|  - **tvp:series** | ||||
|  - **TVPlayer** | ||||
|  - **TVPlay**: TV3Play and related services | ||||
|  - **Tweakers** | ||||
|  - **twitch:chapter** | ||||
|  - **twitch:clips** | ||||
|  - **twitch:past_broadcasts** | ||||
|  - **twitch:profile** | ||||
|  - **twitch:stream** | ||||
|  - **twitch:video** | ||||
|  - **twitch:videos:all** | ||||
|  - **twitch:videos:highlights** | ||||
|  - **twitch:videos:past-broadcasts** | ||||
|  - **twitch:videos:uploads** | ||||
|  - **twitch:vod** | ||||
|  - **twitter** | ||||
|  - **twitter:amplify** | ||||
| @@ -826,14 +742,9 @@ | ||||
|  - **udemy** | ||||
|  - **udemy:course** | ||||
|  - **UDNEmbed**: 聯合影音 | ||||
|  - **UKTVPlay** | ||||
|  - **Unistra** | ||||
|  - **uol.com.br** | ||||
|  - **uplynk** | ||||
|  - **uplynk:preplay** | ||||
|  - **Urort**: NRK P3 Urørt | ||||
|  - **URPlay** | ||||
|  - **USANetwork** | ||||
|  - **USAToday** | ||||
|  - **ustream** | ||||
|  - **ustream:channel** | ||||
| @@ -849,13 +760,10 @@ | ||||
|  - **VevoPlaylist** | ||||
|  - **VGTV**: VGTV, BTTV, FTV, Aftenposten and Aftonbladet | ||||
|  - **vh1.com** | ||||
|  - **Viafree** | ||||
|  - **Vice** | ||||
|  - **Viceland** | ||||
|  - **ViceShow** | ||||
|  - **Vidbit** | ||||
|  - **Viddler** | ||||
|  - **Videa** | ||||
|  - **video.google:search**: Google Video search | ||||
|  - **video.mit.edu** | ||||
|  - **VideoDetective** | ||||
| @@ -865,7 +773,7 @@ | ||||
|  - **videomore:season** | ||||
|  - **videomore:video** | ||||
|  - **VideoPremium** | ||||
|  - **VideoPress** | ||||
|  - **VideoTt**: video.tt - Your True Tube (Currently broken) | ||||
|  - **videoweed**: VideoWeed | ||||
|  - **Vidio** | ||||
|  - **vidme** | ||||
| @@ -892,18 +800,11 @@ | ||||
|  - **Vimple**: Vimple - one-click video hosting | ||||
|  - **Vine** | ||||
|  - **vine:user** | ||||
|  - **Viu** | ||||
|  - **viu:ott** | ||||
|  - **viu:playlist** | ||||
|  - **Vivo**: vivo.sx | ||||
|  - **vk**: VK | ||||
|  - **vk:uservideos**: VK - User's Videos | ||||
|  - **vk:wallpost** | ||||
|  - **vlive** | ||||
|  - **vlive:channel** | ||||
|  - **Vodlocker** | ||||
|  - **VODPl** | ||||
|  - **VODPlatform** | ||||
|  - **VoiceRepublic** | ||||
|  - **VoxMedia** | ||||
|  - **Vporn** | ||||
| @@ -911,9 +812,6 @@ | ||||
|  - **VRT** | ||||
|  - **vube**: Vube.com | ||||
|  - **VuClip** | ||||
|  - **VVVVID** | ||||
|  - **VyboryMos** | ||||
|  - **Vzaar** | ||||
|  - **Walla** | ||||
|  - **washingtonpost** | ||||
|  - **washingtonpost:article** | ||||
| @@ -921,15 +819,13 @@ | ||||
|  - **WatchIndianPorn**: Watch Indian Porn | ||||
|  - **WDR** | ||||
|  - **wdr:mobile** | ||||
|  - **Webcaster** | ||||
|  - **WebcasterFeed** | ||||
|  - **WebOfStories** | ||||
|  - **WebOfStoriesPlaylist** | ||||
|  - **WeiqiTV**: WQTV | ||||
|  - **wholecloud**: WholeCloud | ||||
|  - **Wimp** | ||||
|  - **Wistia** | ||||
|  - **wnl**: npo.nl and ntr.nl | ||||
|  - **WNL** | ||||
|  - **WorldStarHipHop** | ||||
|  - **wrzuta.pl** | ||||
|  - **wrzuta.pl:playlist** | ||||
| @@ -983,4 +879,6 @@ | ||||
|  - **Zapiks** | ||||
|  - **ZDF** | ||||
|  - **ZDFChannel** | ||||
|  - **zingmp3**: mp3.zing.vn | ||||
|  - **zingmp3:album**: mp3.zing.vn albums | ||||
|  - **zingmp3:song**: mp3.zing.vn songs | ||||
|  - **ZippCast** | ||||
|   | ||||
							
								
								
									
										2
									
								
								setup.py
									
									
									
									
									
								
							
							
						
						
									
							| @@ -1,5 +1,5 @@ | ||||
| #!/usr/bin/env python | ||||
| # coding: utf-8 | ||||
| # -*- coding: utf-8 -*- | ||||
|  | ||||
| from __future__ import print_function | ||||
|  | ||||
|   | ||||
| @@ -48,9 +48,6 @@ class TestInfoExtractor(unittest.TestCase): | ||||
|         self.assertEqual(ie._og_search_property('foobar', html), 'Foo') | ||||
|         self.assertEqual(ie._og_search_property('test1', html), 'foo > < bar') | ||||
|         self.assertEqual(ie._og_search_property('test2', html), 'foo >//< bar') | ||||
|         self.assertEqual(ie._og_search_property(('test0', 'test1'), html), 'foo > < bar') | ||||
|         self.assertRaises(RegexNotFoundError, ie._og_search_property, 'test0', html, None, fatal=True) | ||||
|         self.assertRaises(RegexNotFoundError, ie._og_search_property, ('test0', 'test00'), html, None, fatal=True) | ||||
|  | ||||
|     def test_html_search_meta(self): | ||||
|         ie = self.ie | ||||
| @@ -84,6 +81,5 @@ class TestInfoExtractor(unittest.TestCase): | ||||
|         self.assertRaises(ExtractorError, self.ie._download_json, uri, None) | ||||
|         self.assertEqual(self.ie._download_json(uri, None, fatal=False), None) | ||||
|  | ||||
|  | ||||
| if __name__ == '__main__': | ||||
|     unittest.main() | ||||
|   | ||||
| @@ -1,5 +1,4 @@ | ||||
| #!/usr/bin/env python | ||||
| # coding: utf-8 | ||||
|  | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| @@ -541,10 +540,10 @@ class TestYoutubeDL(unittest.TestCase): | ||||
|         self.assertEqual(ydl._format_note({}), '') | ||||
|         assertRegexpMatches(self, ydl._format_note({ | ||||
|             'vbr': 10, | ||||
|         }), r'^\s*10k$') | ||||
|         }), '^\s*10k$') | ||||
|         assertRegexpMatches(self, ydl._format_note({ | ||||
|             'fps': 30, | ||||
|         }), r'^30fps$') | ||||
|         }), '^30fps$') | ||||
|  | ||||
|     def test_postprocessors(self): | ||||
|         filename = 'post-processor-testfile.mp4' | ||||
| @@ -606,9 +605,6 @@ class TestYoutubeDL(unittest.TestCase): | ||||
|             'extractor': 'TEST', | ||||
|             'duration': 30, | ||||
|             'filesize': 10 * 1024, | ||||
|             'playlist_id': '42', | ||||
|             'uploader': "變態妍字幕版 太妍 тест", | ||||
|             'creator': "тест ' 123 ' тест--", | ||||
|         } | ||||
|         second = { | ||||
|             'id': '2', | ||||
| @@ -618,8 +614,6 @@ class TestYoutubeDL(unittest.TestCase): | ||||
|             'duration': 10, | ||||
|             'description': 'foo', | ||||
|             'filesize': 5 * 1024, | ||||
|             'playlist_id': '43', | ||||
|             'uploader': "тест 123", | ||||
|         } | ||||
|         videos = [first, second] | ||||
|  | ||||
| @@ -656,30 +650,6 @@ class TestYoutubeDL(unittest.TestCase): | ||||
|         res = get_videos(f) | ||||
|         self.assertEqual(res, ['1']) | ||||
|  | ||||
|         f = match_filter_func('playlist_id = 42') | ||||
|         res = get_videos(f) | ||||
|         self.assertEqual(res, ['1']) | ||||
|  | ||||
|         f = match_filter_func('uploader = "變態妍字幕版 太妍 тест"') | ||||
|         res = get_videos(f) | ||||
|         self.assertEqual(res, ['1']) | ||||
|  | ||||
|         f = match_filter_func('uploader != "變態妍字幕版 太妍 тест"') | ||||
|         res = get_videos(f) | ||||
|         self.assertEqual(res, ['2']) | ||||
|  | ||||
|         f = match_filter_func('creator = "тест \' 123 \' тест--"') | ||||
|         res = get_videos(f) | ||||
|         self.assertEqual(res, ['1']) | ||||
|  | ||||
|         f = match_filter_func("creator = 'тест \\' 123 \\' тест--'") | ||||
|         res = get_videos(f) | ||||
|         self.assertEqual(res, ['1']) | ||||
|  | ||||
|         f = match_filter_func(r"creator = 'тест \' 123 \' тест--' & duration > 30") | ||||
|         res = get_videos(f) | ||||
|         self.assertEqual(res, []) | ||||
|  | ||||
|     def test_playlist_items_selection(self): | ||||
|         entries = [{ | ||||
|             'id': compat_str(i), | ||||
|   | ||||
| @@ -51,6 +51,5 @@ class TestAES(unittest.TestCase): | ||||
|         decrypted = (aes_decrypt_text(encrypted, password, 32)) | ||||
|         self.assertEqual(decrypted, self.secret_msg) | ||||
|  | ||||
|  | ||||
| if __name__ == '__main__': | ||||
|     unittest.main() | ||||
|   | ||||
| @@ -60,7 +60,6 @@ def _file_md5(fn): | ||||
|     with open(fn, 'rb') as f: | ||||
|         return hashlib.md5(f.read()).hexdigest() | ||||
|  | ||||
|  | ||||
| defs = gettestcases() | ||||
|  | ||||
|  | ||||
| @@ -218,7 +217,6 @@ def generator(test_case): | ||||
|  | ||||
|     return test_template | ||||
|  | ||||
|  | ||||
| # And add them to TestDownload | ||||
| for n, test_case in enumerate(defs): | ||||
|     test_method = generator(test_case) | ||||
|   | ||||
| @@ -39,6 +39,5 @@ class TestExecution(unittest.TestCase): | ||||
|         _, stderr = p.communicate() | ||||
|         self.assertFalse(stderr) | ||||
|  | ||||
|  | ||||
| if __name__ == '__main__': | ||||
|     unittest.main() | ||||
|   | ||||
| @@ -87,7 +87,7 @@ class TestHTTP(unittest.TestCase): | ||||
|  | ||||
|         ydl = YoutubeDL({'logger': FakeLogger()}) | ||||
|         r = ydl.extract_info('http://localhost:%d/302' % self.port) | ||||
|         self.assertEqual(r['entries'][0]['url'], 'http://localhost:%d/vid.mp4' % self.port) | ||||
|         self.assertEqual(r['url'], 'http://localhost:%d/vid.mp4' % self.port) | ||||
|  | ||||
|  | ||||
| class TestHTTPS(unittest.TestCase): | ||||
| @@ -111,7 +111,7 @@ class TestHTTPS(unittest.TestCase): | ||||
|  | ||||
|         ydl = YoutubeDL({'logger': FakeLogger(), 'nocheckcertificate': True}) | ||||
|         r = ydl.extract_info('https://localhost:%d/video.html' % self.port) | ||||
|         self.assertEqual(r['entries'][0]['url'], 'https://localhost:%d/vid.mp4' % self.port) | ||||
|         self.assertEqual(r['url'], 'https://localhost:%d/vid.mp4' % self.port) | ||||
|  | ||||
|  | ||||
| def _build_proxy_handler(name): | ||||
| @@ -169,6 +169,5 @@ class TestProxy(unittest.TestCase): | ||||
|         # b'xn--fiq228c' is '中文'.encode('idna') | ||||
|         self.assertEqual(response, 'normal: http://xn--fiq228c.tw/') | ||||
|  | ||||
|  | ||||
| if __name__ == '__main__': | ||||
|     unittest.main() | ||||
|   | ||||
| @@ -43,6 +43,5 @@ class TestIqiyiSDKInterpreter(unittest.TestCase): | ||||
|         ie._login() | ||||
|         self.assertTrue('unable to log in:' in logger.messages[0]) | ||||
|  | ||||
|  | ||||
| if __name__ == '__main__': | ||||
|     unittest.main() | ||||
|   | ||||
| @@ -104,14 +104,6 @@ class TestJSInterpreter(unittest.TestCase): | ||||
|         }''') | ||||
|         self.assertEqual(jsi.call_function('x'), [20, 20, 30, 40, 50]) | ||||
|  | ||||
|     def test_call(self): | ||||
|         jsi = JSInterpreter(''' | ||||
|         function x() { return 2; } | ||||
|         function y(a) { return x() + a; } | ||||
|         function z() { return y(3); } | ||||
|         ''') | ||||
|         self.assertEqual(jsi.call_function('z'), 5) | ||||
|  | ||||
|  | ||||
| if __name__ == '__main__': | ||||
|     unittest.main() | ||||
|   | ||||
| @@ -34,20 +34,14 @@ from youtube_dl.utils import ( | ||||
|     find_xpath_attr, | ||||
|     fix_xml_ampersands, | ||||
|     get_element_by_class, | ||||
|     get_element_by_attribute, | ||||
|     get_elements_by_class, | ||||
|     get_elements_by_attribute, | ||||
|     InAdvancePagedList, | ||||
|     intlist_to_bytes, | ||||
|     is_html, | ||||
|     js_to_json, | ||||
|     limit_length, | ||||
|     mimetype2ext, | ||||
|     month_by_name, | ||||
|     ohdave_rsa_encrypt, | ||||
|     OnDemandPagedList, | ||||
|     orderedSet, | ||||
|     parse_age_limit, | ||||
|     parse_duration, | ||||
|     parse_filesize, | ||||
|     parse_count, | ||||
| @@ -72,8 +66,6 @@ from youtube_dl.utils import ( | ||||
|     uppercase_escape, | ||||
|     lowercase_escape, | ||||
|     url_basename, | ||||
|     base_url, | ||||
|     urljoin, | ||||
|     urlencode_postdata, | ||||
|     urshift, | ||||
|     update_url_query, | ||||
| @@ -297,10 +289,6 @@ class TestUtil(unittest.TestCase): | ||||
|         self.assertEqual(unified_strdate('25-09-2014'), '20140925') | ||||
|         self.assertEqual(unified_strdate('27.02.2016 17:30'), '20160227') | ||||
|         self.assertEqual(unified_strdate('UNKNOWN DATE FORMAT'), None) | ||||
|         self.assertEqual(unified_strdate('Feb 7, 2016 at 6:35 pm'), '20160207') | ||||
|         self.assertEqual(unified_strdate('July 15th, 2013'), '20130715') | ||||
|         self.assertEqual(unified_strdate('September 1st, 2013'), '20130901') | ||||
|         self.assertEqual(unified_strdate('Sep 2nd, 2013'), '20130902') | ||||
|  | ||||
|     def test_unified_timestamps(self): | ||||
|         self.assertEqual(unified_timestamp('December 21, 2010'), 1292889600) | ||||
| @@ -320,8 +308,6 @@ class TestUtil(unittest.TestCase): | ||||
|         self.assertEqual(unified_timestamp('25-09-2014'), 1411603200) | ||||
|         self.assertEqual(unified_timestamp('27.02.2016 17:30'), 1456594200) | ||||
|         self.assertEqual(unified_timestamp('UNKNOWN DATE FORMAT'), None) | ||||
|         self.assertEqual(unified_timestamp('May 16, 2016 11:15 PM'), 1463440500) | ||||
|         self.assertEqual(unified_timestamp('Feb 7, 2016 at 6:35 pm'), 1454870100) | ||||
|  | ||||
|     def test_determine_ext(self): | ||||
|         self.assertEqual(determine_ext('http://example.com/foo/bar.mp4/?download'), 'mp4') | ||||
| @@ -445,44 +431,6 @@ class TestUtil(unittest.TestCase): | ||||
|             url_basename('http://media.w3.org/2010/05/sintel/trailer.mp4'), | ||||
|             'trailer.mp4') | ||||
|  | ||||
|     def test_base_url(self): | ||||
|         self.assertEqual(base_url('http://foo.de/'), 'http://foo.de/') | ||||
|         self.assertEqual(base_url('http://foo.de/bar'), 'http://foo.de/') | ||||
|         self.assertEqual(base_url('http://foo.de/bar/'), 'http://foo.de/bar/') | ||||
|         self.assertEqual(base_url('http://foo.de/bar/baz'), 'http://foo.de/bar/') | ||||
|         self.assertEqual(base_url('http://foo.de/bar/baz?x=z/x/c'), 'http://foo.de/bar/') | ||||
|  | ||||
|     def test_urljoin(self): | ||||
|         self.assertEqual(urljoin('http://foo.de/', '/a/b/c.txt'), 'http://foo.de/a/b/c.txt') | ||||
|         self.assertEqual(urljoin('//foo.de/', '/a/b/c.txt'), '//foo.de/a/b/c.txt') | ||||
|         self.assertEqual(urljoin('http://foo.de/', 'a/b/c.txt'), 'http://foo.de/a/b/c.txt') | ||||
|         self.assertEqual(urljoin('http://foo.de', '/a/b/c.txt'), 'http://foo.de/a/b/c.txt') | ||||
|         self.assertEqual(urljoin('http://foo.de', 'a/b/c.txt'), 'http://foo.de/a/b/c.txt') | ||||
|         self.assertEqual(urljoin('http://foo.de/', 'http://foo.de/a/b/c.txt'), 'http://foo.de/a/b/c.txt') | ||||
|         self.assertEqual(urljoin('http://foo.de/', '//foo.de/a/b/c.txt'), '//foo.de/a/b/c.txt') | ||||
|         self.assertEqual(urljoin(None, 'http://foo.de/a/b/c.txt'), 'http://foo.de/a/b/c.txt') | ||||
|         self.assertEqual(urljoin(None, '//foo.de/a/b/c.txt'), '//foo.de/a/b/c.txt') | ||||
|         self.assertEqual(urljoin('', 'http://foo.de/a/b/c.txt'), 'http://foo.de/a/b/c.txt') | ||||
|         self.assertEqual(urljoin(['foobar'], 'http://foo.de/a/b/c.txt'), 'http://foo.de/a/b/c.txt') | ||||
|         self.assertEqual(urljoin('http://foo.de/', None), None) | ||||
|         self.assertEqual(urljoin('http://foo.de/', ''), None) | ||||
|         self.assertEqual(urljoin('http://foo.de/', ['foobar']), None) | ||||
|         self.assertEqual(urljoin('http://foo.de/a/b/c.txt', '.././../d.txt'), 'http://foo.de/d.txt') | ||||
|  | ||||
|     def test_parse_age_limit(self): | ||||
|         self.assertEqual(parse_age_limit(None), None) | ||||
|         self.assertEqual(parse_age_limit(False), None) | ||||
|         self.assertEqual(parse_age_limit('invalid'), None) | ||||
|         self.assertEqual(parse_age_limit(0), 0) | ||||
|         self.assertEqual(parse_age_limit(18), 18) | ||||
|         self.assertEqual(parse_age_limit(21), 21) | ||||
|         self.assertEqual(parse_age_limit(22), None) | ||||
|         self.assertEqual(parse_age_limit('18'), 18) | ||||
|         self.assertEqual(parse_age_limit('18+'), 18) | ||||
|         self.assertEqual(parse_age_limit('PG-13'), 13) | ||||
|         self.assertEqual(parse_age_limit('TV-14'), 14) | ||||
|         self.assertEqual(parse_age_limit('TV-MA'), 17) | ||||
|  | ||||
|     def test_parse_duration(self): | ||||
|         self.assertEqual(parse_duration(None), None) | ||||
|         self.assertEqual(parse_duration(False), None) | ||||
| @@ -513,7 +461,6 @@ class TestUtil(unittest.TestCase): | ||||
|         self.assertEqual(parse_duration('1 hour 3 minutes'), 3780) | ||||
|         self.assertEqual(parse_duration('87 Min.'), 5220) | ||||
|         self.assertEqual(parse_duration('PT1H0.040S'), 3600.04) | ||||
|         self.assertEqual(parse_duration('PT00H03M30SZ'), 210) | ||||
|  | ||||
|     def test_fix_xml_ampersands(self): | ||||
|         self.assertEqual( | ||||
| @@ -662,22 +609,6 @@ class TestUtil(unittest.TestCase): | ||||
|             limit_length('foo bar baz asd', 12).startswith('foo bar')) | ||||
|         self.assertTrue('...' in limit_length('foo bar baz asd', 12)) | ||||
|  | ||||
|     def test_mimetype2ext(self): | ||||
|         self.assertEqual(mimetype2ext(None), None) | ||||
|         self.assertEqual(mimetype2ext('video/x-flv'), 'flv') | ||||
|         self.assertEqual(mimetype2ext('application/x-mpegURL'), 'm3u8') | ||||
|         self.assertEqual(mimetype2ext('text/vtt'), 'vtt') | ||||
|         self.assertEqual(mimetype2ext('text/vtt;charset=utf-8'), 'vtt') | ||||
|         self.assertEqual(mimetype2ext('text/html; charset=utf-8'), 'html') | ||||
|  | ||||
|     def test_month_by_name(self): | ||||
|         self.assertEqual(month_by_name(None), None) | ||||
|         self.assertEqual(month_by_name('December', 'en'), 12) | ||||
|         self.assertEqual(month_by_name('décembre', 'fr'), 12) | ||||
|         self.assertEqual(month_by_name('December'), 12) | ||||
|         self.assertEqual(month_by_name('décembre'), None) | ||||
|         self.assertEqual(month_by_name('Unknown', 'unknown'), None) | ||||
|  | ||||
|     def test_parse_codecs(self): | ||||
|         self.assertEqual(parse_codecs(''), {}) | ||||
|         self.assertEqual(parse_codecs('avc1.77.30, mp4a.40.2'), { | ||||
| @@ -765,9 +696,6 @@ class TestUtil(unittest.TestCase): | ||||
|         inp = '''{"foo":101}''' | ||||
|         self.assertEqual(js_to_json(inp), '''{"foo":101}''') | ||||
|  | ||||
|         inp = '''{"duration": "00:01:07"}''' | ||||
|         self.assertEqual(js_to_json(inp), '''{"duration": "00:01:07"}''') | ||||
|  | ||||
|     def test_js_to_json_edgecases(self): | ||||
|         on = js_to_json("{abc_def:'1\\'\\\\2\\\\\\'3\"4'}") | ||||
|         self.assertEqual(json.loads(on), {"abc_def": "1'\\2\\'3\"4"}) | ||||
| @@ -788,27 +716,12 @@ class TestUtil(unittest.TestCase): | ||||
|         on = js_to_json('["abc", "def",]') | ||||
|         self.assertEqual(json.loads(on), ['abc', 'def']) | ||||
|  | ||||
|         on = js_to_json('[/*comment\n*/"abc"/*comment\n*/,/*comment\n*/"def",/*comment\n*/]') | ||||
|         self.assertEqual(json.loads(on), ['abc', 'def']) | ||||
|  | ||||
|         on = js_to_json('[//comment\n"abc" //comment\n,//comment\n"def",//comment\n]') | ||||
|         self.assertEqual(json.loads(on), ['abc', 'def']) | ||||
|  | ||||
|         on = js_to_json('{"abc": "def",}') | ||||
|         self.assertEqual(json.loads(on), {'abc': 'def'}) | ||||
|  | ||||
|         on = js_to_json('{/*comment\n*/"abc"/*comment\n*/:/*comment\n*/"def"/*comment\n*/,/*comment\n*/}') | ||||
|         self.assertEqual(json.loads(on), {'abc': 'def'}) | ||||
|  | ||||
|         on = js_to_json('{ 0: /* " \n */ ",]" , }') | ||||
|         self.assertEqual(json.loads(on), {'0': ',]'}) | ||||
|  | ||||
|         on = js_to_json('{ /*comment\n*/0/*comment\n*/: /* " \n */ ",]" , }') | ||||
|         self.assertEqual(json.loads(on), {'0': ',]'}) | ||||
|  | ||||
|         on = js_to_json('{ 0: // comment\n1 }') | ||||
|         self.assertEqual(json.loads(on), {'0': 1}) | ||||
|  | ||||
|         on = js_to_json(r'["<p>x<\/p>"]') | ||||
|         self.assertEqual(json.loads(on), ['<p>x</p>']) | ||||
|  | ||||
| @@ -818,27 +731,15 @@ class TestUtil(unittest.TestCase): | ||||
|         on = js_to_json("['a\\\nb']") | ||||
|         self.assertEqual(json.loads(on), ['ab']) | ||||
|  | ||||
|         on = js_to_json("/*comment\n*/[/*comment\n*/'a\\\nb'/*comment\n*/]/*comment\n*/") | ||||
|         self.assertEqual(json.loads(on), ['ab']) | ||||
|  | ||||
|         on = js_to_json('{0xff:0xff}') | ||||
|         self.assertEqual(json.loads(on), {'255': 255}) | ||||
|  | ||||
|         on = js_to_json('{/*comment\n*/0xff/*comment\n*/:/*comment\n*/0xff/*comment\n*/}') | ||||
|         self.assertEqual(json.loads(on), {'255': 255}) | ||||
|  | ||||
|         on = js_to_json('{077:077}') | ||||
|         self.assertEqual(json.loads(on), {'63': 63}) | ||||
|  | ||||
|         on = js_to_json('{/*comment\n*/077/*comment\n*/:/*comment\n*/077/*comment\n*/}') | ||||
|         self.assertEqual(json.loads(on), {'63': 63}) | ||||
|  | ||||
|         on = js_to_json('{42:42}') | ||||
|         self.assertEqual(json.loads(on), {'42': 42}) | ||||
|  | ||||
|         on = js_to_json('{/*comment\n*/42/*comment\n*/:/*comment\n*/42/*comment\n*/}') | ||||
|         self.assertEqual(json.loads(on), {'42': 42}) | ||||
|  | ||||
|     def test_extract_attributes(self): | ||||
|         self.assertEqual(extract_attributes('<e x="y">'), {'x': 'y'}) | ||||
|         self.assertEqual(extract_attributes("<e x='y'>"), {'x': 'y'}) | ||||
| @@ -900,10 +801,7 @@ class TestUtil(unittest.TestCase): | ||||
|         self.assertEqual(parse_filesize('2 MiB'), 2097152) | ||||
|         self.assertEqual(parse_filesize('5 GB'), 5000000000) | ||||
|         self.assertEqual(parse_filesize('1.2Tb'), 1200000000000) | ||||
|         self.assertEqual(parse_filesize('1.2tb'), 1200000000000) | ||||
|         self.assertEqual(parse_filesize('1,24 KB'), 1240) | ||||
|         self.assertEqual(parse_filesize('1,24 kb'), 1240) | ||||
|         self.assertEqual(parse_filesize('8.5 megabytes'), 8500000) | ||||
|  | ||||
|     def test_parse_count(self): | ||||
|         self.assertEqual(parse_count(None), None) | ||||
| @@ -1054,7 +952,6 @@ The first line | ||||
|         self.assertEqual(cli_option({'proxy': '127.0.0.1:3128'}, '--proxy', 'proxy'), ['--proxy', '127.0.0.1:3128']) | ||||
|         self.assertEqual(cli_option({'proxy': None}, '--proxy', 'proxy'), []) | ||||
|         self.assertEqual(cli_option({}, '--proxy', 'proxy'), []) | ||||
|         self.assertEqual(cli_option({'retries': 10}, '--retries', 'retries'), ['--retries', '10']) | ||||
|  | ||||
|     def test_cli_valueless_option(self): | ||||
|         self.assertEqual(cli_valueless_option( | ||||
| @@ -1127,32 +1024,5 @@ The first line | ||||
|         self.assertEqual(get_element_by_class('foo', html), 'nice') | ||||
|         self.assertEqual(get_element_by_class('no-such-class', html), None) | ||||
|  | ||||
|     def test_get_element_by_attribute(self): | ||||
|         html = ''' | ||||
|             <span class="foo bar">nice</span> | ||||
|         ''' | ||||
|  | ||||
|         self.assertEqual(get_element_by_attribute('class', 'foo bar', html), 'nice') | ||||
|         self.assertEqual(get_element_by_attribute('class', 'foo', html), None) | ||||
|         self.assertEqual(get_element_by_attribute('class', 'no-such-foo', html), None) | ||||
|  | ||||
|     def test_get_elements_by_class(self): | ||||
|         html = ''' | ||||
|             <span class="foo bar">nice</span><span class="foo bar">also nice</span> | ||||
|         ''' | ||||
|  | ||||
|         self.assertEqual(get_elements_by_class('foo', html), ['nice', 'also nice']) | ||||
|         self.assertEqual(get_elements_by_class('no-such-class', html), []) | ||||
|  | ||||
|     def test_get_elements_by_attribute(self): | ||||
|         html = ''' | ||||
|             <span class="foo bar">nice</span><span class="foo bar">also nice</span> | ||||
|         ''' | ||||
|  | ||||
|         self.assertEqual(get_elements_by_attribute('class', 'foo bar', html), ['nice', 'also nice']) | ||||
|         self.assertEqual(get_elements_by_attribute('class', 'foo', html), []) | ||||
|         self.assertEqual(get_elements_by_attribute('class', 'no-such-foo', html), []) | ||||
|  | ||||
|  | ||||
| if __name__ == '__main__': | ||||
|     unittest.main() | ||||
|   | ||||
| @@ -1,71 +0,0 @@ | ||||
| #!/usr/bin/env python | ||||
| # coding: utf-8 | ||||
|  | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| import unittest | ||||
|  | ||||
| import sys | ||||
| import os | ||||
| import subprocess | ||||
| sys.path.insert(0, os.path.dirname(os.path.dirname(os.path.abspath(__file__)))) | ||||
|  | ||||
| rootDir = os.path.dirname(os.path.dirname(os.path.abspath(__file__))) | ||||
|  | ||||
|  | ||||
| class TestVerboseOutput(unittest.TestCase): | ||||
|     def test_private_info_arg(self): | ||||
|         outp = subprocess.Popen( | ||||
|             [ | ||||
|                 sys.executable, 'youtube_dl/__main__.py', '-v', | ||||
|                 '--username', 'johnsmith@gmail.com', | ||||
|                 '--password', 'secret', | ||||
|             ], cwd=rootDir, stdout=subprocess.PIPE, stderr=subprocess.PIPE) | ||||
|         sout, serr = outp.communicate() | ||||
|         self.assertTrue(b'--username' in serr) | ||||
|         self.assertTrue(b'johnsmith' not in serr) | ||||
|         self.assertTrue(b'--password' in serr) | ||||
|         self.assertTrue(b'secret' not in serr) | ||||
|  | ||||
|     def test_private_info_shortarg(self): | ||||
|         outp = subprocess.Popen( | ||||
|             [ | ||||
|                 sys.executable, 'youtube_dl/__main__.py', '-v', | ||||
|                 '-u', 'johnsmith@gmail.com', | ||||
|                 '-p', 'secret', | ||||
|             ], cwd=rootDir, stdout=subprocess.PIPE, stderr=subprocess.PIPE) | ||||
|         sout, serr = outp.communicate() | ||||
|         self.assertTrue(b'-u' in serr) | ||||
|         self.assertTrue(b'johnsmith' not in serr) | ||||
|         self.assertTrue(b'-p' in serr) | ||||
|         self.assertTrue(b'secret' not in serr) | ||||
|  | ||||
|     def test_private_info_eq(self): | ||||
|         outp = subprocess.Popen( | ||||
|             [ | ||||
|                 sys.executable, 'youtube_dl/__main__.py', '-v', | ||||
|                 '--username=johnsmith@gmail.com', | ||||
|                 '--password=secret', | ||||
|             ], cwd=rootDir, stdout=subprocess.PIPE, stderr=subprocess.PIPE) | ||||
|         sout, serr = outp.communicate() | ||||
|         self.assertTrue(b'--username' in serr) | ||||
|         self.assertTrue(b'johnsmith' not in serr) | ||||
|         self.assertTrue(b'--password' in serr) | ||||
|         self.assertTrue(b'secret' not in serr) | ||||
|  | ||||
|     def test_private_info_shortarg_eq(self): | ||||
|         outp = subprocess.Popen( | ||||
|             [ | ||||
|                 sys.executable, 'youtube_dl/__main__.py', '-v', | ||||
|                 '-u=johnsmith@gmail.com', | ||||
|                 '-p=secret', | ||||
|             ], cwd=rootDir, stdout=subprocess.PIPE, stderr=subprocess.PIPE) | ||||
|         sout, serr = outp.communicate() | ||||
|         self.assertTrue(b'-u' in serr) | ||||
|         self.assertTrue(b'johnsmith' not in serr) | ||||
|         self.assertTrue(b'-p' in serr) | ||||
|         self.assertTrue(b'secret' not in serr) | ||||
|  | ||||
|  | ||||
| if __name__ == '__main__': | ||||
|     unittest.main() | ||||
| @@ -24,7 +24,6 @@ class YoutubeDL(youtube_dl.YoutubeDL): | ||||
|         super(YoutubeDL, self).__init__(*args, **kwargs) | ||||
|         self.to_stderr = self.to_screen | ||||
|  | ||||
|  | ||||
| params = get_params({ | ||||
|     'writeannotations': True, | ||||
|     'skip_download': True, | ||||
| @@ -75,6 +74,5 @@ class TestAnnotations(unittest.TestCase): | ||||
|     def tearDown(self): | ||||
|         try_rm(ANNOTATIONS_FILE) | ||||
|  | ||||
|  | ||||
| if __name__ == '__main__': | ||||
|     unittest.main() | ||||
|   | ||||
| @@ -66,6 +66,5 @@ class TestYoutubeLists(unittest.TestCase): | ||||
|         for entry in result['entries']: | ||||
|             self.assertTrue(entry.get('title')) | ||||
|  | ||||
|  | ||||
| if __name__ == '__main__': | ||||
|     unittest.main() | ||||
|   | ||||
| @@ -114,7 +114,6 @@ def make_tfunc(url, stype, sig_input, expected_sig): | ||||
|     test_func.__name__ = str('test_signature_' + stype + '_' + test_id) | ||||
|     setattr(TestSignature, test_func.__name__, test_func) | ||||
|  | ||||
|  | ||||
| for test_spec in _TESTS: | ||||
|     make_tfunc(*test_spec) | ||||
|  | ||||
|   | ||||
| @@ -1,5 +1,5 @@ | ||||
| #!/usr/bin/env python | ||||
| # coding: utf-8 | ||||
| # -*- coding: utf-8 -*- | ||||
|  | ||||
| from __future__ import absolute_import, unicode_literals | ||||
|  | ||||
| @@ -24,7 +24,6 @@ import sys | ||||
| import time | ||||
| import tokenize | ||||
| import traceback | ||||
| import random | ||||
|  | ||||
| from .compat import ( | ||||
|     compat_basestring, | ||||
| @@ -132,9 +131,6 @@ class YoutubeDL(object): | ||||
|     username:          Username for authentication purposes. | ||||
|     password:          Password for authentication purposes. | ||||
|     videopassword:     Password for accessing a video. | ||||
|     ap_mso:            Adobe Pass multiple-system operator identifier. | ||||
|     ap_username:       Multiple-system operator account username. | ||||
|     ap_password:       Multiple-system operator account password. | ||||
|     usenetrc:          Use netrc for authentication instead. | ||||
|     verbose:           Print additional info to stdout. | ||||
|     quiet:             Do not print messages to stdout. | ||||
| @@ -160,7 +156,6 @@ class YoutubeDL(object): | ||||
|     playlistend:       Playlist item to end at. | ||||
|     playlist_items:    Specific indices of playlist to download. | ||||
|     playlistreverse:   Download playlist items in reverse order. | ||||
|     playlistrandom:    Download playlist items in random order. | ||||
|     matchtitle:        Download only matching titles. | ||||
|     rejecttitle:       Reject downloads for matching titles. | ||||
|     logger:            Log messages to a logging.Logger instance. | ||||
| @@ -254,16 +249,7 @@ class YoutubeDL(object): | ||||
|     source_address:    (Experimental) Client-side IP address to bind to. | ||||
|     call_home:         Boolean, true iff we are allowed to contact the | ||||
|                        youtube-dl servers for debugging. | ||||
|     sleep_interval:    Number of seconds to sleep before each download when | ||||
|                        used alone or a lower bound of a range for randomized | ||||
|                        sleep before each download (minimum possible number | ||||
|                        of seconds to sleep) when used along with | ||||
|                        max_sleep_interval. | ||||
|     max_sleep_interval:Upper bound of a range for randomized sleep before each | ||||
|                        download (maximum possible number of seconds to sleep). | ||||
|                        Must only be used along with sleep_interval. | ||||
|                        Actual sleep time will be a random float from range | ||||
|                        [sleep_interval; max_sleep_interval]. | ||||
|     sleep_interval:    Number of seconds to sleep before each download. | ||||
|     listformats:       Print an overview of available video formats and exit. | ||||
|     list_thumbnails:   Print a table of all thumbnails and exit. | ||||
|     match_filter:      A function that gets called with the info_dict of | ||||
| @@ -586,7 +572,7 @@ class YoutubeDL(object): | ||||
|             if autonumber_size is None: | ||||
|                 autonumber_size = 5 | ||||
|             autonumber_templ = '%0' + str(autonumber_size) + 'd' | ||||
|             template_dict['autonumber'] = autonumber_templ % (self.params.get('autonumber_start', 1) - 1 + self._num_downloads) | ||||
|             template_dict['autonumber'] = autonumber_templ % self._num_downloads | ||||
|             if template_dict.get('playlist_index') is not None: | ||||
|                 template_dict['playlist_index'] = '%0*d' % (len(str(template_dict['n_entries'])), template_dict['playlist_index']) | ||||
|             if template_dict.get('resolution') is None: | ||||
| @@ -844,9 +830,6 @@ class YoutubeDL(object): | ||||
|             if self.params.get('playlistreverse', False): | ||||
|                 entries = entries[::-1] | ||||
|  | ||||
|             if self.params.get('playlistrandom', False): | ||||
|                 random.shuffle(entries) | ||||
|  | ||||
|             for i, entry in enumerate(entries, 1): | ||||
|                 self.to_screen('[download] Downloading video %s of %s' % (i, n_entries)) | ||||
|                 extra = { | ||||
| @@ -1264,10 +1247,8 @@ class YoutubeDL(object): | ||||
|                 info_dict['thumbnails'] = thumbnails = [{'url': thumbnail}] | ||||
|         if thumbnails: | ||||
|             thumbnails.sort(key=lambda t: ( | ||||
|                 t.get('preference') if t.get('preference') is not None else -1, | ||||
|                 t.get('width') if t.get('width') is not None else -1, | ||||
|                 t.get('height') if t.get('height') is not None else -1, | ||||
|                 t.get('id') if t.get('id') is not None else '', t.get('url'))) | ||||
|                 t.get('preference'), t.get('width'), t.get('height'), | ||||
|                 t.get('id'), t.get('url'))) | ||||
|             for i, t in enumerate(thumbnails): | ||||
|                 t['url'] = sanitize_url(t['url']) | ||||
|                 if t.get('width') and t.get('height'): | ||||
| @@ -1309,7 +1290,7 @@ class YoutubeDL(object): | ||||
|                 for subtitle_format in subtitle: | ||||
|                     if subtitle_format.get('url'): | ||||
|                         subtitle_format['url'] = sanitize_url(subtitle_format['url']) | ||||
|                     if subtitle_format.get('ext') is None: | ||||
|                     if 'ext' not in subtitle_format: | ||||
|                         subtitle_format['ext'] = determine_ext(subtitle_format['url']).lower() | ||||
|  | ||||
|         if self.params.get('listsubtitles', False): | ||||
| @@ -1344,7 +1325,7 @@ class YoutubeDL(object): | ||||
|                 format['format_id'] = compat_str(i) | ||||
|             else: | ||||
|                 # Sanitize format_id from characters used in format selector expression | ||||
|                 format['format_id'] = re.sub(r'[\s,/+\[\]()]', '_', format['format_id']) | ||||
|                 format['format_id'] = re.sub('[\s,/+\[\]()]', '_', format['format_id']) | ||||
|             format_id = format['format_id'] | ||||
|             if format_id not in formats_dict: | ||||
|                 formats_dict[format_id] = [] | ||||
| @@ -1364,11 +1345,11 @@ class YoutubeDL(object): | ||||
|                     note=' ({0})'.format(format['format_note']) if format.get('format_note') is not None else '', | ||||
|                 ) | ||||
|             # Automatically determine file extension if missing | ||||
|             if format.get('ext') is None: | ||||
|             if 'ext' not in format: | ||||
|                 format['ext'] = determine_ext(format['url']).lower() | ||||
|             # Automatically determine protocol if missing (useful for format | ||||
|             # selection purposes) | ||||
|             if format.get('protocol') is None: | ||||
|             if 'protocol' not in format: | ||||
|                 format['protocol'] = determine_protocol(format) | ||||
|             # Add HTTP headers, so that external programs can use them from the | ||||
|             # json output | ||||
| @@ -1613,9 +1594,7 @@ class YoutubeDL(object): | ||||
|                         self.to_screen('[info] Video subtitle %s.%s is already_present' % (sub_lang, sub_format)) | ||||
|                     else: | ||||
|                         self.to_screen('[info] Writing video subtitles to: ' + sub_filename) | ||||
|                         # Use newline='' to prevent conversion of newline characters | ||||
|                         # See https://github.com/rg3/youtube-dl/issues/10268 | ||||
|                         with io.open(encodeFilename(sub_filename), 'w', encoding='utf-8', newline='') as subfile: | ||||
|                         with io.open(encodeFilename(sub_filename), 'w', encoding='utf-8') as subfile: | ||||
|                             subfile.write(sub_data) | ||||
|                 except (OSError, IOError): | ||||
|                     self.report_error('Cannot write subtitles file ' + sub_filename) | ||||
| @@ -1663,7 +1642,7 @@ class YoutubeDL(object): | ||||
|                         video_ext, audio_ext = audio.get('ext'), video.get('ext') | ||||
|                         if video_ext and audio_ext: | ||||
|                             COMPATIBLE_EXTS = ( | ||||
|                                 ('mp3', 'mp4', 'm4a', 'm4p', 'm4b', 'm4r', 'm4v', 'ismv', 'isma'), | ||||
|                                 ('mp3', 'mp4', 'm4a', 'm4p', 'm4b', 'm4r', 'm4v'), | ||||
|                                 ('webm') | ||||
|                             ) | ||||
|                             for exts in COMPATIBLE_EXTS: | ||||
|   | ||||
| @@ -1,5 +1,5 @@ | ||||
| #!/usr/bin/env python | ||||
| # coding: utf-8 | ||||
| # -*- coding: utf-8 -*- | ||||
|  | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| @@ -34,14 +34,12 @@ from .utils import ( | ||||
|     setproctitle, | ||||
|     std_headers, | ||||
|     write_string, | ||||
|     render_table, | ||||
| ) | ||||
| from .update import update_self | ||||
| from .downloader import ( | ||||
|     FileDownloader, | ||||
| ) | ||||
| from .extractor import gen_extractors, list_extractors | ||||
| from .extractor.adobepass import MSO_INFO | ||||
| from .YoutubeDL import YoutubeDL | ||||
|  | ||||
|  | ||||
| @@ -95,7 +93,8 @@ def _real_main(argv=None): | ||||
|                 write_string('[debug] Batch file urls: ' + repr(batch_urls) + '\n') | ||||
|         except IOError: | ||||
|             sys.exit('ERROR: batch file could not be read') | ||||
|     all_urls = batch_urls + [url.strip() for url in args]  # batch_urls are already stripped in read_batch_urls | ||||
|     all_urls = batch_urls + args | ||||
|     all_urls = [url.strip() for url in all_urls] | ||||
|     _enc = preferredencoding() | ||||
|     all_urls = [url.decode(_enc, 'ignore') if isinstance(url, bytes) else url for url in all_urls] | ||||
|  | ||||
| @@ -119,32 +118,18 @@ def _real_main(argv=None): | ||||
|                 desc += ' (Example: "%s%s:%s" )' % (ie.SEARCH_KEY, random.choice(_COUNTS), random.choice(_SEARCHES)) | ||||
|             write_string(desc + '\n', out=sys.stdout) | ||||
|         sys.exit(0) | ||||
|     if opts.ap_list_mso: | ||||
|         table = [[mso_id, mso_info['name']] for mso_id, mso_info in MSO_INFO.items()] | ||||
|         write_string('Supported TV Providers:\n' + render_table(['mso', 'mso name'], table) + '\n', out=sys.stdout) | ||||
|         sys.exit(0) | ||||
|  | ||||
|     # Conflicting, missing and erroneous options | ||||
|     if opts.usenetrc and (opts.username is not None or opts.password is not None): | ||||
|         parser.error('using .netrc conflicts with giving username/password') | ||||
|     if opts.password is not None and opts.username is None: | ||||
|         parser.error('account username missing\n') | ||||
|     if opts.ap_password is not None and opts.ap_username is None: | ||||
|         parser.error('TV Provider account username missing\n') | ||||
|     if opts.outtmpl is not None and (opts.usetitle or opts.autonumber or opts.useid): | ||||
|         parser.error('using output template conflicts with using title, video ID or auto number') | ||||
|     if opts.autonumber_size is not None: | ||||
|         if opts.autonumber_size <= 0: | ||||
|             parser.error('auto number size must be positive') | ||||
|     if opts.autonumber_start is not None: | ||||
|         if opts.autonumber_start < 0: | ||||
|             parser.error('auto number start must be positive or 0') | ||||
|     if opts.usetitle and opts.useid: | ||||
|         parser.error('using title conflicts with using video ID') | ||||
|     if opts.username is not None and opts.password is None: | ||||
|         opts.password = compat_getpass('Type account password and press [Return]: ') | ||||
|     if opts.ap_username is not None and opts.ap_password is None: | ||||
|         opts.ap_password = compat_getpass('Type TV provider account password and press [Return]: ') | ||||
|     if opts.ratelimit is not None: | ||||
|         numeric_limit = FileDownloader.parse_bytes(opts.ratelimit) | ||||
|         if numeric_limit is None: | ||||
| @@ -160,18 +145,6 @@ def _real_main(argv=None): | ||||
|         if numeric_limit is None: | ||||
|             parser.error('invalid max_filesize specified') | ||||
|         opts.max_filesize = numeric_limit | ||||
|     if opts.sleep_interval is not None: | ||||
|         if opts.sleep_interval < 0: | ||||
|             parser.error('sleep interval must be positive or 0') | ||||
|     if opts.max_sleep_interval is not None: | ||||
|         if opts.max_sleep_interval < 0: | ||||
|             parser.error('max sleep interval must be positive or 0') | ||||
|         if opts.max_sleep_interval < opts.sleep_interval: | ||||
|             parser.error('max sleep interval must be greater than or equal to min sleep interval') | ||||
|     else: | ||||
|         opts.max_sleep_interval = opts.sleep_interval | ||||
|     if opts.ap_mso and opts.ap_mso not in MSO_INFO: | ||||
|         parser.error('Unsupported TV Provider, use --ap-list-mso to get a list of supported TV Providers') | ||||
|  | ||||
|     def parse_retries(retries): | ||||
|         if retries in ('inf', 'infinite'): | ||||
| @@ -271,6 +244,8 @@ def _real_main(argv=None): | ||||
|         postprocessors.append({ | ||||
|             'key': 'FFmpegEmbedSubtitle', | ||||
|         }) | ||||
|     if opts.xattrs: | ||||
|         postprocessors.append({'key': 'XAttrMetadata'}) | ||||
|     if opts.embedthumbnail: | ||||
|         already_have_thumbnail = opts.writethumbnail or opts.write_all_thumbnails | ||||
|         postprocessors.append({ | ||||
| @@ -279,10 +254,6 @@ def _real_main(argv=None): | ||||
|         }) | ||||
|         if not already_have_thumbnail: | ||||
|             opts.writethumbnail = True | ||||
|     # XAttrMetadataPP should be run after post-processors that may change file | ||||
|     # contents | ||||
|     if opts.xattrs: | ||||
|         postprocessors.append({'key': 'XAttrMetadata'}) | ||||
|     # Please keep ExecAfterDownload towards the bottom as it allows the user to modify the final file in any way. | ||||
|     # So if the user is able to remove the file before your postprocessor runs it might cause a few problems. | ||||
|     if opts.exec_cmd: | ||||
| @@ -290,6 +261,12 @@ def _real_main(argv=None): | ||||
|             'key': 'ExecAfterDownload', | ||||
|             'exec_cmd': opts.exec_cmd, | ||||
|         }) | ||||
|     if opts.xattr_set_filesize: | ||||
|         try: | ||||
|             import xattr | ||||
|             xattr  # Confuse flake8 | ||||
|         except ImportError: | ||||
|             parser.error('setting filesize xattr requested but python-xattr is not available') | ||||
|     external_downloader_args = None | ||||
|     if opts.external_downloader_args: | ||||
|         external_downloader_args = compat_shlex_split(opts.external_downloader_args) | ||||
| @@ -306,9 +283,6 @@ def _real_main(argv=None): | ||||
|         'password': opts.password, | ||||
|         'twofactor': opts.twofactor, | ||||
|         'videopassword': opts.videopassword, | ||||
|         'ap_mso': opts.ap_mso, | ||||
|         'ap_username': opts.ap_username, | ||||
|         'ap_password': opts.ap_password, | ||||
|         'quiet': (opts.quiet or any_getting or any_printing), | ||||
|         'no_warnings': opts.no_warnings, | ||||
|         'forceurl': opts.geturl, | ||||
| @@ -327,7 +301,6 @@ def _real_main(argv=None): | ||||
|         'listformats': opts.listformats, | ||||
|         'outtmpl': outtmpl, | ||||
|         'autonumber_size': opts.autonumber_size, | ||||
|         'autonumber_start': opts.autonumber_start, | ||||
|         'restrictfilenames': opts.restrictfilenames, | ||||
|         'ignoreerrors': opts.ignoreerrors, | ||||
|         'force_generic_extractor': opts.force_generic_extractor, | ||||
| @@ -335,7 +308,6 @@ def _real_main(argv=None): | ||||
|         'nooverwrites': opts.nooverwrites, | ||||
|         'retries': opts.retries, | ||||
|         'fragment_retries': opts.fragment_retries, | ||||
|         'skip_unavailable_fragments': opts.skip_unavailable_fragments, | ||||
|         'buffersize': opts.buffersize, | ||||
|         'noresizebuffer': opts.noresizebuffer, | ||||
|         'continuedl': opts.continue_dl, | ||||
| @@ -344,7 +316,6 @@ def _real_main(argv=None): | ||||
|         'playliststart': opts.playliststart, | ||||
|         'playlistend': opts.playlistend, | ||||
|         'playlistreverse': opts.playlist_reverse, | ||||
|         'playlistrandom': opts.playlist_random, | ||||
|         'noplaylist': opts.noplaylist, | ||||
|         'logtostderr': opts.outtmpl == '-', | ||||
|         'consoletitle': opts.consoletitle, | ||||
| @@ -399,7 +370,6 @@ def _real_main(argv=None): | ||||
|         'source_address': opts.source_address, | ||||
|         'call_home': opts.call_home, | ||||
|         'sleep_interval': opts.sleep_interval, | ||||
|         'max_sleep_interval': opts.max_sleep_interval, | ||||
|         'external_downloader': opts.external_downloader, | ||||
|         'list_thumbnails': opts.list_thumbnails, | ||||
|         'playlist_items': opts.playlist_items, | ||||
| @@ -413,7 +383,7 @@ def _real_main(argv=None): | ||||
|         'postprocessor_args': postprocessor_args, | ||||
|         'cn_verification_proxy': opts.cn_verification_proxy, | ||||
|         'geo_verification_proxy': opts.geo_verification_proxy, | ||||
|         'config_location': opts.config_location, | ||||
|  | ||||
|     } | ||||
|  | ||||
|     with YoutubeDL(ydl_opts) as ydl: | ||||
| @@ -457,5 +427,4 @@ def main(argv=None): | ||||
|     except KeyboardInterrupt: | ||||
|         sys.exit('\nERROR: Interrupted by user') | ||||
|  | ||||
|  | ||||
| __all__ = ['main', 'YoutubeDL', 'gen_extractors', 'list_extractors'] | ||||
|   | ||||
| @@ -174,7 +174,6 @@ def aes_decrypt_text(data, password, key_size_bytes): | ||||
|  | ||||
|     return plaintext | ||||
|  | ||||
|  | ||||
| RCON = (0x8d, 0x01, 0x02, 0x04, 0x08, 0x10, 0x20, 0x40, 0x80, 0x1b, 0x36) | ||||
| SBOX = (0x63, 0x7C, 0x77, 0x7B, 0xF2, 0x6B, 0x6F, 0xC5, 0x30, 0x01, 0x67, 0x2B, 0xFE, 0xD7, 0xAB, 0x76, | ||||
|         0xCA, 0x82, 0xC9, 0x7D, 0xFA, 0x59, 0x47, 0xF0, 0xAD, 0xD4, 0xA2, 0xAF, 0x9C, 0xA4, 0x72, 0xC0, | ||||
| @@ -329,5 +328,4 @@ def inc(data): | ||||
|             break | ||||
|     return data | ||||
|  | ||||
|  | ||||
| __all__ = ['aes_encrypt', 'key_expansion', 'aes_ctr_decrypt', 'aes_cbc_decrypt', 'aes_decrypt_text'] | ||||
|   | ||||
| @@ -2344,7 +2344,7 @@ try: | ||||
|     from urllib.parse import unquote_plus as compat_urllib_parse_unquote_plus | ||||
| except ImportError:  # Python 2 | ||||
|     _asciire = (compat_urllib_parse._asciire if hasattr(compat_urllib_parse, '_asciire') | ||||
|                 else re.compile(r'([\x00-\x7f]+)')) | ||||
|                 else re.compile('([\x00-\x7f]+)')) | ||||
|  | ||||
|     # HACK: The following are the correct unquote_to_bytes, unquote and unquote_plus | ||||
|     # implementations from cpython 3.4.3's stdlib. Python 2's version | ||||
| @@ -2491,7 +2491,6 @@ class _TreeBuilder(etree.TreeBuilder): | ||||
|     def doctype(self, name, pubid, system): | ||||
|         pass | ||||
|  | ||||
|  | ||||
| if sys.version_info[0] >= 3: | ||||
|     def compat_etree_fromstring(text): | ||||
|         return etree.XML(text, parser=etree.XMLParser(target=_TreeBuilder())) | ||||
| @@ -2529,24 +2528,6 @@ else: | ||||
|                 el.text = el.text.decode('utf-8') | ||||
|         return doc | ||||
|  | ||||
| if hasattr(etree, 'register_namespace'): | ||||
|     compat_etree_register_namespace = etree.register_namespace | ||||
| else: | ||||
|     def compat_etree_register_namespace(prefix, uri): | ||||
|         """Register a namespace prefix. | ||||
|         The registry is global, and any existing mapping for either the | ||||
|         given prefix or the namespace URI will be removed. | ||||
|         *prefix* is the namespace prefix, *uri* is a namespace uri. Tags and | ||||
|         attributes in this namespace will be serialized with prefix if possible. | ||||
|         ValueError is raised if prefix is reserved or is invalid. | ||||
|         """ | ||||
|         if re.match(r"ns\d+$", prefix): | ||||
|             raise ValueError("Prefix format reserved for internal use") | ||||
|         for k, v in list(etree._namespace_map.items()): | ||||
|             if k == uri or v == prefix: | ||||
|                 del etree._namespace_map[k] | ||||
|         etree._namespace_map[uri] = prefix | ||||
|  | ||||
| if sys.version_info < (2, 7): | ||||
|     # Here comes the crazy part: In 2.6, if the xpath is a unicode, | ||||
|     # .//node does not match if a node is a direct child of . ! | ||||
| @@ -2806,7 +2787,6 @@ def workaround_optparse_bug9161(): | ||||
|             return real_add_option(self, *bargs, **bkwargs) | ||||
|         optparse.OptionGroup.add_option = _compat_add_option | ||||
|  | ||||
|  | ||||
| if hasattr(shutil, 'get_terminal_size'):  # Python >= 3.3 | ||||
|     compat_get_terminal_size = shutil.get_terminal_size | ||||
| else: | ||||
| @@ -2883,7 +2863,6 @@ __all__ = [ | ||||
|     'compat_cookiejar', | ||||
|     'compat_cookies', | ||||
|     'compat_etree_fromstring', | ||||
|     'compat_etree_register_namespace', | ||||
|     'compat_expanduser', | ||||
|     'compat_get_terminal_size', | ||||
|     'compat_getenv', | ||||
|   | ||||
| @@ -7,7 +7,6 @@ from .http import HttpFD | ||||
| from .rtmp import RtmpFD | ||||
| from .dash import DashSegmentsFD | ||||
| from .rtsp import RtspFD | ||||
| from .ism import IsmFD | ||||
| from .external import ( | ||||
|     get_external_downloader, | ||||
|     FFmpegFD, | ||||
| @@ -25,7 +24,6 @@ PROTOCOL_MAP = { | ||||
|     'rtsp': RtspFD, | ||||
|     'f4m': F4mFD, | ||||
|     'http_dash_segments': DashSegmentsFD, | ||||
|     'ism': IsmFD, | ||||
| } | ||||
|  | ||||
|  | ||||
|   | ||||
| @@ -4,7 +4,6 @@ import os | ||||
| import re | ||||
| import sys | ||||
| import time | ||||
| import random | ||||
|  | ||||
| from ..compat import compat_os_name | ||||
| from ..utils import ( | ||||
| @@ -343,10 +342,8 @@ class FileDownloader(object): | ||||
|             }) | ||||
|             return True | ||||
|  | ||||
|         min_sleep_interval = self.params.get('sleep_interval') | ||||
|         if min_sleep_interval: | ||||
|             max_sleep_interval = self.params.get('max_sleep_interval', min_sleep_interval) | ||||
|             sleep_interval = random.uniform(min_sleep_interval, max_sleep_interval) | ||||
|         sleep_interval = self.params.get('sleep_interval') | ||||
|         if sleep_interval: | ||||
|             self.to_screen('[download] Sleeping %s seconds...' % sleep_interval) | ||||
|             time.sleep(sleep_interval) | ||||
|  | ||||
|   | ||||
| @@ -1,6 +1,7 @@ | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| import os | ||||
| import re | ||||
|  | ||||
| from .fragment import FragmentFD | ||||
| from ..compat import compat_urllib_error | ||||
| @@ -18,32 +19,32 @@ class DashSegmentsFD(FragmentFD): | ||||
|     FD_NAME = 'dashsegments' | ||||
|  | ||||
|     def real_download(self, filename, info_dict): | ||||
|         segments = info_dict['fragments'][:1] if self.params.get( | ||||
|             'test', False) else info_dict['fragments'] | ||||
|         base_url = info_dict['url'] | ||||
|         segment_urls = [info_dict['segment_urls'][0]] if self.params.get('test', False) else info_dict['segment_urls'] | ||||
|         initialization_url = info_dict.get('initialization_url') | ||||
|  | ||||
|         ctx = { | ||||
|             'filename': filename, | ||||
|             'total_frags': len(segments), | ||||
|             'total_frags': len(segment_urls) + (1 if initialization_url else 0), | ||||
|         } | ||||
|  | ||||
|         self._prepare_and_start_frag_download(ctx) | ||||
|  | ||||
|         def combine_url(base_url, target_url): | ||||
|             if re.match(r'^https?://', target_url): | ||||
|                 return target_url | ||||
|             return '%s%s%s' % (base_url, '' if base_url.endswith('/') else '/', target_url) | ||||
|  | ||||
|         segments_filenames = [] | ||||
|  | ||||
|         fragment_retries = self.params.get('fragment_retries', 0) | ||||
|         skip_unavailable_fragments = self.params.get('skip_unavailable_fragments', True) | ||||
|  | ||||
|         def process_segment(segment, tmp_filename, num): | ||||
|             segment_url = segment['url'] | ||||
|             segment_name = 'Frag%d' % num | ||||
|         def append_url_to_file(target_url, tmp_filename, segment_name): | ||||
|             target_filename = '%s-%s' % (tmp_filename, segment_name) | ||||
|             # In DASH, the first segment contains necessary headers to | ||||
|             # generate a valid MP4 file, so always abort for the first segment | ||||
|             fatal = num == 0 or not skip_unavailable_fragments | ||||
|             count = 0 | ||||
|             while count <= fragment_retries: | ||||
|                 try: | ||||
|                     success = ctx['dl'].download(target_filename, {'url': segment_url}) | ||||
|                     success = ctx['dl'].download(target_filename, {'url': combine_url(base_url, target_url)}) | ||||
|                     if not success: | ||||
|                         return False | ||||
|                     down, target_sanitized = sanitize_open(target_filename, 'rb') | ||||
| @@ -51,27 +52,26 @@ class DashSegmentsFD(FragmentFD): | ||||
|                     down.close() | ||||
|                     segments_filenames.append(target_sanitized) | ||||
|                     break | ||||
|                 except compat_urllib_error.HTTPError as err: | ||||
|                 except (compat_urllib_error.HTTPError, ) as err: | ||||
|                     # YouTube may often return 404 HTTP error for a fragment causing the | ||||
|                     # whole download to fail. However if the same fragment is immediately | ||||
|                     # retried with the same request data this usually succeeds (1-2 attempts | ||||
|                     # is usually enough) thus allowing to download the whole file successfully. | ||||
|                     # To be future-proof we will retry all fragments that fail with any | ||||
|                     # HTTP error. | ||||
|                     # So, we will retry all fragments that fail with 404 HTTP error for now. | ||||
|                     if err.code != 404: | ||||
|                         raise | ||||
|                     # Retry fragment | ||||
|                     count += 1 | ||||
|                     if count <= fragment_retries: | ||||
|                         self.report_retry_fragment(err, segment_name, count, fragment_retries) | ||||
|                         self.report_retry_fragment(segment_name, count, fragment_retries) | ||||
|             if count > fragment_retries: | ||||
|                 if not fatal: | ||||
|                     self.report_skip_fragment(segment_name) | ||||
|                     return True | ||||
|                 self.report_error('giving up after %s fragment retries' % fragment_retries) | ||||
|                 return False | ||||
|             return True | ||||
|  | ||||
|         for i, segment in enumerate(segments): | ||||
|             if not process_segment(segment, ctx['tmpfilename'], i): | ||||
|                 return False | ||||
|         if initialization_url: | ||||
|             append_url_to_file(initialization_url, ctx['tmpfilename'], 'Init') | ||||
|         for i, segment_url in enumerate(segment_urls): | ||||
|             append_url_to_file(segment_url, ctx['tmpfilename'], 'Seg%d' % i) | ||||
|  | ||||
|         self._finish_frag_download(ctx) | ||||
|  | ||||
|   | ||||
| @@ -17,7 +17,6 @@ from ..utils import ( | ||||
|     encodeArgument, | ||||
|     handle_youtubedl_headers, | ||||
|     check_executable, | ||||
|     is_outdated_version, | ||||
| ) | ||||
|  | ||||
|  | ||||
| @@ -97,12 +96,6 @@ class CurlFD(ExternalFD): | ||||
|         cmd = [self.exe, '--location', '-o', tmpfilename] | ||||
|         for key, val in info_dict['http_headers'].items(): | ||||
|             cmd += ['--header', '%s: %s' % (key, val)] | ||||
|         cmd += self._bool_option('--continue-at', 'continuedl', '-', '0') | ||||
|         cmd += self._valueless_option('--silent', 'noprogress') | ||||
|         cmd += self._valueless_option('--verbose', 'verbose') | ||||
|         cmd += self._option('--limit-rate', 'ratelimit') | ||||
|         cmd += self._option('--retry', 'retries') | ||||
|         cmd += self._option('--max-filesize', 'max_filesize') | ||||
|         cmd += self._option('--interface', 'source_address') | ||||
|         cmd += self._option('--proxy', 'proxy') | ||||
|         cmd += self._valueless_option('--insecure', 'nocheckcertificate') | ||||
| @@ -110,16 +103,6 @@ class CurlFD(ExternalFD): | ||||
|         cmd += ['--', info_dict['url']] | ||||
|         return cmd | ||||
|  | ||||
|     def _call_downloader(self, tmpfilename, info_dict): | ||||
|         cmd = [encodeArgument(a) for a in self._make_cmd(tmpfilename, info_dict)] | ||||
|  | ||||
|         self._debug_cmd(cmd) | ||||
|  | ||||
|         # curl writes the progress to stderr so don't capture it. | ||||
|         p = subprocess.Popen(cmd) | ||||
|         p.communicate() | ||||
|         return p.returncode | ||||
|  | ||||
|  | ||||
| class AxelFD(ExternalFD): | ||||
|     AVAILABLE_OPT = '-V' | ||||
| @@ -199,15 +182,6 @@ class FFmpegFD(ExternalFD): | ||||
|  | ||||
|         args = [ffpp.executable, '-y'] | ||||
|  | ||||
|         seekable = info_dict.get('_seekable') | ||||
|         if seekable is not None: | ||||
|             # setting -seekable prevents ffmpeg from guessing if the server | ||||
|             # supports seeking (by adding the header `Range: bytes=0-`), which | ||||
|             # can cause problems in some cases | ||||
|             # https://github.com/rg3/youtube-dl/issues/11800#issuecomment-275037127 | ||||
|             # http://trac.ffmpeg.org/ticket/6125#comment:10 | ||||
|             args += ['-seekable', '1' if seekable else '0'] | ||||
|  | ||||
|         args += self._configuration_args() | ||||
|  | ||||
|         # start_time = info_dict.get('start_time') or 0 | ||||
| @@ -230,12 +204,6 @@ class FFmpegFD(ExternalFD): | ||||
|         if proxy: | ||||
|             if not re.match(r'^[\da-zA-Z]+://', proxy): | ||||
|                 proxy = 'http://%s' % proxy | ||||
|  | ||||
|             if proxy.startswith('socks'): | ||||
|                 self.report_warning( | ||||
|                     '%s does not support SOCKS proxies. Downloading is likely to fail. ' | ||||
|                     'Consider adding --hls-prefer-native to your command.' % self.get_basename()) | ||||
|  | ||||
|             # Since December 2015 ffmpeg supports -http_proxy option (see | ||||
|             # http://git.videolan.org/?p=ffmpeg.git;a=commit;h=b4eb1f29ebddd60c41a2eb39f5af701e38e0d3fd) | ||||
|             # We could switch to the following code if we are able to detect version properly | ||||
| @@ -274,9 +242,7 @@ class FFmpegFD(ExternalFD): | ||||
|             if self.params.get('hls_use_mpegts', False) or tmpfilename == '-': | ||||
|                 args += ['-f', 'mpegts'] | ||||
|             else: | ||||
|                 args += ['-f', 'mp4'] | ||||
|                 if (ffpp.basename == 'ffmpeg' and is_outdated_version(ffpp._versions['ffmpeg'], '3.2', False)) and (not info_dict.get('acodec') or info_dict['acodec'].split('.')[0] in ('aac', 'mp4a')): | ||||
|                     args += ['-bsf:a', 'aac_adtstoasc'] | ||||
|                 args += ['-f', 'mp4', '-bsf:a', 'aac_adtstoasc'] | ||||
|         elif protocol == 'rtmp': | ||||
|             args += ['-f', 'flv'] | ||||
|         else: | ||||
| @@ -305,7 +271,6 @@ class FFmpegFD(ExternalFD): | ||||
| class AVconvFD(FFmpegFD): | ||||
|     pass | ||||
|  | ||||
|  | ||||
| _BY_NAME = dict( | ||||
|     (klass.get_basename(), klass) | ||||
|     for name, klass in globals().items() | ||||
|   | ||||
| @@ -314,8 +314,7 @@ class F4mFD(FragmentFD): | ||||
|         man_url = info_dict['url'] | ||||
|         requested_bitrate = info_dict.get('tbr') | ||||
|         self.to_screen('[%s] Downloading f4m manifest' % self.FD_NAME) | ||||
|  | ||||
|         urlh = self.ydl.urlopen(self._prepare_url(info_dict, man_url)) | ||||
|         urlh = self.ydl.urlopen(man_url) | ||||
|         man_url = urlh.geturl() | ||||
|         # Some manifests may be malformed, e.g. prosiebensat1 generated manifests | ||||
|         # (see https://github.com/rg3/youtube-dl/issues/6215#issuecomment-121704244 | ||||
| @@ -388,10 +387,7 @@ class F4mFD(FragmentFD): | ||||
|             url_parsed = base_url_parsed._replace(path=base_url_parsed.path + name, query='&'.join(query)) | ||||
|             frag_filename = '%s-%s' % (ctx['tmpfilename'], name) | ||||
|             try: | ||||
|                 success = ctx['dl'].download(frag_filename, { | ||||
|                     'url': url_parsed.geturl(), | ||||
|                     'http_headers': info_dict.get('http_headers'), | ||||
|                 }) | ||||
|                 success = ctx['dl'].download(frag_filename, {'url': url_parsed.geturl()}) | ||||
|                 if not success: | ||||
|                     return False | ||||
|                 (down, frag_sanitized) = sanitize_open(frag_filename, 'rb') | ||||
|   | ||||
| @@ -6,10 +6,8 @@ import time | ||||
| from .common import FileDownloader | ||||
| from .http import HttpFD | ||||
| from ..utils import ( | ||||
|     error_to_compat_str, | ||||
|     encodeFilename, | ||||
|     sanitize_open, | ||||
|     sanitized_Request, | ||||
| ) | ||||
|  | ||||
|  | ||||
| @@ -24,23 +22,13 @@ class FragmentFD(FileDownloader): | ||||
|  | ||||
|     Available options: | ||||
|  | ||||
|     fragment_retries:   Number of times to retry a fragment for HTTP error (DASH | ||||
|                         and hlsnative only) | ||||
|     skip_unavailable_fragments: | ||||
|                         Skip unavailable fragments (DASH and hlsnative only) | ||||
|     fragment_retries:   Number of times to retry a fragment for HTTP error (DASH only) | ||||
|     """ | ||||
|  | ||||
|     def report_retry_fragment(self, err, fragment_name, count, retries): | ||||
|     def report_retry_fragment(self, fragment_name, count, retries): | ||||
|         self.to_screen( | ||||
|             '[download] Got server HTTP error: %s. Retrying fragment %s (attempt %d of %s)...' | ||||
|             % (error_to_compat_str(err), fragment_name, count, self.format_retries(retries))) | ||||
|  | ||||
|     def report_skip_fragment(self, fragment_name): | ||||
|         self.to_screen('[download] Skipping fragment %s...' % fragment_name) | ||||
|  | ||||
|     def _prepare_url(self, info_dict, url): | ||||
|         headers = info_dict.get('http_headers') | ||||
|         return sanitized_Request(url, None, headers) if headers else url | ||||
|             '[download] Got server HTTP error. Retrying fragment %s (attempt %d of %s)...' | ||||
|             % (fragment_name, count, self.format_retries(retries))) | ||||
|  | ||||
|     def _prepare_and_start_frag_download(self, ctx): | ||||
|         self._prepare_frag_download(ctx) | ||||
| @@ -61,7 +49,6 @@ class FragmentFD(FileDownloader): | ||||
|                 'noprogress': True, | ||||
|                 'ratelimit': self.params.get('ratelimit'), | ||||
|                 'retries': self.params.get('retries', 0), | ||||
|                 'nopart': self.params.get('nopart', False), | ||||
|                 'test': self.params.get('test', False), | ||||
|             } | ||||
|         ) | ||||
|   | ||||
| @@ -13,7 +13,6 @@ from .fragment import FragmentFD | ||||
| from .external import FFmpegFD | ||||
|  | ||||
| from ..compat import ( | ||||
|     compat_urllib_error, | ||||
|     compat_urlparse, | ||||
|     compat_struct_pack, | ||||
| ) | ||||
| @@ -21,7 +20,6 @@ from ..utils import ( | ||||
|     encodeFilename, | ||||
|     sanitize_open, | ||||
|     parse_m3u8_attributes, | ||||
|     update_url_query, | ||||
| ) | ||||
|  | ||||
|  | ||||
| @@ -31,7 +29,7 @@ class HlsFD(FragmentFD): | ||||
|     FD_NAME = 'hlsnative' | ||||
|  | ||||
|     @staticmethod | ||||
|     def can_download(manifest, info_dict): | ||||
|     def can_download(manifest): | ||||
|         UNSUPPORTED_FEATURES = ( | ||||
|             r'#EXT-X-KEY:METHOD=(?!NONE|AES-128)',  # encrypted streams [1] | ||||
|             r'#EXT-X-BYTERANGE',  # playlists composed of byte ranges of media files [2] | ||||
| @@ -53,21 +51,16 @@ class HlsFD(FragmentFD): | ||||
|         ) | ||||
|         check_results = [not re.search(feature, manifest) for feature in UNSUPPORTED_FEATURES] | ||||
|         check_results.append(can_decrypt_frag or '#EXT-X-KEY:METHOD=AES-128' not in manifest) | ||||
|         check_results.append(not info_dict.get('is_live')) | ||||
|         return all(check_results) | ||||
|  | ||||
|     def real_download(self, filename, info_dict): | ||||
|         man_url = info_dict['url'] | ||||
|         self.to_screen('[%s] Downloading m3u8 manifest' % self.FD_NAME) | ||||
|  | ||||
|         manifest = self.ydl.urlopen(self._prepare_url(info_dict, man_url)).read() | ||||
|         manifest = self.ydl.urlopen(man_url).read() | ||||
|  | ||||
|         s = manifest.decode('utf-8', 'ignore') | ||||
|  | ||||
|         if not self.can_download(s, info_dict): | ||||
|             if info_dict.get('extra_param_to_segment_url'): | ||||
|                 self.report_error('pycrypto not found. Please install it.') | ||||
|                 return False | ||||
|         if not self.can_download(s): | ||||
|             self.report_warning( | ||||
|                 'hlsnative has detected features it does not support, ' | ||||
|                 'extraction will be delegated to ffmpeg') | ||||
| @@ -89,14 +82,6 @@ class HlsFD(FragmentFD): | ||||
|  | ||||
|         self._prepare_and_start_frag_download(ctx) | ||||
|  | ||||
|         fragment_retries = self.params.get('fragment_retries', 0) | ||||
|         skip_unavailable_fragments = self.params.get('skip_unavailable_fragments', True) | ||||
|         test = self.params.get('test', False) | ||||
|  | ||||
|         extra_query = None | ||||
|         extra_param_to_segment_url = info_dict.get('extra_param_to_segment_url') | ||||
|         if extra_param_to_segment_url: | ||||
|             extra_query = compat_urlparse.parse_qs(extra_param_to_segment_url) | ||||
|         i = 0 | ||||
|         media_sequence = 0 | ||||
|         decrypt_info = {'METHOD': 'NONE'} | ||||
| @@ -109,40 +94,13 @@ class HlsFD(FragmentFD): | ||||
|                         line | ||||
|                         if re.match(r'^https?://', line) | ||||
|                         else compat_urlparse.urljoin(man_url, line)) | ||||
|                     frag_name = 'Frag%d' % i | ||||
|                     frag_filename = '%s-%s' % (ctx['tmpfilename'], frag_name) | ||||
|                     if extra_query: | ||||
|                         frag_url = update_url_query(frag_url, extra_query) | ||||
|                     count = 0 | ||||
|                     while count <= fragment_retries: | ||||
|                         try: | ||||
|                             success = ctx['dl'].download(frag_filename, { | ||||
|                                 'url': frag_url, | ||||
|                                 'http_headers': info_dict.get('http_headers'), | ||||
|                             }) | ||||
|                             if not success: | ||||
|                                 return False | ||||
|                             down, frag_sanitized = sanitize_open(frag_filename, 'rb') | ||||
|                             frag_content = down.read() | ||||
|                             down.close() | ||||
|                             break | ||||
|                         except compat_urllib_error.HTTPError as err: | ||||
|                             # Unavailable (possibly temporary) fragments may be served. | ||||
|                             # First we try to retry then either skip or abort. | ||||
|                             # See https://github.com/rg3/youtube-dl/issues/10165, | ||||
|                             # https://github.com/rg3/youtube-dl/issues/10448). | ||||
|                             count += 1 | ||||
|                             if count <= fragment_retries: | ||||
|                                 self.report_retry_fragment(err, frag_name, count, fragment_retries) | ||||
|                     if count > fragment_retries: | ||||
|                         if skip_unavailable_fragments: | ||||
|                             i += 1 | ||||
|                             media_sequence += 1 | ||||
|                             self.report_skip_fragment(frag_name) | ||||
|                             continue | ||||
|                         self.report_error( | ||||
|                             'giving up after %s fragment retries' % fragment_retries) | ||||
|                     frag_filename = '%s-Frag%d' % (ctx['tmpfilename'], i) | ||||
|                     success = ctx['dl'].download(frag_filename, {'url': frag_url}) | ||||
|                     if not success: | ||||
|                         return False | ||||
|                     down, frag_sanitized = sanitize_open(frag_filename, 'rb') | ||||
|                     frag_content = down.read() | ||||
|                     down.close() | ||||
|                     if decrypt_info['METHOD'] == 'AES-128': | ||||
|                         iv = decrypt_info.get('IV') or compat_struct_pack('>8xq', media_sequence) | ||||
|                         frag_content = AES.new( | ||||
| @@ -150,7 +108,7 @@ class HlsFD(FragmentFD): | ||||
|                     ctx['dest_stream'].write(frag_content) | ||||
|                     frags_filenames.append(frag_sanitized) | ||||
|                     # We only download the first fragment during the test | ||||
|                     if test: | ||||
|                     if self.params.get('test', False): | ||||
|                         break | ||||
|                     i += 1 | ||||
|                     media_sequence += 1 | ||||
| @@ -158,12 +116,10 @@ class HlsFD(FragmentFD): | ||||
|                     decrypt_info = parse_m3u8_attributes(line[11:]) | ||||
|                     if decrypt_info['METHOD'] == 'AES-128': | ||||
|                         if 'IV' in decrypt_info: | ||||
|                             decrypt_info['IV'] = binascii.unhexlify(decrypt_info['IV'][2:].zfill(32)) | ||||
|                             decrypt_info['IV'] = binascii.unhexlify(decrypt_info['IV'][2:]) | ||||
|                         if not re.match(r'^https?://', decrypt_info['URI']): | ||||
|                             decrypt_info['URI'] = compat_urlparse.urljoin( | ||||
|                                 man_url, decrypt_info['URI']) | ||||
|                         if extra_query: | ||||
|                             decrypt_info['URI'] = update_url_query(decrypt_info['URI'], extra_query) | ||||
|                         decrypt_info['KEY'] = self.ydl.urlopen(decrypt_info['URI']).read() | ||||
|                 elif line.startswith('#EXT-X-MEDIA-SEQUENCE'): | ||||
|                     media_sequence = int(line[22:]) | ||||
|   | ||||
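Review note on the `IV` lines in the hunk above: the two variants differ only in `.zfill(32)`. The sketch below is a standalone illustration, not code from the repository, of why left-padding matters when a playlist gives a short or odd-length hex IV. The helper names `parse_iv_attribute` and `default_iv` are hypothetical. The `'>8xq'` format string is the one used in the hunk.

```python
import binascii
import struct


def parse_iv_attribute(iv_attr):
    """Turn an EXT-X-KEY IV attribute such as '0x1f' into 16 raw bytes."""
    hex_digits = iv_attr[2:]  # drop the leading '0x' / '0X'
    # Left-pad to 32 hex digits so short or odd-length values still decode
    # to a full 128-bit AES block.
    return binascii.unhexlify(hex_digits.zfill(32))


def default_iv(media_sequence):
    """Fallback IV: the media sequence number as a big-endian 128-bit value."""
    # '>8xq' = 8 zero pad bytes followed by a signed 64-bit big-endian integer.
    return struct.pack('>8xq', media_sequence)


short_iv = parse_iv_attribute('0x1f')  # padded to 16 bytes
seq_iv = default_iv(7)                 # 15 zero bytes, then 0x07
```

Without the padding, `binascii.unhexlify('1')` raises `binascii.Error` because the input has an odd number of hex digits. A short even-length value would decode, but to fewer than 16 bytes.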
| @@ -13,9 +13,6 @@ from ..utils import ( | ||||
|     encodeFilename, | ||||
|     sanitize_open, | ||||
|     sanitized_Request, | ||||
|     write_xattr, | ||||
|     XAttrMetadataError, | ||||
|     XAttrUnavailableError, | ||||
| ) | ||||
|  | ||||
|  | ||||
| @@ -182,8 +179,9 @@ class HttpFD(FileDownloader): | ||||
|  | ||||
|                 if self.params.get('xattr_set_filesize', False) and data_len is not None: | ||||
|                     try: | ||||
|                         write_xattr(tmpfilename, 'user.ytdl.filesize', str(data_len).encode('utf-8')) | ||||
|                     except (XAttrUnavailableError, XAttrMetadataError) as err: | ||||
|                         import xattr | ||||
|                         xattr.setxattr(tmpfilename, 'user.ytdl.filesize', str(data_len)) | ||||
|                     except(OSError, IOError, ImportError) as err: | ||||
|                         self.report_error('unable to set filesize xattr: %s' % str(err)) | ||||
|  | ||||
|             try: | ||||
|   | ||||
| @@ -1,271 +0,0 @@ | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| import os | ||||
| import time | ||||
| import struct | ||||
| import binascii | ||||
| import io | ||||
|  | ||||
| from .fragment import FragmentFD | ||||
| from ..compat import compat_urllib_error | ||||
| from ..utils import ( | ||||
|     sanitize_open, | ||||
|     encodeFilename, | ||||
| ) | ||||
|  | ||||
|  | ||||
| u8 = struct.Struct(b'>B') | ||||
| u88 = struct.Struct(b'>Bx') | ||||
| u16 = struct.Struct(b'>H') | ||||
| u1616 = struct.Struct(b'>Hxx') | ||||
| u32 = struct.Struct(b'>I') | ||||
| u64 = struct.Struct(b'>Q') | ||||
|  | ||||
| s88 = struct.Struct(b'>bx') | ||||
| s16 = struct.Struct(b'>h') | ||||
| s1616 = struct.Struct(b'>hxx') | ||||
| s32 = struct.Struct(b'>i') | ||||
|  | ||||
| unity_matrix = (s32.pack(0x10000) + s32.pack(0) * 3) * 2 + s32.pack(0x40000000) | ||||
|  | ||||
| TRACK_ENABLED = 0x1 | ||||
| TRACK_IN_MOVIE = 0x2 | ||||
| TRACK_IN_PREVIEW = 0x4 | ||||
|  | ||||
| SELF_CONTAINED = 0x1 | ||||
|  | ||||
|  | ||||
| def box(box_type, payload): | ||||
|     return u32.pack(8 + len(payload)) + box_type + payload | ||||
|  | ||||
|  | ||||
| def full_box(box_type, version, flags, payload): | ||||
|     return box(box_type, u8.pack(version) + u32.pack(flags)[1:] + payload) | ||||
|  | ||||
|  | ||||
| def write_piff_header(stream, params): | ||||
|     track_id = params['track_id'] | ||||
|     fourcc = params['fourcc'] | ||||
|     duration = params['duration'] | ||||
|     timescale = params.get('timescale', 10000000) | ||||
|     language = params.get('language', 'und') | ||||
|     height = params.get('height', 0) | ||||
|     width = params.get('width', 0) | ||||
|     is_audio = width == 0 and height == 0 | ||||
|     creation_time = modification_time = int(time.time()) | ||||
|  | ||||
|     ftyp_payload = b'isml'  # major brand | ||||
|     ftyp_payload += u32.pack(1)  # minor version | ||||
|     ftyp_payload += b'piff' + b'iso2'  # compatible brands | ||||
|     stream.write(box(b'ftyp', ftyp_payload))  # File Type Box | ||||
|  | ||||
|     mvhd_payload = u64.pack(creation_time) | ||||
|     mvhd_payload += u64.pack(modification_time) | ||||
|     mvhd_payload += u32.pack(timescale) | ||||
|     mvhd_payload += u64.pack(duration) | ||||
|     mvhd_payload += s1616.pack(1)  # rate | ||||
|     mvhd_payload += s88.pack(1)  # volume | ||||
|     mvhd_payload += u16.pack(0)  # reserved | ||||
|     mvhd_payload += u32.pack(0) * 2  # reserved | ||||
|     mvhd_payload += unity_matrix | ||||
|     mvhd_payload += u32.pack(0) * 6  # pre defined | ||||
|     mvhd_payload += u32.pack(0xffffffff)  # next track id | ||||
|     moov_payload = full_box(b'mvhd', 1, 0, mvhd_payload)  # Movie Header Box | ||||
|  | ||||
|     tkhd_payload = u64.pack(creation_time) | ||||
|     tkhd_payload += u64.pack(modification_time) | ||||
|     tkhd_payload += u32.pack(track_id)  # track id | ||||
|     tkhd_payload += u32.pack(0)  # reserved | ||||
|     tkhd_payload += u64.pack(duration) | ||||
|     tkhd_payload += u32.pack(0) * 2  # reserved | ||||
|     tkhd_payload += s16.pack(0)  # layer | ||||
|     tkhd_payload += s16.pack(0)  # alternate group | ||||
|     tkhd_payload += s88.pack(1 if is_audio else 0)  # volume | ||||
|     tkhd_payload += u16.pack(0)  # reserved | ||||
|     tkhd_payload += unity_matrix | ||||
|     tkhd_payload += u1616.pack(width) | ||||
|     tkhd_payload += u1616.pack(height) | ||||
|     trak_payload = full_box(b'tkhd', 1, TRACK_ENABLED | TRACK_IN_MOVIE | TRACK_IN_PREVIEW, tkhd_payload)  # Track Header Box | ||||
|  | ||||
|     mdhd_payload = u64.pack(creation_time) | ||||
|     mdhd_payload += u64.pack(modification_time) | ||||
|     mdhd_payload += u32.pack(timescale) | ||||
|     mdhd_payload += u64.pack(duration) | ||||
|     mdhd_payload += u16.pack(((ord(language[0]) - 0x60) << 10) | ((ord(language[1]) - 0x60) << 5) | (ord(language[2]) - 0x60)) | ||||
|     mdhd_payload += u16.pack(0)  # pre defined | ||||
|     mdia_payload = full_box(b'mdhd', 1, 0, mdhd_payload)  # Media Header Box | ||||
|  | ||||
|     hdlr_payload = u32.pack(0)  # pre defined | ||||
|     hdlr_payload += b'soun' if is_audio else b'vide'  # handler type | ||||
|     hdlr_payload += u32.pack(0) * 3  # reserved | ||||
|     hdlr_payload += (b'Sound' if is_audio else b'Video') + b'Handler\0'  # name | ||||
|     mdia_payload += full_box(b'hdlr', 0, 0, hdlr_payload)  # Handler Reference Box | ||||
|  | ||||
|     if is_audio: | ||||
|         smhd_payload = s88.pack(0)  # balance | ||||
|         smhd_payload = u16.pack(0)  # reserved | ||||
|         media_header_box = full_box(b'smhd', 0, 0, smhd_payload)  # Sound Media Header | ||||
|     else: | ||||
|         vmhd_payload = u16.pack(0)  # graphics mode | ||||
|         vmhd_payload += u16.pack(0) * 3  # opcolor | ||||
|         media_header_box = full_box(b'vmhd', 0, 1, vmhd_payload)  # Video Media Header | ||||
|     minf_payload = media_header_box | ||||
|  | ||||
|     dref_payload = u32.pack(1)  # entry count | ||||
|     dref_payload += full_box(b'url ', 0, SELF_CONTAINED, b'')  # Data Entry URL Box | ||||
|     dinf_payload = full_box(b'dref', 0, 0, dref_payload)  # Data Reference Box | ||||
|     minf_payload += box(b'dinf', dinf_payload)  # Data Information Box | ||||
|  | ||||
|     stsd_payload = u32.pack(1)  # entry count | ||||
|  | ||||
|     sample_entry_payload = u8.pack(0) * 6  # reserved | ||||
|     sample_entry_payload += u16.pack(1)  # data reference index | ||||
|     if is_audio: | ||||
|         sample_entry_payload += u32.pack(0) * 2  # reserved | ||||
|         sample_entry_payload += u16.pack(params.get('channels', 2)) | ||||
|         sample_entry_payload += u16.pack(params.get('bits_per_sample', 16)) | ||||
|         sample_entry_payload += u16.pack(0)  # pre defined | ||||
|         sample_entry_payload += u16.pack(0)  # reserved | ||||
|         sample_entry_payload += u1616.pack(params['sampling_rate']) | ||||
|  | ||||
|         if fourcc == 'AACL': | ||||
|             sample_entry_box = box(b'mp4a', sample_entry_payload) | ||||
|     else: | ||||
|         sample_entry_payload = sample_entry_payload | ||||
|         sample_entry_payload += u16.pack(0)  # pre defined | ||||
|         sample_entry_payload += u16.pack(0)  # reserved | ||||
|         sample_entry_payload += u32.pack(0) * 3  # pre defined | ||||
|         sample_entry_payload += u16.pack(width) | ||||
|         sample_entry_payload += u16.pack(height) | ||||
|         sample_entry_payload += u1616.pack(0x48)  # horiz resolution 72 dpi | ||||
|         sample_entry_payload += u1616.pack(0x48)  # vert resolution 72 dpi | ||||
|         sample_entry_payload += u32.pack(0)  # reserved | ||||
|         sample_entry_payload += u16.pack(1)  # frame count | ||||
|         sample_entry_payload += u8.pack(0) * 32  # compressor name | ||||
|         sample_entry_payload += u16.pack(0x18)  # depth | ||||
|         sample_entry_payload += s16.pack(-1)  # pre defined | ||||
|  | ||||
|         codec_private_data = binascii.unhexlify(params['codec_private_data']) | ||||
|         if fourcc in ('H264', 'AVC1'): | ||||
|             sps, pps = codec_private_data.split(u32.pack(1))[1:] | ||||
|             avcc_payload = u8.pack(1)  # configuration version | ||||
|             avcc_payload += sps[1:4]  # avc profile indication + profile compatibility + avc level indication | ||||
|             avcc_payload += u8.pack(0xfc | (params.get('nal_unit_length_field', 4) - 1))  # complete representation (1) + reserved (11111) + length size minus one | ||||
|             avcc_payload += u8.pack(1)  # reserved (0) + number of sps (0000001) | ||||
|             avcc_payload += u16.pack(len(sps)) | ||||
|             avcc_payload += sps | ||||
|             avcc_payload += u8.pack(1)  # number of pps | ||||
|             avcc_payload += u16.pack(len(pps)) | ||||
|             avcc_payload += pps | ||||
|             sample_entry_payload += box(b'avcC', avcc_payload)  # AVC Decoder Configuration Record | ||||
|             sample_entry_box = box(b'avc1', sample_entry_payload)  # AVC Simple Entry | ||||
|     stsd_payload += sample_entry_box | ||||
|  | ||||
|     stbl_payload = full_box(b'stsd', 0, 0, stsd_payload)  # Sample Description Box | ||||
|  | ||||
|     stts_payload = u32.pack(0)  # entry count | ||||
|     stbl_payload += full_box(b'stts', 0, 0, stts_payload)  # Decoding Time to Sample Box | ||||
|  | ||||
|     stsc_payload = u32.pack(0)  # entry count | ||||
|     stbl_payload += full_box(b'stsc', 0, 0, stsc_payload)  # Sample To Chunk Box | ||||
|  | ||||
|     stco_payload = u32.pack(0)  # entry count | ||||
|     stbl_payload += full_box(b'stco', 0, 0, stco_payload)  # Chunk Offset Box | ||||
|  | ||||
|     minf_payload += box(b'stbl', stbl_payload)  # Sample Table Box | ||||
|  | ||||
|     mdia_payload += box(b'minf', minf_payload)  # Media Information Box | ||||
|  | ||||
|     trak_payload += box(b'mdia', mdia_payload)  # Media Box | ||||
|  | ||||
|     moov_payload += box(b'trak', trak_payload)  # Track Box | ||||
|  | ||||
|     mehd_payload = u64.pack(duration) | ||||
|     mvex_payload = full_box(b'mehd', 1, 0, mehd_payload)  # Movie Extends Header Box | ||||
|  | ||||
|     trex_payload = u32.pack(track_id)  # track id | ||||
|     trex_payload += u32.pack(1)  # default sample description index | ||||
|     trex_payload += u32.pack(0)  # default sample duration | ||||
|     trex_payload += u32.pack(0)  # default sample size | ||||
|     trex_payload += u32.pack(0)  # default sample flags | ||||
|     mvex_payload += full_box(b'trex', 0, 0, trex_payload)  # Track Extends Box | ||||
|  | ||||
|     moov_payload += box(b'mvex', mvex_payload)  # Movie Extends Box | ||||
|     stream.write(box(b'moov', moov_payload))  # Movie Box | ||||
|  | ||||
|  | ||||
| def extract_box_data(data, box_sequence): | ||||
|     data_reader = io.BytesIO(data) | ||||
|     while True: | ||||
|         box_size = u32.unpack(data_reader.read(4))[0] | ||||
|         box_type = data_reader.read(4) | ||||
|         if box_type == box_sequence[0]: | ||||
|             box_data = data_reader.read(box_size - 8) | ||||
|             if len(box_sequence) == 1: | ||||
|                 return box_data | ||||
|             return extract_box_data(box_data, box_sequence[1:]) | ||||
|         data_reader.seek(box_size - 8, 1) | ||||
|  | ||||
|  | ||||
| class IsmFD(FragmentFD): | ||||
|     """ | ||||
|     Download segments in a ISM manifest | ||||
|     """ | ||||
|  | ||||
|     FD_NAME = 'ism' | ||||
|  | ||||
|     def real_download(self, filename, info_dict): | ||||
|         segments = info_dict['fragments'][:1] if self.params.get( | ||||
|             'test', False) else info_dict['fragments'] | ||||
|  | ||||
|         ctx = { | ||||
|             'filename': filename, | ||||
|             'total_frags': len(segments), | ||||
|         } | ||||
|  | ||||
|         self._prepare_and_start_frag_download(ctx) | ||||
|  | ||||
|         segments_filenames = [] | ||||
|  | ||||
|         fragment_retries = self.params.get('fragment_retries', 0) | ||||
|         skip_unavailable_fragments = self.params.get('skip_unavailable_fragments', True) | ||||
|  | ||||
|         track_written = False | ||||
|         for i, segment in enumerate(segments): | ||||
|             segment_url = segment['url'] | ||||
|             segment_name = 'Frag%d' % i | ||||
|             target_filename = '%s-%s' % (ctx['tmpfilename'], segment_name) | ||||
|             count = 0 | ||||
|             while count <= fragment_retries: | ||||
|                 try: | ||||
|                     success = ctx['dl'].download(target_filename, {'url': segment_url}) | ||||
|                     if not success: | ||||
|                         return False | ||||
|                     down, target_sanitized = sanitize_open(target_filename, 'rb') | ||||
|                     down_data = down.read() | ||||
|                     if not track_written: | ||||
|                         tfhd_data = extract_box_data(down_data, [b'moof', b'traf', b'tfhd']) | ||||
|                         info_dict['_download_params']['track_id'] = u32.unpack(tfhd_data[4:8])[0] | ||||
|                         write_piff_header(ctx['dest_stream'], info_dict['_download_params']) | ||||
|                         track_written = True | ||||
|                     ctx['dest_stream'].write(down_data) | ||||
|                     down.close() | ||||
|                     segments_filenames.append(target_sanitized) | ||||
|                     break | ||||
|                 except compat_urllib_error.HTTPError as err: | ||||
|                     count += 1 | ||||
|                     if count <= fragment_retries: | ||||
|                         self.report_retry_fragment(err, segment_name, count, fragment_retries) | ||||
|             if count > fragment_retries: | ||||
|                 if skip_unavailable_fragments: | ||||
|                     self.report_skip_fragment(segment_name) | ||||
|                     continue | ||||
|                 self.report_error('giving up after %s fragment retries' % fragment_retries) | ||||
|                 return False | ||||
|  | ||||
|         self._finish_frag_download(ctx) | ||||
|  | ||||
|         for segment_file in segments_filenames: | ||||
|             os.remove(encodeFilename(segment_file)) | ||||
|  | ||||
|         return True | ||||
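Review note on the box helpers in the removed file above: `extract_box_data` loops until it finds the requested box, so a fragment that lacks it would end in a `struct.error` once the reader runs out of bytes. The sketch below copies `box` and `full_box` from the hunk and adds a hypothetical variant, `find_box`, that returns `None` instead. The sample `moof` bytes are made up for illustration and are not a complete fragment.

```python
import io
import struct

u8 = struct.Struct('>B')
u32 = struct.Struct('>I')


def box(box_type, payload):
    # 4-byte big-endian size (header included) + 4-byte type + payload
    return u32.pack(8 + len(payload)) + box_type + payload


def full_box(box_type, version, flags, payload):
    # A "full box" prepends a 1-byte version and 3-byte flags to the payload
    return box(box_type, u8.pack(version) + u32.pack(flags)[1:] + payload)


def find_box(data, box_sequence):
    """Return the payload at the nested box path, or None if it is absent."""
    reader = io.BytesIO(data)
    while True:
        header = reader.read(8)
        if len(header) < 8:
            return None  # ran out of data without finding the box
        box_size = u32.unpack(header[:4])[0]
        box_type = header[4:]
        if box_size < 8:
            return None  # malformed or extended-size box; not handled here
        if box_type == box_sequence[0]:
            payload = reader.read(box_size - 8)
            if len(box_sequence) == 1:
                return payload
            return find_box(payload, box_sequence[1:])
        reader.seek(box_size - 8, 1)  # skip this box's payload


# Made-up sample: moof > (mfhd, traf > tfhd) with track id 42
tfhd = full_box(b'tfhd', 0, 0, u32.pack(42))
moof = box(b'moof', full_box(b'mfhd', 0, 0, u32.pack(1)) + box(b'traf', tfhd))

tfhd_payload = find_box(moof, [b'moof', b'traf', b'tfhd'])
track_id = u32.unpack(tfhd_payload[4:8])[0]  # skip version + flags
missing = find_box(moof, [b'moof', b'mdat'])
```

The `tfhd_payload[4:8]` slice mirrors how `real_download` reads the track id before writing the header.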
| @@ -7,13 +7,12 @@ from ..utils import ( | ||||
|     ExtractorError, | ||||
|     js_to_json, | ||||
|     int_or_none, | ||||
|     parse_iso8601, | ||||
| ) | ||||
|  | ||||
|  | ||||
| class ABCIE(InfoExtractor): | ||||
|     IE_NAME = 'abc.net.au' | ||||
|     _VALID_URL = r'https?://(?:www\.)?abc\.net\.au/news/(?:[^/]+/){1,2}(?P<id>\d+)' | ||||
|     _VALID_URL = r'https?://www\.abc\.net\.au/news/(?:[^/]+/){1,2}(?P<id>\d+)' | ||||
|  | ||||
|     _TESTS = [{ | ||||
|         'url': 'http://www.abc.net.au/news/2014-11-05/australia-to-staff-ebola-treatment-centre-in-sierra-leone/5868334', | ||||
| @@ -94,59 +93,3 @@ class ABCIE(InfoExtractor): | ||||
|             'description': self._og_search_description(webpage), | ||||
|             'thumbnail': self._og_search_thumbnail(webpage), | ||||
|         } | ||||
|  | ||||
|  | ||||
| class ABCIViewIE(InfoExtractor): | ||||
|     IE_NAME = 'abc.net.au:iview' | ||||
|     _VALID_URL = r'https?://iview\.abc\.net\.au/programs/[^/]+/(?P<id>[^/?#]+)' | ||||
|  | ||||
|     # ABC iview programs are normally available for 14 days only. | ||||
|     _TESTS = [{ | ||||
|         'url': 'http://iview.abc.net.au/programs/diaries-of-a-broken-mind/ZX9735A001S00', | ||||
|         'md5': 'cde42d728b3b7c2b32b1b94b4a548afc', | ||||
|         'info_dict': { | ||||
|             'id': 'ZX9735A001S00', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'Diaries Of A Broken Mind', | ||||
|             'description': 'md5:7de3903874b7a1be279fe6b68718fc9e', | ||||
|             'upload_date': '20161010', | ||||
|             'uploader_id': 'abc2', | ||||
|             'timestamp': 1476064920, | ||||
|         }, | ||||
|         'skip': 'Video gone', | ||||
|     }] | ||||
|  | ||||
|     def _real_extract(self, url): | ||||
|         video_id = self._match_id(url) | ||||
|         webpage = self._download_webpage(url, video_id) | ||||
|         video_params = self._parse_json(self._search_regex( | ||||
|             r'videoParams\s*=\s*({.+?});', webpage, 'video params'), video_id) | ||||
|         title = video_params.get('title') or video_params['seriesTitle'] | ||||
|         stream = next(s for s in video_params['playlist'] if s.get('type') == 'program') | ||||
|  | ||||
|         formats = self._extract_akamai_formats(stream['hds-unmetered'], video_id) | ||||
|         self._sort_formats(formats) | ||||
|  | ||||
|         subtitles = {} | ||||
|         src_vtt = stream.get('captions', {}).get('src-vtt') | ||||
|         if src_vtt: | ||||
|             subtitles['en'] = [{ | ||||
|                 'url': src_vtt, | ||||
|                 'ext': 'vtt', | ||||
|             }] | ||||
|  | ||||
|         return { | ||||
|             'id': video_id, | ||||
|             'title': title, | ||||
|             'description': self._html_search_meta(['og:description', 'twitter:description'], webpage), | ||||
|             'thumbnail': self._html_search_meta(['og:image', 'twitter:image:src'], webpage), | ||||
|             'duration': int_or_none(video_params.get('eventDuration')), | ||||
|             'timestamp': parse_iso8601(video_params.get('pubDate'), ' '), | ||||
|             'series': video_params.get('seriesTitle'), | ||||
|             'series_id': video_params.get('seriesHouseNumber') or video_id[:7], | ||||
|             'episode_number': int_or_none(self._html_search_meta('episodeNumber', webpage, default=None)), | ||||
|             'episode': self._html_search_meta('episode_title', webpage, default=None), | ||||
|             'uploader_id': video_params.get('channel'), | ||||
|             'formats': formats, | ||||
|             'subtitles': subtitles, | ||||
|         } | ||||
|   | ||||
| @@ -1,19 +1,13 @@ | ||||
| # coding: utf-8 | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| import re | ||||
|  | ||||
| from .common import InfoExtractor | ||||
| from ..utils import ( | ||||
|     int_or_none, | ||||
|     parse_iso8601, | ||||
| ) | ||||
| from ..utils import parse_iso8601 | ||||
|  | ||||
|  | ||||
| class ABCOTVSIE(InfoExtractor): | ||||
|     IE_NAME = 'abcotvs' | ||||
|     IE_DESC = 'ABC Owned Television Stations' | ||||
|     _VALID_URL = r'https?://(?:abc(?:7(?:news|ny|chicago)?|11|13|30)|6abc)\.com(?:/[^/]+/(?P<display_id>[^/]+))?/(?P<id>\d+)' | ||||
| class Abc7NewsIE(InfoExtractor): | ||||
|     _VALID_URL = r'https?://abc7news\.com(?:/[^/]+/(?P<display_id>[^/]+))?/(?P<id>\d+)' | ||||
|     _TESTS = [ | ||||
|         { | ||||
|             'url': 'http://abc7news.com/entertainment/east-bay-museum-celebrates-vintage-synthesizers/472581/', | ||||
| @@ -21,9 +15,9 @@ class ABCOTVSIE(InfoExtractor): | ||||
|                 'id': '472581', | ||||
|                 'display_id': 'east-bay-museum-celebrates-vintage-synthesizers', | ||||
|                 'ext': 'mp4', | ||||
|                 'title': 'East Bay museum celebrates vintage synthesizers', | ||||
|                 'title': 'East Bay museum celebrates history of synthesized music', | ||||
|                 'description': 'md5:a4f10fb2f2a02565c1749d4adbab4b10', | ||||
|                 'thumbnail': r're:^https?://.*\.jpg$', | ||||
|                 'thumbnail': 're:^https?://.*\.jpg$', | ||||
|                 'timestamp': 1421123075, | ||||
|                 'upload_date': '20150113', | ||||
|                 'uploader': 'Jonathan Bloom', | ||||
| @@ -47,7 +41,7 @@ class ABCOTVSIE(InfoExtractor): | ||||
|         webpage = self._download_webpage(url, display_id) | ||||
|  | ||||
|         m3u8 = self._html_search_meta( | ||||
|             'contentURL', webpage, 'm3u8 url', fatal=True).split('?')[0] | ||||
|             'contentURL', webpage, 'm3u8 url', fatal=True) | ||||
|  | ||||
|         formats = self._extract_m3u8_formats(m3u8, display_id, 'mp4') | ||||
|         self._sort_formats(formats) | ||||
| @@ -72,41 +66,3 @@ class ABCOTVSIE(InfoExtractor): | ||||
|             'uploader': uploader, | ||||
|             'formats': formats, | ||||
|         } | ||||
|  | ||||
|  | ||||
| class ABCOTVSClipsIE(InfoExtractor): | ||||
|     IE_NAME = 'abcotvs:clips' | ||||
|     _VALID_URL = r'https?://clips\.abcotvs\.com/(?:[^/]+/)*video/(?P<id>\d+)' | ||||
|     _TEST = { | ||||
|         'url': 'https://clips.abcotvs.com/kabc/video/214814', | ||||
|         'info_dict': { | ||||
|             'id': '214814', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'SpaceX launch pad explosion destroys rocket, satellite', | ||||
|             'description': 'md5:9f186e5ad8f490f65409965ee9c7be1b', | ||||
|             'upload_date': '20160901', | ||||
|             'timestamp': 1472756695, | ||||
|         }, | ||||
|         'params': { | ||||
|             # m3u8 download | ||||
|             'skip_download': True, | ||||
|         }, | ||||
|     } | ||||
|  | ||||
|     def _real_extract(self, url): | ||||
|         video_id = self._match_id(url) | ||||
|         video_data = self._download_json('https://clips.abcotvs.com/vogo/video/getByIds?ids=' + video_id, video_id)['results'][0] | ||||
|         title = video_data['title'] | ||||
|         formats = self._extract_m3u8_formats( | ||||
|             video_data['videoURL'].split('?')[0], video_id, 'mp4') | ||||
|         self._sort_formats(formats) | ||||
|  | ||||
|         return { | ||||
|             'id': video_id, | ||||
|             'title': title, | ||||
|             'description': video_data.get('description'), | ||||
|             'thumbnail': video_data.get('thumbnailURL'), | ||||
|             'duration': int_or_none(video_data.get('duration')), | ||||
|             'timestamp': int_or_none(video_data.get('pubDate')), | ||||
|             'formats': formats, | ||||
|         } | ||||
| @@ -12,7 +12,7 @@ from ..compat import compat_urlparse | ||||
|  | ||||
| class AbcNewsVideoIE(AMPIE): | ||||
|     IE_NAME = 'abcnews:video' | ||||
|     _VALID_URL = r'https?://abcnews\.go\.com/[^/]+/video/(?P<display_id>[0-9a-z-]+)-(?P<id>\d+)' | ||||
|     _VALID_URL = 'http://abcnews.go.com/[^/]+/video/(?P<display_id>[0-9a-z-]+)-(?P<id>\d+)' | ||||
|  | ||||
|     _TESTS = [{ | ||||
|         'url': 'http://abcnews.go.com/ThisWeek/video/week-exclusive-irans-foreign-minister-zarif-20411932', | ||||
| @@ -23,7 +23,7 @@ class AbcNewsVideoIE(AMPIE): | ||||
|             'title': '\'This Week\' Exclusive: Iran\'s Foreign Minister Zarif', | ||||
|             'description': 'George Stephanopoulos goes one-on-one with Iranian Foreign Minister Dr. Javad Zarif.', | ||||
|             'duration': 180, | ||||
|             'thumbnail': r're:^https?://.*\.jpg$', | ||||
|             'thumbnail': 're:^https?://.*\.jpg$', | ||||
|         }, | ||||
|         'params': { | ||||
|             # m3u8 download | ||||
| @@ -49,7 +49,7 @@ class AbcNewsVideoIE(AMPIE): | ||||
|  | ||||
| class AbcNewsIE(InfoExtractor): | ||||
|     IE_NAME = 'abcnews' | ||||
|     _VALID_URL = r'https?://abcnews\.go\.com/(?:[^/]+/)+(?P<display_id>[0-9a-z-]+)/story\?id=(?P<id>\d+)' | ||||
|     _VALID_URL = 'https?://abcnews\.go\.com/(?:[^/]+/)+(?P<display_id>[0-9a-z-]+)/story\?id=(?P<id>\d+)' | ||||
|  | ||||
|     _TESTS = [{ | ||||
|         'url': 'http://abcnews.go.com/Blotter/News/dramatic-video-rare-death-job-america/story?id=10498713#.UIhwosWHLjY', | ||||
| @@ -59,7 +59,7 @@ class AbcNewsIE(InfoExtractor): | ||||
|             'display_id': 'dramatic-video-rare-death-job-america', | ||||
|             'title': 'Occupational Hazards', | ||||
|             'description': 'Nightline investigates the dangers that lurk at various jobs.', | ||||
|             'thumbnail': r're:^https?://.*\.jpg$', | ||||
|             'thumbnail': 're:^https?://.*\.jpg$', | ||||
|             'upload_date': '20100428', | ||||
|             'timestamp': 1272412800, | ||||
|         }, | ||||
|   | ||||
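Review note on the `_VALID_URL` lines in the hunks above: one variant of the `AbcNewsVideoIE` pattern leaves the dots unescaped and accepts only `http://`. The sketch below is a standalone check, not repository code. It shows how an unescaped `.` lets a look-alike host match. The look-alike URL is invented for the demonstration, and both patterns are written as raw strings here.

```python
import re

# Pattern with unescaped dots and a fixed 'http://' scheme
LOOSE_URL = r'http://abcnews.go.com/[^/]+/video/(?P<display_id>[0-9a-z-]+)-(?P<id>\d+)'
# Pattern with escaped dots and an optional 's' in the scheme
STRICT_URL = r'https?://abcnews\.go\.com/[^/]+/video/(?P<display_id>[0-9a-z-]+)-(?P<id>\d+)'

path = '/ThisWeek/video/week-exclusive-irans-foreign-minister-zarif-20411932'
real_url = 'https://abcnews.go.com' + path
lookalike_url = 'http://abcnewsXgoYcom' + path  # invented host

loose_accepts_lookalike = re.match(LOOSE_URL, lookalike_url) is not None
loose_accepts_https = re.match(LOOSE_URL, real_url) is not None
strict_accepts_lookalike = re.match(STRICT_URL, lookalike_url) is not None

strict_match = re.match(STRICT_URL, real_url)
video_id = strict_match.group('id')
display_id = strict_match.group('display_id')
```

The raw-string prefix does not change the pattern here, since `'\.'` and `r'\.'` hold the same characters, but it avoids relying on Python passing unknown escape sequences through unchanged.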
| @@ -8,7 +8,6 @@ from .common import InfoExtractor | ||||
| from ..compat import compat_str | ||||
| from ..utils import ( | ||||
|     int_or_none, | ||||
|     parse_iso8601, | ||||
|     OnDemandPagedList, | ||||
| ) | ||||
|  | ||||
| @@ -16,33 +15,18 @@ from ..utils import ( | ||||
| class ACastIE(InfoExtractor): | ||||
|     IE_NAME = 'acast' | ||||
|     _VALID_URL = r'https?://(?:www\.)?acast\.com/(?P<channel>[^/]+)/(?P<id>[^/#?]+)' | ||||
|     _TESTS = [{ | ||||
|         # test with one bling | ||||
|     _TEST = { | ||||
|         'url': 'https://www.acast.com/condenasttraveler/-where-are-you-taipei-101-taiwan', | ||||
|         'md5': 'ada3de5a1e3a2a381327d749854788bb', | ||||
|         'info_dict': { | ||||
|             'id': '57de3baa-4bb0-487e-9418-2692c1277a34', | ||||
|             'ext': 'mp3', | ||||
|             'title': '"Where Are You?": Taipei 101, Taiwan', | ||||
|             'timestamp': 1196172000, | ||||
|             'upload_date': '20071127', | ||||
|             'timestamp': 1196172000000, | ||||
|             'description': 'md5:a0b4ef3634e63866b542e5b1199a1a0e', | ||||
|             'duration': 211, | ||||
|         } | ||||
|     }, { | ||||
|         # test with multiple blings | ||||
|         'url': 'https://www.acast.com/sparpodcast/2.raggarmordet-rosterurdetforflutna', | ||||
|         'md5': '55c0097badd7095f494c99a172f86501', | ||||
|         'info_dict': { | ||||
|             'id': '2a92b283-1a75-4ad8-8396-499c641de0d9', | ||||
|             'ext': 'mp3', | ||||
|             'title': '2. Raggarmordet - Röster ur det förflutna', | ||||
|             'timestamp': 1477346700, | ||||
|             'upload_date': '20161024', | ||||
|             'description': 'md5:4f81f6d8cf2e12ee21a321d8bca32db4', | ||||
|             'duration': 2797, | ||||
|         } | ||||
|     }] | ||||
|     } | ||||
|  | ||||
|     def _real_extract(self, url): | ||||
|         channel, display_id = re.match(self._VALID_URL, url).groups() | ||||
| @@ -51,11 +35,11 @@ class ACastIE(InfoExtractor): | ||||
|         return { | ||||
|             'id': compat_str(cast_data['id']), | ||||
|             'display_id': display_id, | ||||
|             'url': [b['audio'] for b in cast_data['blings'] if b['type'] == 'BlingAudio'][0], | ||||
|             'url': cast_data['blings'][0]['audio'], | ||||
|             'title': cast_data['name'], | ||||
|             'description': cast_data.get('description'), | ||||
|             'thumbnail': cast_data.get('image'), | ||||
|             'timestamp': parse_iso8601(cast_data.get('publishingDate')), | ||||
|             'timestamp': int_or_none(cast_data.get('publishingDate')), | ||||
|             'duration': int_or_none(cast_data.get('duration')), | ||||
|         } | ||||
|  | ||||
|   | ||||
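Review note on the `'url'` lines in the hunk above: one variant takes `blings[0]['audio']`, the other filters for entries whose `type` is `'BlingAudio'`. The sketch below is a standalone illustration of the difference. The function name `pick_audio_url` and the sample data are hypothetical, and only the keys `type` and `audio` and the value `BlingAudio` come from the diff.

```python
def pick_audio_url(blings):
    """Return the audio URL of the first 'BlingAudio' entry, or None."""
    for bling in blings:
        if bling.get('type') == 'BlingAudio':
            return bling['audio']
    return None


# Made-up episode data in which the first bling is not an audio entry
sample_blings = [
    {'type': 'BlingImage', 'image': 'https://example.invalid/cover.jpg'},
    {'type': 'BlingAudio', 'audio': 'https://example.invalid/episode.mp3'},
]

audio_url = pick_audio_url(sample_blings)
no_audio = pick_audio_url([{'type': 'BlingImage'}])
```

With data shaped like this, indexing `blings[0]['audio']` would raise `KeyError`, which is the case the "multiple blings" test in the hunk appears to cover.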
										
											
File diff suppressed because it is too large
| @@ -30,7 +30,7 @@ class AdobeTVIE(AdobeTVBaseIE): | ||||
|             'ext': 'mp4', | ||||
|             'title': 'Quick Tip - How to Draw a Circle Around an Object in Photoshop', | ||||
|             'description': 'md5:99ec318dc909d7ba2a1f2b038f7d2311', | ||||
|             'thumbnail': r're:https?://.*\.jpg$', | ||||
|             'thumbnail': 're:https?://.*\.jpg$', | ||||
|             'upload_date': '20110914', | ||||
|             'duration': 60, | ||||
|             'view_count': int, | ||||
|   | ||||
| @@ -3,14 +3,16 @@ from __future__ import unicode_literals | ||||
|  | ||||
| import re | ||||
|  | ||||
| from .turner import TurnerBaseIE | ||||
| from .common import InfoExtractor | ||||
| from ..utils import ( | ||||
|     determine_ext, | ||||
|     ExtractorError, | ||||
|     int_or_none, | ||||
|     float_or_none, | ||||
|     xpath_text, | ||||
| ) | ||||
|  | ||||
|  | ||||
| class AdultSwimIE(TurnerBaseIE): | ||||
| class AdultSwimIE(InfoExtractor): | ||||
|     _VALID_URL = r'https?://(?:www\.)?adultswim\.com/videos/(?P<is_playlist>playlists/)?(?P<show_path>[^/]+)/(?P<episode_path>[^/?#]+)/?' | ||||
|  | ||||
|     _TESTS = [{ | ||||
| @@ -81,42 +83,6 @@ class AdultSwimIE(TurnerBaseIE): | ||||
|             # m3u8 download | ||||
|             'skip_download': True, | ||||
|         } | ||||
|     }, { | ||||
|         # heroMetadata.trailer | ||||
|         'url': 'http://www.adultswim.com/videos/decker/inside-decker-a-new-hero/', | ||||
|         'info_dict': { | ||||
|             'id': 'I0LQFQkaSUaFp8PnAWHhoQ', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'Decker - Inside Decker: A New Hero', | ||||
|             'description': 'md5:c916df071d425d62d70c86d4399d3ee0', | ||||
|             'duration': 249.008, | ||||
|         }, | ||||
|         'params': { | ||||
|             # m3u8 download | ||||
|             'skip_download': True, | ||||
|         }, | ||||
|         'expected_warnings': ['Unable to download f4m manifest'], | ||||
|     }, { | ||||
|         'url': 'http://www.adultswim.com/videos/toonami/friday-october-14th-2016/', | ||||
|         'info_dict': { | ||||
|             'id': 'eYiLsKVgQ6qTC6agD67Sig', | ||||
|             'title': 'Toonami - Friday, October 14th, 2016', | ||||
|             'description': 'md5:99892c96ffc85e159a428de85c30acde', | ||||
|         }, | ||||
|         'playlist': [{ | ||||
|             'md5': '', | ||||
|             'info_dict': { | ||||
|                 'id': 'eYiLsKVgQ6qTC6agD67Sig', | ||||
|                 'ext': 'mp4', | ||||
|                 'title': 'Toonami - Friday, October 14th, 2016', | ||||
|                 'description': 'md5:99892c96ffc85e159a428de85c30acde', | ||||
|             }, | ||||
|         }], | ||||
|         'params': { | ||||
|             # m3u8 download | ||||
|             'skip_download': True, | ||||
|         }, | ||||
|         'expected_warnings': ['Unable to download f4m manifest'], | ||||
|     }] | ||||
|  | ||||
|     @staticmethod | ||||
| @@ -167,58 +133,79 @@ class AdultSwimIE(TurnerBaseIE): | ||||
|             if video_info is None: | ||||
|                 if bootstrapped_data.get('slugged_video', {}).get('slug') == episode_path: | ||||
|                     video_info = bootstrapped_data['slugged_video'] | ||||
|             if not video_info: | ||||
|                 video_info = bootstrapped_data.get( | ||||
|                     'heroMetadata', {}).get('trailer', {}).get('video') | ||||
|             if not video_info: | ||||
|                 video_info = bootstrapped_data.get('onlineOriginals', [None])[0] | ||||
|             if not video_info: | ||||
|                 raise ExtractorError('Unable to find video info') | ||||
|                 else: | ||||
|                     raise ExtractorError('Unable to find video info') | ||||
|  | ||||
|             show = bootstrapped_data['show'] | ||||
|             show_title = show['title'] | ||||
|             stream = video_info.get('stream') | ||||
|             if stream and stream.get('videoPlaybackID'): | ||||
|                 segment_ids = [stream['videoPlaybackID']] | ||||
|             elif video_info.get('clips'): | ||||
|                 segment_ids = [clip['videoPlaybackID'] for clip in video_info['clips']] | ||||
|             elif video_info.get('videoPlaybackID'): | ||||
|                 segment_ids = [video_info['videoPlaybackID']] | ||||
|             elif video_info.get('id'): | ||||
|                 segment_ids = [video_info['id']] | ||||
|             else: | ||||
|                 if video_info.get('auth') is True: | ||||
|                     raise ExtractorError( | ||||
|                         'This video is only available via cable service provider subscription that' | ||||
|                         ' is not currently supported. You may want to use --cookies.', expected=True) | ||||
|                 else: | ||||
|                     raise ExtractorError('Unable to find stream or clips') | ||||
|             clips = [stream] if stream else video_info.get('clips') | ||||
|             if not clips: | ||||
|                 raise ExtractorError( | ||||
|                     'This video is only available via cable service provider subscription that' | ||||
|                     ' is not currently supported. You may want to use --cookies.' | ||||
|                     if video_info.get('auth') is True else 'Unable to find stream or clips', | ||||
|                     expected=True) | ||||
|             segment_ids = [clip['videoPlaybackID'] for clip in clips] | ||||
|  | ||||
|         episode_id = video_info['id'] | ||||
|         episode_title = video_info['title'] | ||||
|         episode_description = video_info.get('description') | ||||
|         episode_duration = int_or_none(video_info.get('duration')) | ||||
|         view_count = int_or_none(video_info.get('views')) | ||||
|         episode_description = video_info['description'] | ||||
|         episode_duration = video_info.get('duration') | ||||
|  | ||||
|         entries = [] | ||||
|         for part_num, segment_id in enumerate(segment_ids): | ||||
|             segement_info = self._extract_cvp_info( | ||||
|                 'http://www.adultswim.com/videos/api/v0/assets?id=%s&platform=desktop' % segment_id, | ||||
|                 segment_id, { | ||||
|                     'secure': { | ||||
|                         'media_src': 'http://androidhls-secure.cdn.turner.com/adultswim/big', | ||||
|                         'tokenizer_src': 'http://www.adultswim.com/astv/mvpd/processors/services/token_ipadAdobe.do', | ||||
|                     }, | ||||
|                 }) | ||||
|             segment_url = 'http://www.adultswim.com/videos/api/v0/assets?id=%s&platform=desktop' % segment_id | ||||
|  | ||||
|             segment_title = '%s - %s' % (show_title, episode_title) | ||||
|             if len(segment_ids) > 1: | ||||
|                 segment_title += ' Part %d' % (part_num + 1) | ||||
|             segement_info.update({ | ||||
|  | ||||
|             idoc = self._download_xml( | ||||
|                 segment_url, segment_title, | ||||
|                 'Downloading segment information', 'Unable to download segment information') | ||||
|  | ||||
|             segment_duration = float_or_none( | ||||
|                 xpath_text(idoc, './/trt', 'segment duration').strip()) | ||||
|  | ||||
|             formats = [] | ||||
|             file_els = idoc.findall('.//files/file') or idoc.findall('./files/file') | ||||
|  | ||||
|             unique_urls = [] | ||||
|             unique_file_els = [] | ||||
|             for file_el in file_els: | ||||
|                 media_url = file_el.text | ||||
|                 if not media_url or determine_ext(media_url) == 'f4m': | ||||
|                     continue | ||||
|                 if file_el.text not in unique_urls: | ||||
|                     unique_urls.append(file_el.text) | ||||
|                     unique_file_els.append(file_el) | ||||
|  | ||||
|             for file_el in unique_file_els: | ||||
|                 bitrate = file_el.attrib.get('bitrate') | ||||
|                 ftype = file_el.attrib.get('type') | ||||
|                 media_url = file_el.text | ||||
|                 if determine_ext(media_url) == 'm3u8': | ||||
|                     formats.extend(self._extract_m3u8_formats( | ||||
|                         media_url, segment_title, 'mp4', preference=0, | ||||
|                         m3u8_id='hls', fatal=False)) | ||||
|                 else: | ||||
|                     formats.append({ | ||||
|                         'format_id': '%s_%s' % (bitrate, ftype), | ||||
|                         'url': file_el.text.strip(), | ||||
|                         # The bitrate may not be a number (for example: 'iphone') | ||||
|                         'tbr': int(bitrate) if bitrate.isdigit() else None, | ||||
|                     }) | ||||
|  | ||||
|             self._sort_formats(formats) | ||||
|  | ||||
|             entries.append({ | ||||
|                 'id': segment_id, | ||||
|                 'title': segment_title, | ||||
|                 'description': episode_description, | ||||
|                 'formats': formats, | ||||
|                 'duration': segment_duration, | ||||
|                 'description': episode_description | ||||
|             }) | ||||
|             entries.append(segement_info) | ||||
|  | ||||
|         return { | ||||
|             '_type': 'playlist', | ||||
| @@ -227,6 +214,5 @@ class AdultSwimIE(TurnerBaseIE): | ||||
|             'entries': entries, | ||||
|             'title': '%s - %s' % (show_title, episode_title), | ||||
|             'description': episode_description, | ||||
|             'duration': episode_duration, | ||||
|             'view_count': view_count, | ||||
|             'duration': episode_duration | ||||
|         } | ||||
|   | ||||
| @@ -23,10 +23,10 @@ class AENetworksBaseIE(ThePlatformIE): | ||||
| class AENetworksIE(AENetworksBaseIE): | ||||
|     IE_NAME = 'aenetworks' | ||||
|     IE_DESC = 'A+E Networks: A&E, Lifetime, History.com, FYI Network' | ||||
|     _VALID_URL = r'https?://(?:www\.)?(?P<domain>(?:history|aetv|mylifetime|lifetimemovieclub)\.com|fyi\.tv)/(?:shows/(?P<show_path>[^/]+(?:/[^/]+){0,2})|movies/(?P<movie_display_id>[^/]+)(?:/full-movie)?)' | ||||
|     _VALID_URL = r'https?://(?:www\.)?(?P<domain>(?:history|aetv|mylifetime)\.com|fyi\.tv)/(?:shows/(?P<show_path>[^/]+(?:/[^/]+){0,2})|movies/(?P<movie_display_id>[^/]+)/full-movie)' | ||||
|     _TESTS = [{ | ||||
|         'url': 'http://www.history.com/shows/mountain-men/season-1/episode-1', | ||||
|         'md5': 'a97a65f7e823ae10e9244bc5433d5fe6', | ||||
|         'md5': '8ff93eb073449f151d6b90c0ae1ef0c7', | ||||
|         'info_dict': { | ||||
|             'id': '22253814', | ||||
|             'ext': 'mp4', | ||||
| @@ -62,15 +62,11 @@ class AENetworksIE(AENetworksBaseIE): | ||||
|     }, { | ||||
|         'url': 'http://www.mylifetime.com/movies/center-stage-on-pointe/full-movie', | ||||
|         'only_matching': True | ||||
|     }, { | ||||
|         'url': 'https://www.lifetimemovieclub.com/movies/a-killer-among-us', | ||||
|         'only_matching': True | ||||
|     }] | ||||
|     _DOMAIN_TO_REQUESTOR_ID = { | ||||
|         'history.com': 'HISTORY', | ||||
|         'aetv.com': 'AETV', | ||||
|         'mylifetime.com': 'LIFETIME', | ||||
|         'lifetimemovieclub.com': 'LIFETIMEMOVIECLUB', | ||||
|         'fyi.tv': 'FYI', | ||||
|     } | ||||
|  | ||||
| @@ -91,7 +87,7 @@ class AENetworksIE(AENetworksBaseIE): | ||||
|                     self._html_search_meta('aetn:SeriesTitle', webpage)) | ||||
|             elif url_parts_len == 2: | ||||
|                 entries = [] | ||||
|                 for episode_item in re.findall(r'(?s)<[^>]+class="[^"]*(?:episode|program)-item[^"]*"[^>]*>', webpage): | ||||
|                 for episode_item in re.findall(r'(?s)<div[^>]+class="[^"]*episode-item[^"]*"[^>]*>', webpage): | ||||
|                     episode_attributes = extract_attributes(episode_item) | ||||
|                     episode_url = compat_urlparse.urljoin( | ||||
|                         url, episode_attributes['data-canonical']) | ||||
| @@ -103,7 +99,7 @@ class AENetworksIE(AENetworksBaseIE): | ||||
|  | ||||
|         query = { | ||||
|             'mbr': 'true', | ||||
|             'assetTypes': 'high_video_s3' | ||||
|             'assetTypes': 'medium_video_s3' | ||||
|         } | ||||
|         video_id = self._html_search_meta('aetn:VideoID', webpage) | ||||
|         media_url = self._search_regex( | ||||
| @@ -113,10 +109,7 @@ class AENetworksIE(AENetworksBaseIE): | ||||
|         info = self._parse_theplatform_metadata(theplatform_metadata) | ||||
|         if theplatform_metadata.get('AETN$isBehindWall'): | ||||
|             requestor_id = self._DOMAIN_TO_REQUESTOR_ID[domain] | ||||
|             resource = self._get_mvpd_resource( | ||||
|                 requestor_id, theplatform_metadata['title'], | ||||
|                 theplatform_metadata.get('AETN$PPL_pplProgramId') or theplatform_metadata.get('AETN$PPL_pplProgramId_OLD'), | ||||
|                 theplatform_metadata['ratings'][0]['rating']) | ||||
|             resource = '<rss version="2.0" xmlns:media="http://search.yahoo.com/mrss/"><channel><title>%s</title><item><title>%s</title><guid>%s</guid><media:rating scheme="urn:v-chip">%s</media:rating></item></channel></rss>' % (requestor_id, theplatform_metadata['title'], theplatform_metadata['AETN$PPL_pplProgramId'], theplatform_metadata['ratings'][0]['rating']) | ||||
|             query['auth'] = self._extract_mvpd_auth( | ||||
|                 url, video_id, requestor_id, resource) | ||||
|         info.update(self._search_json_ld(webpage, video_id, fatal=False)) | ||||
| @@ -159,7 +152,7 @@ class HistoryTopicIE(AENetworksBaseIE): | ||||
|             'id': 'world-war-i-history', | ||||
|             'title': 'World War I History', | ||||
|         }, | ||||
|         'playlist_mincount': 23, | ||||
|         'playlist_mincount': 24, | ||||
|     }, { | ||||
|         'url': 'http://www.history.com/topics/world-war-i-history/videos', | ||||
|         'only_matching': True, | ||||
| @@ -197,8 +190,7 @@ class HistoryTopicIE(AENetworksBaseIE): | ||||
|             return self.theplatform_url_result( | ||||
|                 release_url, video_id, { | ||||
|                     'mbr': 'true', | ||||
|                     'switch': 'hls', | ||||
|                     'assetTypes': 'high_video_ak', | ||||
|                     'switch': 'hls' | ||||
|                 }) | ||||
|         else: | ||||
|             webpage = self._download_webpage(url, topic_id) | ||||
| @@ -208,7 +200,6 @@ class HistoryTopicIE(AENetworksBaseIE): | ||||
|                 entries.append(self.theplatform_url_result( | ||||
|                     video_attributes['data-release-url'], video_attributes['data-id'], { | ||||
|                         'mbr': 'true', | ||||
|                         'switch': 'hls', | ||||
|                         'assetTypes': 'high_video_ak', | ||||
|                         'switch': 'hls' | ||||
|                     })) | ||||
|             return self.playlist_result(entries, topic_id, get_element_by_attribute('class', 'show-title', webpage)) | ||||
|   | ||||
| @@ -11,27 +11,19 @@ from ..compat import ( | ||||
| from ..utils import ( | ||||
|     ExtractorError, | ||||
|     int_or_none, | ||||
|     update_url_query, | ||||
|     xpath_element, | ||||
|     xpath_text, | ||||
| ) | ||||
|  | ||||
|  | ||||
| class AfreecaTVIE(InfoExtractor): | ||||
|     IE_NAME = 'afreecatv' | ||||
|     IE_DESC = 'afreecatv.com' | ||||
|     _VALID_URL = r'''(?x) | ||||
|                     https?:// | ||||
|                         (?: | ||||
|                             (?:(?:live|afbbs|www)\.)?afreeca(?:tv)?\.com(?::\d+)? | ||||
|                             (?: | ||||
|                                 /app/(?:index|read_ucc_bbs)\.cgi| | ||||
|                                 /player/[Pp]layer\.(?:swf|html) | ||||
|                             )\?.*?\bnTitleNo=| | ||||
|                             vod\.afreecatv\.com/PLAYER/STATION/ | ||||
|                         ) | ||||
|                         (?P<id>\d+) | ||||
|                     ''' | ||||
|     _VALID_URL = r'''(?x)^ | ||||
|         https?://(?:(live|afbbs|www)\.)?afreeca(?:tv)?\.com(?::\d+)? | ||||
|         (?: | ||||
|             /app/(?:index|read_ucc_bbs)\.cgi| | ||||
|             /player/[Pp]layer\.(?:swf|html)) | ||||
|         \?.*?\bnTitleNo=(?P<id>\d+)''' | ||||
|     _TESTS = [{ | ||||
|         'url': 'http://live.afreecatv.com:8079/app/index.cgi?szType=read_ucc_bbs&szBjId=dailyapril&nStationNo=16711924&nBbsNo=18605867&nTitleNo=36164052&szSkin=', | ||||
|         'md5': 'f72c89fe7ecc14c1b5ce506c4996046e', | ||||
| @@ -74,9 +66,6 @@ class AfreecaTVIE(InfoExtractor): | ||||
|     }, { | ||||
|         'url': 'http://www.afreecatv.com/player/Player.swf?szType=szBjId=djleegoon&nStationNo=11273158&nBbsNo=13161095&nTitleNo=36327652', | ||||
|         'only_matching': True, | ||||
|     }, { | ||||
|         'url': 'http://vod.afreecatv.com/PLAYER/STATION/15055030', | ||||
|         'only_matching': True, | ||||
|     }] | ||||
|  | ||||
|     @staticmethod | ||||
| @@ -94,9 +83,7 @@ class AfreecaTVIE(InfoExtractor): | ||||
|         info_url = compat_urlparse.urlunparse(parsed_url._replace( | ||||
|             netloc='afbbs.afreecatv.com:8080', | ||||
|             path='/api/video/get_video_info.php')) | ||||
|  | ||||
|         video_xml = self._download_xml( | ||||
|             update_url_query(info_url, {'nTitleNo': video_id}), video_id) | ||||
|         video_xml = self._download_xml(info_url, video_id) | ||||
|  | ||||
|         if xpath_element(video_xml, './track/video/file') is None: | ||||
|             raise ExtractorError('Specified AfreecaTV video does not exist', | ||||
| @@ -144,107 +131,3 @@ class AfreecaTVIE(InfoExtractor): | ||||
|                 expected=True) | ||||
|  | ||||
|         return info | ||||
|  | ||||
|  | ||||
| class AfreecaTVGlobalIE(AfreecaTVIE): | ||||
|     IE_NAME = 'afreecatv:global' | ||||
|     _VALID_URL = r'https?://(?:www\.)?afreeca\.tv/(?P<channel_id>\d+)(?:/v/(?P<video_id>\d+))?' | ||||
|     _TESTS = [{ | ||||
|         'url': 'http://afreeca.tv/36853014/v/58301', | ||||
|         'info_dict': { | ||||
|             'id': '58301', | ||||
|             'title': 'tryhard top100', | ||||
|             'uploader_id': '36853014', | ||||
|             'uploader': 'makgi Hearthstone Live!', | ||||
|         }, | ||||
|         'playlist_count': 3, | ||||
|     }] | ||||
|  | ||||
|     def _real_extract(self, url): | ||||
|         channel_id, video_id = re.match(self._VALID_URL, url).groups() | ||||
|         video_type = 'video' if video_id else 'live' | ||||
|         query = { | ||||
|             'pt': 'view', | ||||
|             'bid': channel_id, | ||||
|         } | ||||
|         if video_id: | ||||
|             query['vno'] = video_id | ||||
|         video_data = self._download_json( | ||||
|             'http://api.afreeca.tv/%s/view_%s.php' % (video_type, video_type), | ||||
|             video_id or channel_id, query=query)['channel'] | ||||
|  | ||||
|         if video_data.get('result') != 1: | ||||
|             raise ExtractorError('%s said: %s' % (self.IE_NAME, video_data['remsg'])) | ||||
|  | ||||
|         title = video_data['title'] | ||||
|  | ||||
|         info = { | ||||
|             'thumbnail': video_data.get('thumb'), | ||||
|             'view_count': int_or_none(video_data.get('vcnt')), | ||||
|             'age_limit': int_or_none(video_data.get('grade')), | ||||
|             'uploader_id': channel_id, | ||||
|             'uploader': video_data.get('cname'), | ||||
|         } | ||||
|  | ||||
|         if video_id: | ||||
|             entries = [] | ||||
|             for i, f in enumerate(video_data.get('flist', [])): | ||||
|                 video_key = self.parse_video_key(f.get('key', '')) | ||||
|                 f_url = f.get('file') | ||||
|                 if not video_key or not f_url: | ||||
|                     continue | ||||
|                 entries.append({ | ||||
|                     'id': '%s_%s' % (video_id, video_key.get('part', i + 1)), | ||||
|                     'title': title, | ||||
|                     'upload_date': video_key.get('upload_date'), | ||||
|                     'duration': int_or_none(f.get('length')), | ||||
|                     'url': f_url, | ||||
|                     'protocol': 'm3u8_native', | ||||
|                     'ext': 'mp4', | ||||
|                 }) | ||||
|  | ||||
|             info.update({ | ||||
|                 'id': video_id, | ||||
|                 'title': title, | ||||
|                 'duration': int_or_none(video_data.get('length')), | ||||
|             }) | ||||
|             if len(entries) > 1: | ||||
|                 info['_type'] = 'multi_video' | ||||
|                 info['entries'] = entries | ||||
|             elif len(entries) == 1: | ||||
|                 i = entries[0].copy() | ||||
|                 i.update(info) | ||||
|                 info = i | ||||
|         else: | ||||
|             formats = [] | ||||
|             for s in video_data.get('strm', []): | ||||
|                 s_url = s.get('purl') | ||||
|                 if not s_url: | ||||
|                     continue | ||||
|                 stype = s.get('stype') | ||||
|                 if stype == 'HLS': | ||||
|                     formats.extend(self._extract_m3u8_formats( | ||||
|                         s_url, channel_id, 'mp4', m3u8_id=stype, fatal=False)) | ||||
|                 elif stype == 'RTMP': | ||||
|                     format_id = [stype] | ||||
|                     label = s.get('label') | ||||
|                     if label: | ||||
|                         format_id.append(label) | ||||
|                     formats.append({ | ||||
|                         'format_id': '-'.join(format_id), | ||||
|                         'url': s_url, | ||||
|                         'tbr': int_or_none(s.get('bps')), | ||||
|                         'height': int_or_none(s.get('brt')), | ||||
|                         'ext': 'flv', | ||||
|                         'rtmp_live': True, | ||||
|                     }) | ||||
|             self._sort_formats(formats) | ||||
|  | ||||
|             info.update({ | ||||
|                 'id': channel_id, | ||||
|                 'title': self._live_title(title), | ||||
|                 'is_live': True, | ||||
|                 'formats': formats, | ||||
|             }) | ||||
|  | ||||
|         return info | ||||
|   | ||||
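Reviewer note on the `AfreecaTVIE._VALID_URL` hunk above: the two patterns differ mainly in whether `vod.afreecatv.com/PLAYER/STATION/<id>` links are accepted. The following is a minimal standalone sketch that checks both patterns against the test URLs from the same hunk. The patterns are copied from the diff; the names `WITH_VOD`, `WITHOUT_VOD` and `match_id` are illustrative only and are not part of youtube-dl.

```python
import re

# Variant from the hunk that also accepts vod.afreecatv.com station links.
WITH_VOD = r'''(?x)
    https?://
        (?:
            (?:(?:live|afbbs|www)\.)?afreeca(?:tv)?\.com(?::\d+)?
            (?:
                /app/(?:index|read_ucc_bbs)\.cgi|
                /player/[Pp]layer\.(?:swf|html)
            )\?.*?\bnTitleNo=|
            vod\.afreecatv\.com/PLAYER/STATION/
        )
        (?P<id>\d+)
    '''

# Variant from the hunk that only accepts URLs carrying an nTitleNo query parameter.
WITHOUT_VOD = r'''(?x)^
    https?://(?:(live|afbbs|www)\.)?afreeca(?:tv)?\.com(?::\d+)?
    (?:
        /app/(?:index|read_ucc_bbs)\.cgi|
        /player/[Pp]layer\.(?:swf|html))
    \?.*?\bnTitleNo=(?P<id>\d+)'''

# Test URLs taken from the _TESTS entries shown in the diff.
LIVE_URL = ('http://live.afreecatv.com:8079/app/index.cgi?szType=read_ucc_bbs'
            '&szBjId=dailyapril&nStationNo=16711924&nBbsNo=18605867'
            '&nTitleNo=36164052&szSkin=')
PLAYER_URL = ('http://www.afreecatv.com/player/Player.swf?szType=szBjId=djleegoon'
              '&nStationNo=11273158&nBbsNo=13161095&nTitleNo=36327652')
VOD_URL = 'http://vod.afreecatv.com/PLAYER/STATION/15055030'


def match_id(pattern, url):
    """Return the captured 'id' group, or None when the URL does not match."""
    m = re.match(pattern, url)
    return m.group('id') if m else None


if __name__ == '__main__':
    for name, pattern in (('with vod', WITH_VOD), ('without vod', WITHOUT_VOD)):
        for url in (LIVE_URL, PLAYER_URL, VOD_URL):
            print(name, match_id(pattern, url))
```

Both patterns extract the `nTitleNo` value from the two query-string URLs; only the first one matches the `vod.afreecatv.com` link, which is consistent with that test entry appearing on just one side of the diff.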
							
								
								
									
youtube_dl/extractor/aftonbladet.py (64 lines, Normal file)
| @@ -0,0 +1,64 @@ | ||||
| # encoding: utf-8 | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| from .common import InfoExtractor | ||||
| from ..utils import int_or_none | ||||
|  | ||||
|  | ||||
| class AftonbladetIE(InfoExtractor): | ||||
|     _VALID_URL = r'https?://tv\.aftonbladet\.se/abtv/articles/(?P<id>[0-9]+)' | ||||
|     _TEST = { | ||||
|         'url': 'http://tv.aftonbladet.se/abtv/articles/36015', | ||||
|         'info_dict': { | ||||
|             'id': '36015', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'Vulkanutbrott i rymden - nu släpper NASA bilderna', | ||||
|             'description': 'Jupiters måne mest aktiv av alla himlakroppar', | ||||
|             'timestamp': 1394142732, | ||||
|             'upload_date': '20140306', | ||||
|         }, | ||||
|     } | ||||
|  | ||||
|     def _real_extract(self, url): | ||||
|         video_id = self._match_id(url) | ||||
|         webpage = self._download_webpage(url, video_id) | ||||
|  | ||||
|         # find internal video meta data | ||||
|         meta_url = 'http://aftonbladet-play-metadata.cdn.drvideo.aptoma.no/video/%s.json' | ||||
|         player_config = self._parse_json(self._html_search_regex( | ||||
|             r'data-player-config="([^"]+)"', webpage, 'player config'), video_id) | ||||
|         internal_meta_id = player_config['aptomaVideoId'] | ||||
|         internal_meta_url = meta_url % internal_meta_id | ||||
|         internal_meta_json = self._download_json( | ||||
|             internal_meta_url, video_id, 'Downloading video meta data') | ||||
|  | ||||
|         # find internal video formats | ||||
|         format_url = 'http://aftonbladet-play.videodata.drvideo.aptoma.no/actions/video/?id=%s' | ||||
|         internal_video_id = internal_meta_json['videoId'] | ||||
|         internal_formats_url = format_url % internal_video_id | ||||
|         internal_formats_json = self._download_json( | ||||
|             internal_formats_url, video_id, 'Downloading video formats') | ||||
|  | ||||
|         formats = [] | ||||
|         for fmt in internal_formats_json['formats']['http']['pseudostreaming']['mp4']: | ||||
|             p = fmt['paths'][0] | ||||
|             formats.append({ | ||||
|                 'url': 'http://%s:%d/%s/%s' % (p['address'], p['port'], p['path'], p['filename']), | ||||
|                 'ext': 'mp4', | ||||
|                 'width': int_or_none(fmt.get('width')), | ||||
|                 'height': int_or_none(fmt.get('height')), | ||||
|                 'tbr': int_or_none(fmt.get('bitrate')), | ||||
|                 'protocol': 'http', | ||||
|             }) | ||||
|         self._sort_formats(formats) | ||||
|  | ||||
|         return { | ||||
|             'id': video_id, | ||||
|             'title': internal_meta_json['title'], | ||||
|             'formats': formats, | ||||
|             'thumbnail': internal_meta_json.get('imageUrl'), | ||||
|             'description': internal_meta_json.get('shortPreamble'), | ||||
|             'timestamp': int_or_none(internal_meta_json.get('timePublished')), | ||||
|             'duration': int_or_none(internal_meta_json.get('duration')), | ||||
|             'view_count': int_or_none(internal_meta_json.get('views')), | ||||
|         } | ||||
| @@ -20,7 +20,7 @@ class AirMozillaIE(InfoExtractor): | ||||
|             'id': '6x4q2w', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'Privacy Lab - a meetup for privacy minded people in San Francisco', | ||||
|             'thumbnail': r're:https?://vid\.ly/(?P<id>[0-9a-z-]+)/poster', | ||||
|             'thumbnail': 're:https?://vid\.ly/(?P<id>[0-9a-z-]+)/poster', | ||||
|             'description': 'Brings together privacy professionals and others interested in privacy at for-profits, non-profits, and NGOs in an effort to contribute to the state of the ecosystem...', | ||||
|             'timestamp': 1422487800, | ||||
|             'upload_date': '20150128', | ||||
|   | ||||
| @@ -4,7 +4,7 @@ from .common import InfoExtractor | ||||
|  | ||||
|  | ||||
| class AlJazeeraIE(InfoExtractor): | ||||
|     _VALID_URL = r'https?://(?:www\.)?aljazeera\.com/programmes/.*?/(?P<id>[^/]+)\.html' | ||||
|     _VALID_URL = r'https?://www\.aljazeera\.com/programmes/.*?/(?P<id>[^/]+)\.html' | ||||
|  | ||||
|     _TEST = { | ||||
|         'url': 'http://www.aljazeera.com/programmes/the-slum/2014/08/deliverance-201482883754237240.html', | ||||
|   | ||||
| @@ -1,109 +1,94 @@ | ||||
| # coding: utf-8 | ||||
| # -*- coding: utf-8 -*- | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| import re | ||||
| import json | ||||
|  | ||||
| from .common import InfoExtractor | ||||
| from ..compat import compat_str | ||||
| from ..utils import ( | ||||
|     remove_end, | ||||
|     qualities, | ||||
|     url_basename, | ||||
|     unescapeHTML, | ||||
|     xpath_element, | ||||
| ) | ||||
|  | ||||
|  | ||||
| class AllocineIE(InfoExtractor): | ||||
|     _VALID_URL = r'https?://(?:www\.)?allocine\.fr/(?:article|video|film)/(?:fichearticle_gen_carticle=|player_gen_cmedia=|fichefilm_gen_cfilm=|video-)(?P<id>[0-9]+)(?:\.html)?' | ||||
|     _VALID_URL = r'https?://(?:www\.)?allocine\.fr/(?P<typ>article|video|film)/(fichearticle_gen_carticle=|player_gen_cmedia=|fichefilm_gen_cfilm=|video-)(?P<id>[0-9]+)(?:\.html)?' | ||||
|  | ||||
|     _TESTS = [{ | ||||
|         'url': 'http://www.allocine.fr/article/fichearticle_gen_carticle=18635087.html', | ||||
|         'md5': '0c9fcf59a841f65635fa300ac43d8269', | ||||
|         'info_dict': { | ||||
|             'id': '19546517', | ||||
|             'display_id': '18635087', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'Astérix - Le Domaine des Dieux Teaser VF', | ||||
|             'description': 'md5:4a754271d9c6f16c72629a8a993ee884', | ||||
|             'thumbnail': r're:http://.*\.jpg', | ||||
|             'description': 'md5:abcd09ce503c6560512c14ebfdb720d2', | ||||
|             'thumbnail': 're:http://.*\.jpg', | ||||
|         }, | ||||
|     }, { | ||||
|         'url': 'http://www.allocine.fr/video/player_gen_cmedia=19540403&cfilm=222257.html', | ||||
|         'md5': 'd0cdce5d2b9522ce279fdfec07ff16e0', | ||||
|         'info_dict': { | ||||
|             'id': '19540403', | ||||
|             'display_id': '19540403', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'Planes 2 Bande-annonce VF', | ||||
|             'description': 'Regardez la bande annonce du film Planes 2 (Planes 2 Bande-annonce VF). Planes 2, un film de Roberts Gannaway', | ||||
|             'thumbnail': r're:http://.*\.jpg', | ||||
|             'thumbnail': 're:http://.*\.jpg', | ||||
|         }, | ||||
|     }, { | ||||
|         'url': 'http://www.allocine.fr/video/player_gen_cmedia=19544709&cfilm=181290.html', | ||||
|         'url': 'http://www.allocine.fr/film/fichefilm_gen_cfilm=181290.html', | ||||
|         'md5': '101250fb127ef9ca3d73186ff22a47ce', | ||||
|         'info_dict': { | ||||
|             'id': '19544709', | ||||
|             'display_id': '19544709', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'Dragons 2 - Bande annonce finale VF', | ||||
|             'description': 'md5:6cdd2d7c2687d4c6aafe80a35e17267a', | ||||
|             'thumbnail': r're:http://.*\.jpg', | ||||
|             'description': 'md5:601d15393ac40f249648ef000720e7e3', | ||||
|             'thumbnail': 're:http://.*\.jpg', | ||||
|         }, | ||||
|     }, { | ||||
|         'url': 'http://www.allocine.fr/video/video-19550147/', | ||||
|         'md5': '3566c0668c0235e2d224fd8edb389f67', | ||||
|         'info_dict': { | ||||
|             'id': '19550147', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'Faux Raccord N°123 - Les gaffes de Cliffhanger', | ||||
|             'description': 'md5:bc734b83ffa2d8a12188d9eb48bb6354', | ||||
|             'thumbnail': r're:http://.*\.jpg', | ||||
|         }, | ||||
|         'only_matching': True, | ||||
|     }] | ||||
|  | ||||
|     def _real_extract(self, url): | ||||
|         display_id = self._match_id(url) | ||||
|         mobj = re.match(self._VALID_URL, url) | ||||
|         typ = mobj.group('typ') | ||||
|         display_id = mobj.group('id') | ||||
|  | ||||
|         webpage = self._download_webpage(url, display_id) | ||||
|  | ||||
|         formats = [] | ||||
|         if typ == 'film': | ||||
|             video_id = self._search_regex(r'href="/video/player_gen_cmedia=([0-9]+).+"', webpage, 'video id') | ||||
|         else: | ||||
|             player = self._search_regex(r'data-player=\'([^\']+)\'>', webpage, 'data player', default=None) | ||||
|             if player: | ||||
|                 player_data = json.loads(player) | ||||
|                 video_id = compat_str(player_data['refMedia']) | ||||
|             else: | ||||
|                 model = self._search_regex(r'data-model="([^"]+)">', webpage, 'data model') | ||||
|                 model_data = self._parse_json(unescapeHTML(model), display_id) | ||||
|                 video_id = compat_str(model_data['id']) | ||||
|  | ||||
|         xml = self._download_xml('http://www.allocine.fr/ws/AcVisiondataV4.ashx?media=%s' % video_id, display_id) | ||||
|  | ||||
|         video = xpath_element(xml, './/AcVisionVideo').attrib | ||||
|         quality = qualities(['ld', 'md', 'hd']) | ||||
|  | ||||
|         model = self._html_search_regex( | ||||
|             r'data-model="([^"]+)"', webpage, 'data model', default=None) | ||||
|         if model: | ||||
|             model_data = self._parse_json(model, display_id) | ||||
|  | ||||
|             for video_url in model_data['sources'].values(): | ||||
|                 video_id, format_id = url_basename(video_url).split('_')[:2] | ||||
|         formats = [] | ||||
|         for k, v in video.items(): | ||||
|             if re.match(r'.+_path', k): | ||||
|                 format_id = k.split('_')[0] | ||||
|                 formats.append({ | ||||
|                     'format_id': format_id, | ||||
|                     'quality': quality(format_id), | ||||
|                     'url': video_url, | ||||
|                     'url': v, | ||||
|                 }) | ||||
|  | ||||
|             title = model_data['title'] | ||||
|         else: | ||||
|             video_id = display_id | ||||
|             media_data = self._download_json( | ||||
|                 'http://www.allocine.fr/ws/AcVisiondataV5.ashx?media=%s' % video_id, display_id) | ||||
|             for key, value in media_data['video'].items(): | ||||
|                 if not key.endswith('Path'): | ||||
|                     continue | ||||
|  | ||||
|                 format_id = key[:-len('Path')] | ||||
|                 formats.append({ | ||||
|                     'format_id': format_id, | ||||
|                     'quality': quality(format_id), | ||||
|                     'url': value, | ||||
|                 }) | ||||
|  | ||||
|             title = remove_end(self._html_search_regex( | ||||
|                 r'(?s)<title>(.+?)</title>', webpage, 'title' | ||||
|             ).strip(), ' - AlloCiné') | ||||
|  | ||||
|         self._sort_formats(formats) | ||||
|  | ||||
|         return { | ||||
|             'id': video_id, | ||||
|             'display_id': display_id, | ||||
|             'title': title, | ||||
|             'title': video['videoTitle'], | ||||
|             'thumbnail': self._og_search_thumbnail(webpage), | ||||
|             'formats': formats, | ||||
|             'description': self._og_search_description(webpage), | ||||
|   | ||||
| @@ -19,7 +19,7 @@ class AlphaPornoIE(InfoExtractor): | ||||
|             'display_id': 'sensual-striptease-porn-with-samantha-alexandra', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'Sensual striptease porn with Samantha Alexandra', | ||||
|             'thumbnail': r're:https?://.*\.jpg$', | ||||
|             'thumbnail': 're:https?://.*\.jpg$', | ||||
|             'timestamp': 1418694611, | ||||
|             'upload_date': '20141216', | ||||
|             'duration': 387, | ||||
|   | ||||
| @@ -1,107 +0,0 @@ | ||||
| # coding: utf-8 | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| from .theplatform import ThePlatformIE | ||||
| from ..utils import ( | ||||
|     update_url_query, | ||||
|     parse_age_limit, | ||||
|     int_or_none, | ||||
| ) | ||||
|  | ||||
|  | ||||
| class AMCNetworksIE(ThePlatformIE): | ||||
|     _VALID_URL = r'https?://(?:www\.)?(?:amc|bbcamerica|ifc|wetv)\.com/(?:movies/|shows/[^/]+/(?:full-episodes/)?[^/]+/episode-\d+(?:-(?:[^/]+/)?|/))(?P<id>[^/?#]+)' | ||||
|     _TESTS = [{ | ||||
|         'url': 'http://www.ifc.com/shows/maron/season-04/episode-01/step-1', | ||||
|         'md5': '', | ||||
|         'info_dict': { | ||||
|             'id': 's3MX01Nl4vPH', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'Maron - Season 4 - Step 1', | ||||
|             'description': 'In denial about his current situation, Marc is reluctantly convinced by his friends to enter rehab. Starring Marc Maron and Constance Zimmer.', | ||||
|             'age_limit': 17, | ||||
|             'upload_date': '20160505', | ||||
|             'timestamp': 1462468831, | ||||
|             'uploader': 'AMCN', | ||||
|         }, | ||||
|         'params': { | ||||
|             # m3u8 download | ||||
|             'skip_download': True, | ||||
|         }, | ||||
|         'skip': 'Requires TV provider accounts', | ||||
|     }, { | ||||
|         'url': 'http://www.bbcamerica.com/shows/the-hunt/full-episodes/season-1/episode-01-the-hardest-challenge', | ||||
|         'only_matching': True, | ||||
|     }, { | ||||
|         'url': 'http://www.amc.com/shows/preacher/full-episodes/season-01/episode-00/pilot', | ||||
|         'only_matching': True, | ||||
|     }, { | ||||
|         'url': 'http://www.wetv.com/shows/million-dollar-matchmaker/season-01/episode-06-the-dumped-dj-and-shallow-hal', | ||||
|         'only_matching': True, | ||||
|     }, { | ||||
|         'url': 'http://www.ifc.com/movies/chaos', | ||||
|         'only_matching': True, | ||||
|     }, { | ||||
|         'url': 'http://www.bbcamerica.com/shows/doctor-who/full-episodes/the-power-of-the-daleks/episode-01-episode-1-color-version', | ||||
|         'only_matching': True, | ||||
|     }] | ||||
|  | ||||
|     def _real_extract(self, url): | ||||
|         display_id = self._match_id(url) | ||||
|         webpage = self._download_webpage(url, display_id) | ||||
|         query = { | ||||
|             'mbr': 'true', | ||||
|             'manifest': 'm3u', | ||||
|         } | ||||
|         media_url = self._search_regex( | ||||
|             r'window\.platformLinkURL\s*=\s*[\'"]([^\'"]+)', | ||||
|             webpage, 'media url') | ||||
|         theplatform_metadata = self._download_theplatform_metadata(self._search_regex( | ||||
|             r'link\.theplatform\.com/s/([^?]+)', | ||||
|             media_url, 'theplatform_path'), display_id) | ||||
|         info = self._parse_theplatform_metadata(theplatform_metadata) | ||||
|         video_id = theplatform_metadata['pid'] | ||||
|         title = theplatform_metadata['title'] | ||||
|         rating = theplatform_metadata['ratings'][0]['rating'] | ||||
|         auth_required = self._search_regex( | ||||
|             r'window\.authRequired\s*=\s*(true|false);', | ||||
|             webpage, 'auth required') | ||||
|         if auth_required == 'true': | ||||
|             requestor_id = self._search_regex( | ||||
|                 r'window\.requestor_id\s*=\s*[\'"]([^\'"]+)', | ||||
|                 webpage, 'requestor id') | ||||
|             resource = self._get_mvpd_resource( | ||||
|                 requestor_id, title, video_id, rating) | ||||
|             query['auth'] = self._extract_mvpd_auth( | ||||
|                 url, video_id, requestor_id, resource) | ||||
|         media_url = update_url_query(media_url, query) | ||||
|         formats, subtitles = self._extract_theplatform_smil( | ||||
|             media_url, video_id) | ||||
|         self._sort_formats(formats) | ||||
|         info.update({ | ||||
|             'id': video_id, | ||||
|             'subtitles': subtitles, | ||||
|             'formats': formats, | ||||
|             'age_limit': parse_age_limit(parse_age_limit(rating)), | ||||
|         }) | ||||
|         ns_keys = theplatform_metadata.get('$xmlns', {}).keys() | ||||
|         if ns_keys: | ||||
|             ns = list(ns_keys)[0] | ||||
|             series = theplatform_metadata.get(ns + '$show') | ||||
|             season_number = int_or_none( | ||||
|                 theplatform_metadata.get(ns + '$season')) | ||||
|             episode = theplatform_metadata.get(ns + '$episodeTitle') | ||||
|             episode_number = int_or_none( | ||||
|                 theplatform_metadata.get(ns + '$episode')) | ||||
|             if season_number: | ||||
|                 title = 'Season %d - %s' % (season_number, title) | ||||
|             if series: | ||||
|                 title = '%s - %s' % (series, title) | ||||
|             info.update({ | ||||
|                 'title': title, | ||||
|                 'series': series, | ||||
|                 'season_number': season_number, | ||||
|                 'episode': episode, | ||||
|                 'episode_number': episode_number, | ||||
|             }) | ||||
|         return info | ||||
| @@ -157,16 +157,22 @@ class AnvatoIE(InfoExtractor): | ||||
|             video_data_url, video_id, transform_source=strip_jsonp, | ||||
|             data=json.dumps(payload).encode('utf-8')) | ||||
|  | ||||
|     def _get_anvato_videos(self, access_key, video_id): | ||||
|     def _extract_anvato_videos(self, webpage, video_id): | ||||
|         anvplayer_data = self._parse_json(self._html_search_regex( | ||||
|             r'<script[^>]+data-anvp=\'([^\']+)\'', webpage, | ||||
|             'Anvato player data'), video_id) | ||||
|  | ||||
|         video_id = anvplayer_data['video'] | ||||
|         access_key = anvplayer_data['accessKey'] | ||||
|  | ||||
|         video_data = self._get_video_json(access_key, video_id) | ||||
|  | ||||
|         formats = [] | ||||
|         for published_url in video_data['published_urls']: | ||||
|             video_url = published_url['embed_url'] | ||||
|             media_format = published_url.get('format') | ||||
|             ext = determine_ext(video_url) | ||||
|  | ||||
|             if ext == 'smil' or media_format == 'smil': | ||||
|             if ext == 'smil': | ||||
|                 formats.extend(self._extract_smil_formats(video_url, video_id)) | ||||
|                 continue | ||||
|  | ||||
| @@ -177,7 +183,7 @@ class AnvatoIE(InfoExtractor): | ||||
|                 'tbr': tbr if tbr != 0 else None, | ||||
|             } | ||||
|  | ||||
|             if ext == 'm3u8' or media_format in ('m3u8', 'm3u8-variant'): | ||||
|             if ext == 'm3u8': | ||||
|                 # Not using _extract_m3u8_formats here as individual media | ||||
|                 # playlists are also included in published_urls. | ||||
|                 if tbr is None: | ||||
| @@ -188,7 +194,7 @@ class AnvatoIE(InfoExtractor): | ||||
|                         'format_id': '-'.join(filter(None, ['hls', compat_str(tbr)])), | ||||
|                         'ext': 'mp4', | ||||
|                     }) | ||||
|             elif ext == 'mp3' or media_format == 'mp3': | ||||
|             elif ext == 'mp3': | ||||
|                 a_format['vcodec'] = 'none' | ||||
|             else: | ||||
|                 a_format.update({ | ||||
| @@ -212,19 +218,7 @@ class AnvatoIE(InfoExtractor): | ||||
|             'formats': formats, | ||||
|             'title': video_data.get('def_title'), | ||||
|             'description': video_data.get('def_description'), | ||||
|             'tags': video_data.get('def_tags', '').split(','), | ||||
|             'categories': video_data.get('categories'), | ||||
|             'thumbnail': video_data.get('thumbnail'), | ||||
|             'timestamp': int_or_none(video_data.get( | ||||
|                 'ts_published') or video_data.get('ts_added')), | ||||
|             'uploader': video_data.get('mcp_id'), | ||||
|             'duration': int_or_none(video_data.get('duration')), | ||||
|             'subtitles': subtitles, | ||||
|         } | ||||
|  | ||||
|     def _extract_anvato_videos(self, webpage, video_id): | ||||
|         anvplayer_data = self._parse_json(self._html_search_regex( | ||||
|             r'<script[^>]+data-anvp=\'([^\']+)\'', webpage, | ||||
|             'Anvato player data'), video_id) | ||||
|         return self._get_anvato_videos( | ||||
|             anvplayer_data['accessKey'], anvplayer_data['video']) | ||||
|   | ||||
| @@ -12,7 +12,7 @@ from ..utils import ( | ||||
|  | ||||
| class AolIE(InfoExtractor): | ||||
|     IE_NAME = 'on.aol.com' | ||||
|     _VALID_URL = r'(?:aol-video:|https?://(?:(?:www|on)\.)?aol\.com/(?:[^/]+/)*(?:[^/?#&]+-)?)(?P<id>[^/?#&]+)' | ||||
|     _VALID_URL = r'(?:aol-video:|https?://on\.aol\.com/(?:[^/]+/)*(?:[^/?#&]+-)?)(?P<id>[^/?#&]+)' | ||||
|  | ||||
|     _TESTS = [{ | ||||
|         # video with 5min ID | ||||
| @@ -33,7 +33,7 @@ class AolIE(InfoExtractor): | ||||
|         } | ||||
|     }, { | ||||
|         # video with vidible ID | ||||
|         'url': 'http://www.aol.com/video/view/netflix-is-raising-rates/5707d6b8e4b090497b04f706/', | ||||
|         'url': 'http://on.aol.com/video/netflix-is-raising-rates-5707d6b8e4b090497b04f706?context=PC:homepage:PL1944:1460189336183', | ||||
|         'info_dict': { | ||||
|             'id': '5707d6b8e4b090497b04f706', | ||||
|             'ext': 'mp4', | ||||
| @@ -108,3 +108,26 @@ class AolIE(InfoExtractor): | ||||
|             'uploader': video_data.get('videoOwner'), | ||||
|             'formats': formats, | ||||
|         } | ||||
|  | ||||
|  | ||||
| class AolFeaturesIE(InfoExtractor): | ||||
|     IE_NAME = 'features.aol.com' | ||||
|     _VALID_URL = r'https?://features\.aol\.com/video/(?P<id>[^/?#]+)' | ||||
|  | ||||
|     _TESTS = [{ | ||||
|         'url': 'http://features.aol.com/video/behind-secret-second-careers-late-night-talk-show-hosts', | ||||
|         'md5': '7db483bb0c09c85e241f84a34238cc75', | ||||
|         'info_dict': { | ||||
|             'id': '519507715', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'What To Watch - February 17, 2016', | ||||
|         }, | ||||
|         'add_ie': ['FiveMin'], | ||||
|     }] | ||||
|  | ||||
|     def _real_extract(self, url): | ||||
|         display_id = self._match_id(url) | ||||
|         webpage = self._download_webpage(url, display_id) | ||||
|         return self.url_result(self._search_regex( | ||||
|             r'<script type="text/javascript" src="(https?://[^/]*?5min\.com/Scripts/PlayerSeed\.js[^"]+)"', | ||||
|             webpage, '5min embed url'), 'FiveMin') | ||||
|   | ||||
| @@ -1,6 +1,8 @@ | ||||
| # coding: utf-8 | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| import re | ||||
|  | ||||
| from .common import InfoExtractor | ||||
| from ..utils import ( | ||||
|     ExtractorError, | ||||
| @@ -13,7 +15,7 @@ class AparatIE(InfoExtractor): | ||||
|  | ||||
|     _TEST = { | ||||
|         'url': 'http://www.aparat.com/v/wP8On', | ||||
|         'md5': '131aca2e14fe7c4dcb3c4877ba300c89', | ||||
|         'md5': '6714e0af7e0d875c5a39c4dc4ab46ad1', | ||||
|         'info_dict': { | ||||
|             'id': 'wP8On', | ||||
|             'ext': 'mp4', | ||||
| @@ -29,13 +31,13 @@ class AparatIE(InfoExtractor): | ||||
|         # Note: There is an easier-to-parse configuration at | ||||
|         # http://www.aparat.com/video/video/config/videohash/%video_id | ||||
|         # but the URL in there does not work | ||||
|         embed_url = 'http://www.aparat.com/video/video/embed/vt/frame/showvideo/yes/videohash/' + video_id | ||||
|         embed_url = ('http://www.aparat.com/video/video/embed/videohash/' + | ||||
|                      video_id + '/vt/frame') | ||||
|         webpage = self._download_webpage(embed_url, video_id) | ||||
|  | ||||
|         file_list = self._parse_json(self._search_regex( | ||||
|             r'fileList\s*=\s*JSON\.parse\(\'([^\']+)\'\)', webpage, 'file list'), video_id) | ||||
|         for i, item in enumerate(file_list[0]): | ||||
|             video_url = item['file'] | ||||
|         video_urls = [video_url.replace('\\/', '/') for video_url in re.findall( | ||||
|             r'(?:fileList\[[0-9]+\]\s*=|"file"\s*:)\s*"([^"]+)"', webpage)] | ||||
|         for i, video_url in enumerate(video_urls): | ||||
|             req = HEADRequest(video_url) | ||||
|             res = self._request_webpage( | ||||
|                 req, video_id, note='Testing video URL %d' % i, errnote=False) | ||||
|   | ||||
| @@ -1,65 +1,67 @@ | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| from .common import InfoExtractor | ||||
| from ..utils import ( | ||||
|     unified_strdate, | ||||
|     clean_html, | ||||
| ) | ||||
| from ..utils import unified_strdate | ||||
|  | ||||
|  | ||||
| class ArchiveOrgIE(InfoExtractor): | ||||
|     IE_NAME = 'archive.org' | ||||
|     IE_DESC = 'archive.org videos' | ||||
|     _VALID_URL = r'https?://(?:www\.)?archive\.org/(?:details|embed)/(?P<id>[^/?#]+)(?:[?].*)?$' | ||||
|     _VALID_URL = r'https?://(?:www\.)?archive\.org/details/(?P<id>[^?/]+)(?:[?].*)?$' | ||||
|     _TESTS = [{ | ||||
|         'url': 'http://archive.org/details/XD300-23_68HighlightsAResearchCntAugHumanIntellect', | ||||
|         'md5': '8af1d4cf447933ed3c7f4871162602db', | ||||
|         'info_dict': { | ||||
|             'id': 'XD300-23_68HighlightsAResearchCntAugHumanIntellect', | ||||
|             'ext': 'ogg', | ||||
|             'ext': 'ogv', | ||||
|             'title': '1968 Demo - FJCC Conference Presentation Reel #1', | ||||
|             'description': 'md5:da45c349df039f1cc8075268eb1b5c25', | ||||
|             'description': 'md5:1780b464abaca9991d8968c877bb53ed', | ||||
|             'upload_date': '19681210', | ||||
|             'uploader': 'SRI International' | ||||
|         } | ||||
|     }, { | ||||
|         'url': 'https://archive.org/details/Cops1922', | ||||
|         'md5': 'bc73c8ab3838b5a8fc6c6651fa7b58ba', | ||||
|         'md5': '18f2a19e6d89af8425671da1cf3d4e04', | ||||
|         'info_dict': { | ||||
|             'id': 'Cops1922', | ||||
|             'ext': 'mp4', | ||||
|             'ext': 'ogv', | ||||
|             'title': 'Buster Keaton\'s "Cops" (1922)', | ||||
|             'description': 'md5:b4544662605877edd99df22f9620d858', | ||||
|             'description': 'md5:70f72ee70882f713d4578725461ffcc3', | ||||
|         } | ||||
|     }, { | ||||
|         'url': 'http://archive.org/embed/XD300-23_68HighlightsAResearchCntAugHumanIntellect', | ||||
|         'only_matching': True, | ||||
|     }] | ||||
|  | ||||
|     def _real_extract(self, url): | ||||
|         video_id = self._match_id(url) | ||||
|         webpage = self._download_webpage( | ||||
|             'http://archive.org/embed/' + video_id, video_id) | ||||
|         jwplayer_playlist = self._parse_json(self._search_regex( | ||||
|             r"(?s)Play\('[^']+'\s*,\s*(\[.+\])\s*,\s*{.*?}\);", | ||||
|             webpage, 'jwplayer playlist'), video_id) | ||||
|         info = self._parse_jwplayer_data( | ||||
|             {'playlist': jwplayer_playlist}, video_id, base_url=url) | ||||
|  | ||||
|         def get_optional(metadata, field): | ||||
|             return metadata.get(field, [None])[0] | ||||
|         json_url = url + ('&' if '?' in url else '?') + 'output=json' | ||||
|         data = self._download_json(json_url, video_id) | ||||
|  | ||||
|         metadata = self._download_json( | ||||
|             'http://archive.org/details/' + video_id, video_id, query={ | ||||
|                 'output': 'json', | ||||
|             })['metadata'] | ||||
|         info.update({ | ||||
|             'title': get_optional(metadata, 'title') or info.get('title'), | ||||
|             'description': clean_html(get_optional(metadata, 'description')), | ||||
|         }) | ||||
|         if info.get('_type') != 'playlist': | ||||
|             info.update({ | ||||
|                 'uploader': get_optional(metadata, 'creator'), | ||||
|                 'upload_date': unified_strdate(get_optional(metadata, 'date')), | ||||
|             }) | ||||
|         return info | ||||
|         def get_optional(data_dict, field): | ||||
|             return data_dict['metadata'].get(field, [None])[0] | ||||
|  | ||||
|         title = get_optional(data, 'title') | ||||
|         description = get_optional(data, 'description') | ||||
|         uploader = get_optional(data, 'creator') | ||||
|         upload_date = unified_strdate(get_optional(data, 'date')) | ||||
|  | ||||
|         formats = [ | ||||
|             { | ||||
|                 'format': fdata['format'], | ||||
|                 'url': 'http://' + data['server'] + data['dir'] + fn, | ||||
|                 'file_size': int(fdata['size']), | ||||
|             } | ||||
|             for fn, fdata in data['files'].items() | ||||
|             if 'Video' in fdata['format']] | ||||
|  | ||||
|         self._sort_formats(formats) | ||||
|  | ||||
|         return { | ||||
|             '_type': 'video', | ||||
|             'id': video_id, | ||||
|             'title': title, | ||||
|             'formats': formats, | ||||
|             'description': description, | ||||
|             'uploader': uploader, | ||||
|             'upload_date': upload_date, | ||||
|             'thumbnail': data.get('misc', {}).get('image'), | ||||
|         } | ||||
|   | ||||
| @@ -73,7 +73,6 @@ class ARDMediathekIE(InfoExtractor): | ||||
|             'description': 'md5:c0c1c8048514deaed2a73b3a60eecacb', | ||||
|             'duration': 3287, | ||||
|         }, | ||||
|         'skip': 'Video is no longer available', | ||||
|     }] | ||||
|  | ||||
|     def _extract_media_info(self, media_info_url, webpage, video_id): | ||||
| @@ -174,15 +173,11 @@ class ARDMediathekIE(InfoExtractor): | ||||
|  | ||||
|         webpage = self._download_webpage(url, video_id) | ||||
|  | ||||
|         ERRORS = ( | ||||
|             ('>Leider liegt eine Störung vor.', 'Video %s is unavailable'), | ||||
|             ('>Der gewünschte Beitrag ist nicht mehr verfügbar.<', | ||||
|              'Video %s is no longer available'), | ||||
|         ) | ||||
|         if '>Der gewünschte Beitrag ist nicht mehr verfügbar.<' in webpage: | ||||
|             raise ExtractorError('Video %s is no longer available' % video_id, expected=True) | ||||
|  | ||||
|         for pattern, message in ERRORS: | ||||
|             if pattern in webpage: | ||||
|                 raise ExtractorError(message % video_id, expected=True) | ||||
|         if 'Diese Sendung ist für Jugendliche unter 12 Jahren nicht geeignet. Der Clip ist deshalb nur von 20 bis 6 Uhr verfügbar.' in webpage: | ||||
|             raise ExtractorError('This program is only suitable for those aged 12 and older. Video %s is therefore only available between 20 pm and 6 am.' % video_id, expected=True) | ||||
|  | ||||
|         if re.search(r'[\?&]rss($|[=&])', url): | ||||
|             doc = compat_etree_fromstring(webpage.encode('utf-8')) | ||||
| @@ -242,7 +237,7 @@ class ARDMediathekIE(InfoExtractor): | ||||
|  | ||||
|  | ||||
| class ARDIE(InfoExtractor): | ||||
|     _VALID_URL = r'(?P<mainurl>https?://(www\.)?daserste\.de/[^?#]+/videos/(?P<display_id>[^/?#]+)-(?P<id>[0-9]+))\.html' | ||||
|     _VALID_URL = '(?P<mainurl>https?://(www\.)?daserste\.de/[^?#]+/videos/(?P<display_id>[^/?#]+)-(?P<id>[0-9]+))\.html' | ||||
|     _TEST = { | ||||
|         'url': 'http://www.daserste.de/information/reportage-dokumentation/dokus/videos/die-story-im-ersten-mission-unter-falscher-flagge-100.html', | ||||
|         'md5': 'd216c3a86493f9322545e045ddc3eb35', | ||||
| @@ -253,7 +248,7 @@ class ARDIE(InfoExtractor): | ||||
|             'duration': 2600, | ||||
|             'title': 'Die Story im Ersten: Mission unter falscher Flagge', | ||||
|             'upload_date': '20140804', | ||||
|             'thumbnail': r're:^https?://.*\.jpg$', | ||||
|             'thumbnail': 're:^https?://.*\.jpg$', | ||||
|         }, | ||||
|         'skip': 'HTTP Error 404: Not Found', | ||||
|     } | ||||
|   | ||||
| @@ -4,10 +4,8 @@ from __future__ import unicode_literals | ||||
| import re | ||||
|  | ||||
| from .common import InfoExtractor | ||||
| from ..compat import compat_urlparse | ||||
| from ..utils import ( | ||||
|     determine_ext, | ||||
|     ExtractorError, | ||||
|     float_or_none, | ||||
|     int_or_none, | ||||
|     mimetype2ext, | ||||
| @@ -17,13 +15,7 @@ from ..utils import ( | ||||
|  | ||||
|  | ||||
| class ArkenaIE(InfoExtractor): | ||||
|     _VALID_URL = r'''(?x) | ||||
|                         https?:// | ||||
|                             (?: | ||||
|                                 video\.arkena\.com/play2/embed/player\?| | ||||
|                                 play\.arkena\.com/(?:config|embed)/avp/v\d/player/media/(?P<id>[^/]+)/[^/]+/(?P<account_id>\d+) | ||||
|                             ) | ||||
|                         ''' | ||||
|     _VALID_URL = r'https?://play\.arkena\.com/(?:config|embed)/avp/v\d/player/media/(?P<id>[^/]+)/[^/]+/(?P<account_id>\d+)' | ||||
|     _TESTS = [{ | ||||
|         'url': 'https://play.arkena.com/embed/avp/v2/player/media/b41dda37-d8e7-4d3f-b1b5-9a9db578bdfe/1/129411', | ||||
|         'md5': 'b96f2f71b359a8ecd05ce4e1daa72365', | ||||
| @@ -45,9 +37,6 @@ class ArkenaIE(InfoExtractor): | ||||
|     }, { | ||||
|         'url': 'http://play.arkena.com/embed/avp/v1/player/media/327336/darkmatter/131064/', | ||||
|         'only_matching': True, | ||||
|     }, { | ||||
|         'url': 'http://video.arkena.com/play2/embed/player?accountId=472718&mediaId=35763b3b-00090078-bf604299&pageStyling=styled', | ||||
|         'only_matching': True, | ||||
|     }] | ||||
|  | ||||
|     @staticmethod | ||||
| @@ -64,14 +53,6 @@ class ArkenaIE(InfoExtractor): | ||||
|         video_id = mobj.group('id') | ||||
|         account_id = mobj.group('account_id') | ||||
|  | ||||
|         # Handle http://video.arkena.com/play2/embed/player URL | ||||
|         if not video_id: | ||||
|             qs = compat_urlparse.parse_qs(compat_urlparse.urlparse(url).query) | ||||
|             video_id = qs.get('mediaId', [None])[0] | ||||
|             account_id = qs.get('accountId', [None])[0] | ||||
|             if not video_id or not account_id: | ||||
|                 raise ExtractorError('Invalid URL', expected=True) | ||||
|  | ||||
|         playlist = self._download_json( | ||||
|             'https://play.arkena.com/config/avp/v2/player/media/%s/0/%s/?callbackMethod=_' | ||||
|             % (video_id, account_id), | ||||
|   | ||||
| @@ -1,4 +1,4 @@ | ||||
| # coding: utf-8 | ||||
| # encoding: utf-8 | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| import re | ||||
| @@ -410,22 +410,6 @@ class ArteTVEmbedIE(ArteTVPlus7IE): | ||||
|         return self._extract_from_json_url(json_url, video_id, lang) | ||||
|  | ||||
|  | ||||
| class TheOperaPlatformIE(ArteTVPlus7IE): | ||||
|     IE_NAME = 'theoperaplatform' | ||||
|     _VALID_URL = r'https?://(?:www\.)?theoperaplatform\.eu/(?P<lang>fr|de|en|es)/(?P<id>[^/?#&]+)' | ||||
|  | ||||
|     _TESTS = [{ | ||||
|         'url': 'http://www.theoperaplatform.eu/de/opera/verdi-otello', | ||||
|         'md5': '970655901fa2e82e04c00b955e9afe7b', | ||||
|         'info_dict': { | ||||
|             'id': '060338-009-A', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'Verdi - OTELLO', | ||||
|             'upload_date': '20160927', | ||||
|         }, | ||||
|     }] | ||||
|  | ||||
|  | ||||
| class ArteTVPlaylistIE(ArteTVBaseIE): | ||||
|     IE_NAME = 'arte.tv:playlist' | ||||
|     _VALID_URL = r'https?://(?:www\.)?arte\.tv/guide/(?P<lang>fr|de|en|es)/[^#]*#collection/(?P<id>PL-\d+)' | ||||
|   | ||||
| @@ -30,7 +30,7 @@ class AtresPlayerIE(InfoExtractor): | ||||
|                 'title': 'Especial Solidario de Nochebuena', | ||||
|                 'description': 'md5:e2d52ff12214fa937107d21064075bf1', | ||||
|                 'duration': 5527.6, | ||||
|                 'thumbnail': r're:^https?://.*\.jpg$', | ||||
|                 'thumbnail': 're:^https?://.*\.jpg$', | ||||
|             }, | ||||
|             'skip': 'This video is only available for registered users' | ||||
|         }, | ||||
| @@ -43,7 +43,7 @@ class AtresPlayerIE(InfoExtractor): | ||||
|                 'title': 'David Bustamante', | ||||
|                 'description': 'md5:f33f1c0a05be57f6708d4dd83a3b81c6', | ||||
|                 'duration': 1439.0, | ||||
|                 'thumbnail': r're:^https?://.*\.jpg$', | ||||
|                 'thumbnail': 're:^https?://.*\.jpg$', | ||||
|             }, | ||||
|         }, | ||||
|         { | ||||
|   | ||||
| @@ -14,7 +14,7 @@ class ATTTechChannelIE(InfoExtractor): | ||||
|             'ext': 'flv', | ||||
|             'title': 'AT&T Archives : The UNIX System: Making Computers Easier to Use', | ||||
|             'description': 'A 1982 film about UNIX is the foundation for software in use around Bell Labs and AT&T.', | ||||
|             'thumbnail': r're:^https?://.*\.jpg$', | ||||
|             'thumbnail': 're:^https?://.*\.jpg$', | ||||
|             'upload_date': '20140127', | ||||
|         }, | ||||
|         'params': { | ||||
|   | ||||
| @@ -6,8 +6,8 @@ from ..utils import float_or_none | ||||
|  | ||||
|  | ||||
| class AudioBoomIE(InfoExtractor): | ||||
|     _VALID_URL = r'https?://(?:www\.)?audioboom\.com/(?:boos|posts)/(?P<id>[0-9]+)' | ||||
|     _TESTS = [{ | ||||
|     _VALID_URL = r'https?://(?:www\.)?audioboom\.com/boos/(?P<id>[0-9]+)' | ||||
|     _TEST = { | ||||
|         'url': 'https://audioboom.com/boos/4279833-3-09-2016-czaban-hour-3?t=0', | ||||
|         'md5': '63a8d73a055c6ed0f1e51921a10a5a76', | ||||
|         'info_dict': { | ||||
| @@ -17,12 +17,9 @@ class AudioBoomIE(InfoExtractor): | ||||
|             'description': 'Guest:   Nate Davis - NFL free agency,   Guest:   Stan Gans', | ||||
|             'duration': 2245.72, | ||||
|             'uploader': 'Steve Czaban', | ||||
|             'uploader_url': r're:https?://(?:www\.)?audioboom\.com/channel/steveczabanyahoosportsradio', | ||||
|             'uploader_url': 're:https?://(?:www\.)?audioboom\.com/channel/steveczabanyahoosportsradio', | ||||
|         } | ||||
|     }, { | ||||
|         'url': 'https://audioboom.com/posts/4279833-3-09-2016-czaban-hour-3?t=0', | ||||
|         'only_matching': True, | ||||
|     }] | ||||
|     } | ||||
|  | ||||
|     def _real_extract(self, url): | ||||
|         video_id = self._match_id(url) | ||||
|   | ||||
| @@ -1,172 +0,0 @@ | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| import re | ||||
|  | ||||
| from .common import InfoExtractor | ||||
| from .kaltura import KalturaIE | ||||
| from ..utils import ( | ||||
|     get_element_by_id, | ||||
|     strip_or_none, | ||||
|     urljoin, | ||||
| ) | ||||
|  | ||||
|  | ||||
| class AZMedienBaseIE(InfoExtractor): | ||||
|     def _kaltura_video(self, partner_id, entry_id): | ||||
|         return self.url_result( | ||||
|             'kaltura:%s:%s' % (partner_id, entry_id), ie=KalturaIE.ie_key(), | ||||
|             video_id=entry_id) | ||||
|  | ||||
|  | ||||
| class AZMedienIE(AZMedienBaseIE): | ||||
|     IE_DESC = 'AZ Medien videos' | ||||
|     _VALID_URL = r'''(?x) | ||||
|                     https?:// | ||||
|                         (?:www\.)? | ||||
|                         (?: | ||||
|                             telezueri\.ch| | ||||
|                             telebaern\.tv| | ||||
|                             telem1\.ch | ||||
|                         )/ | ||||
|                         [0-9]+-show-[^/\#]+ | ||||
|                         (?: | ||||
|                             /[0-9]+-episode-[^/\#]+ | ||||
|                             (?: | ||||
|                                 /[0-9]+-segment-(?:[^/\#]+\#)?| | ||||
|                                 \# | ||||
|                             )| | ||||
|                             \# | ||||
|                         ) | ||||
|                         (?P<id>[^\#]+) | ||||
|                     ''' | ||||
|  | ||||
|     _TESTS = [{ | ||||
|         # URL with 'segment' | ||||
|         'url': 'http://www.telezueri.ch/62-show-zuerinews/13772-episode-sonntag-18-dezember-2016/32419-segment-massenabweisungen-beim-hiltl-club-wegen-pelzboom', | ||||
|         'info_dict': { | ||||
|             'id': '1_2444peh4', | ||||
|             'ext': 'mov', | ||||
|             'title': 'Massenabweisungen beim Hiltl Club wegen Pelzboom', | ||||
|             'description': 'md5:9ea9dd1b159ad65b36ddcf7f0d7c76a8', | ||||
|             'uploader_id': 'TeleZ?ri', | ||||
|             'upload_date': '20161218', | ||||
|             'timestamp': 1482084490, | ||||
|         }, | ||||
|         'params': { | ||||
|             'skip_download': True, | ||||
|         }, | ||||
|     }, { | ||||
|         # URL with 'segment' and fragment: | ||||
|         'url': 'http://www.telebaern.tv/118-show-news/14240-episode-dienstag-17-januar-2017/33666-segment-achtung-gefahr#zu-wenig-pflegerinnen-und-pfleger', | ||||
|         'only_matching': True | ||||
|     }, { | ||||
|         # URL with 'episode' and fragment: | ||||
|         'url': 'http://www.telem1.ch/47-show-sonntalk/13986-episode-soldaten-fuer-grenzschutz-energiestrategie-obama-bilanz#soldaten-fuer-grenzschutz-energiestrategie-obama-bilanz', | ||||
|         'only_matching': True | ||||
|     }, { | ||||
|         # URL with 'show' and fragment: | ||||
|         'url': 'http://www.telezueri.ch/66-show-sonntalk#burka-plakate-trump-putin-china-besuch', | ||||
|         'only_matching': True | ||||
|     }] | ||||
|  | ||||
|     def _real_extract(self, url): | ||||
|         video_id = self._match_id(url) | ||||
|  | ||||
|         webpage = self._download_webpage(url, video_id) | ||||
|  | ||||
|         partner_id = self._search_regex( | ||||
|             r'<script[^>]+src=["\'](?:https?:)?//(?:[^/]+\.)?kaltura\.com(?:/[^/]+)*/(?:p|partner_id)/([0-9]+)', | ||||
|             webpage, 'kaltura partner id') | ||||
|         entry_id = self._html_search_regex( | ||||
|             r'<a[^>]+data-id=(["\'])(?P<id>(?:(?!\1).)+)\1[^>]+data-slug=["\']%s' | ||||
|             % re.escape(video_id), webpage, 'kaltura entry id', group='id') | ||||
|  | ||||
|         return self._kaltura_video(partner_id, entry_id) | ||||
|  | ||||
|  | ||||
| class AZMedienPlaylistIE(AZMedienBaseIE): | ||||
|     IE_DESC = 'AZ Medien playlists' | ||||
|     _VALID_URL = r'''(?x) | ||||
|                     https?:// | ||||
|                         (?:www\.)? | ||||
|                         (?: | ||||
|                             telezueri\.ch| | ||||
|                             telebaern\.tv| | ||||
|                             telem1\.ch | ||||
|                         )/ | ||||
|                         (?P<id>[0-9]+- | ||||
|                             (?: | ||||
|                                 show| | ||||
|                                 topic| | ||||
|                                 themen | ||||
|                             )-[^/\#]+ | ||||
|                             (?: | ||||
|                                 /[0-9]+-episode-[^/\#]+ | ||||
|                             )? | ||||
|                         )$ | ||||
|                     ''' | ||||
|  | ||||
|     _TESTS = [{ | ||||
|         # URL with 'episode' | ||||
|         'url': 'http://www.telebaern.tv/118-show-news/13735-episode-donnerstag-15-dezember-2016', | ||||
|         'info_dict': { | ||||
|             'id': '118-show-news/13735-episode-donnerstag-15-dezember-2016', | ||||
|             'title': 'News - Donnerstag, 15. Dezember 2016', | ||||
|         }, | ||||
|         'playlist_count': 9, | ||||
|     }, { | ||||
|         # URL with 'themen' | ||||
|         'url': 'http://www.telem1.ch/258-themen-tele-m1-classics', | ||||
|         'info_dict': { | ||||
|             'id': '258-themen-tele-m1-classics', | ||||
|             'title': 'Tele M1 Classics', | ||||
|         }, | ||||
|         'playlist_mincount': 15, | ||||
|     }, { | ||||
|         # URL with 'topic', contains nested playlists | ||||
|         'url': 'http://www.telezueri.ch/219-topic-aera-trump-hat-offiziell-begonnen', | ||||
|         'only_matching': True, | ||||
|     }, { | ||||
|         # URL with 'show' only | ||||
|         'url': 'http://www.telezueri.ch/86-show-talktaeglich', | ||||
|         'only_matching': True | ||||
|     }] | ||||
|  | ||||
|     def _real_extract(self, url): | ||||
|         show_id = self._match_id(url) | ||||
|         webpage = self._download_webpage(url, show_id) | ||||
|  | ||||
|         entries = [] | ||||
|  | ||||
|         partner_id = self._search_regex( | ||||
|             r'src=["\'](?:https?:)?//(?:[^/]+\.)kaltura\.com/(?:[^/]+/)*(?:p|partner_id)/(\d+)', | ||||
|             webpage, 'kaltura partner id', default=None) | ||||
|  | ||||
|         if partner_id: | ||||
|             entries = [ | ||||
|                 self._kaltura_video(partner_id, m.group('id')) | ||||
|                 for m in re.finditer( | ||||
|                     r'data-id=(["\'])(?P<id>(?:(?!\1).)+)\1', webpage)] | ||||
|  | ||||
|         if not entries: | ||||
|             entries = [ | ||||
|                 self.url_result(m.group('url'), ie=AZMedienIE.ie_key()) | ||||
|                 for m in re.finditer( | ||||
|                     r'<a[^>]+data-real=(["\'])(?P<url>http.+?)\1', webpage)] | ||||
|  | ||||
|         if not entries: | ||||
|             entries = [ | ||||
|                 # May contain nested playlists (e.g. [1]) thus no explicit | ||||
|                 # ie_key | ||||
|                 # 1. http://www.telezueri.ch/219-topic-aera-trump-hat-offiziell-begonnen | ||||
|                 self.url_result(urljoin(url, m.group('url'))) | ||||
|                 for m in re.finditer( | ||||
|                     r'<a[^>]+name=[^>]+href=(["\'])(?P<url>/.+?)\1', webpage)] | ||||
|  | ||||
|         title = self._search_regex( | ||||
|             r'episodeShareTitle\s*=\s*(["\'])(?P<title>(?:(?!\1).)+)\1', | ||||
|             webpage, 'title', | ||||
|             default=strip_or_none(get_element_by_id( | ||||
|                 'video-title', webpage)), group='title') | ||||
|  | ||||
|         return self.playlist_result(entries, show_id, title) | ||||
| @@ -11,7 +11,7 @@ from ..utils import ( | ||||
|  | ||||
|  | ||||
| class AzubuIE(InfoExtractor): | ||||
|     _VALID_URL = r'https?://(?:www\.)?azubu\.(?:tv|uol.com.br)/[^/]+#!/play/(?P<id>\d+)' | ||||
|     _VALID_URL = r'https?://(?:www\.)?azubu\.tv/[^/]+#!/play/(?P<id>\d+)' | ||||
|     _TESTS = [ | ||||
|         { | ||||
|             'url': 'http://www.azubu.tv/GSL#!/play/15575/2014-hot6-cup-last-big-match-ro8-day-1', | ||||
| @@ -21,7 +21,7 @@ class AzubuIE(InfoExtractor): | ||||
|                 'ext': 'mp4', | ||||
|                 'title': '2014 HOT6 CUP LAST BIG MATCH Ro8 Day 1', | ||||
|                 'description': 'md5:d06bdea27b8cc4388a90ad35b5c66c01', | ||||
|                 'thumbnail': r're:^https?://.*\.jpe?g', | ||||
|                 'thumbnail': 're:^https?://.*\.jpe?g', | ||||
|                 'timestamp': 1417523507.334, | ||||
|                 'upload_date': '20141202', | ||||
|                 'duration': 9988.7, | ||||
| @@ -38,7 +38,7 @@ class AzubuIE(InfoExtractor): | ||||
|                 'ext': 'mp4', | ||||
|                 'title': 'Fnatic at Worlds 2014: Toyz - "I love Rekkles, he has amazing mechanics"', | ||||
|                 'description': 'md5:4a649737b5f6c8b5c5be543e88dc62af', | ||||
|                 'thumbnail': r're:^https?://.*\.jpe?g', | ||||
|                 'thumbnail': 're:^https?://.*\.jpe?g', | ||||
|                 'timestamp': 1410530893.320, | ||||
|                 'upload_date': '20140912', | ||||
|                 'duration': 172.385, | ||||
| @@ -103,15 +103,12 @@ class AzubuIE(InfoExtractor): | ||||
|  | ||||
|  | ||||
| class AzubuLiveIE(InfoExtractor): | ||||
|     _VALID_URL = r'https?://(?:www\.)?azubu\.(?:tv|uol.com.br)/(?P<id>[^/]+)$' | ||||
|     _VALID_URL = r'https?://www.azubu.tv/(?P<id>[^/]+)$' | ||||
|  | ||||
|     _TESTS = [{ | ||||
|     _TEST = { | ||||
|         'url': 'http://www.azubu.tv/MarsTVMDLen', | ||||
|         'only_matching': True, | ||||
|     }, { | ||||
|         'url': 'http://azubu.uol.com.br/adolfz', | ||||
|         'only_matching': True, | ||||
|     }] | ||||
|     } | ||||
|  | ||||
|     def _real_extract(self, url): | ||||
|         user = self._match_id(url) | ||||
|   | ||||
| @@ -1,9 +1,7 @@ | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| import json | ||||
| import random | ||||
| import re | ||||
| import time | ||||
|  | ||||
| from .common import InfoExtractor | ||||
| from ..compat import ( | ||||
| @@ -14,9 +12,6 @@ from ..utils import ( | ||||
|     ExtractorError, | ||||
|     float_or_none, | ||||
|     int_or_none, | ||||
|     parse_filesize, | ||||
|     unescapeHTML, | ||||
|     update_url_query, | ||||
| ) | ||||
|  | ||||
|  | ||||
| @@ -86,68 +81,35 @@ class BandcampIE(InfoExtractor): | ||||
|             r'(?ms)var TralbumData = .*?[{,]\s*id: (?P<id>\d+),?$', | ||||
|             webpage, 'video id') | ||||
|  | ||||
|         download_webpage = self._download_webpage( | ||||
|             download_link, video_id, 'Downloading free downloads page') | ||||
|  | ||||
|         blob = self._parse_json( | ||||
|             self._search_regex( | ||||
|                 r'data-blob=(["\'])(?P<blob>{.+?})\1', download_webpage, | ||||
|                 'blob', group='blob'), | ||||
|             video_id, transform_source=unescapeHTML) | ||||
|  | ||||
|         info = blob['digital_items'][0] | ||||
|  | ||||
|         downloads = info['downloads'] | ||||
|         track = info['title'] | ||||
|  | ||||
|         artist = info.get('artist') | ||||
|         title = '%s - %s' % (artist, track) if artist else track | ||||
|  | ||||
|         download_formats = {} | ||||
|         for f in blob['download_formats']: | ||||
|             name, ext = f.get('name'), f.get('file_extension') | ||||
|             if all(isinstance(x, compat_str) for x in (name, ext)): | ||||
|                 download_formats[name] = ext.strip('.') | ||||
|  | ||||
|         formats = [] | ||||
|         for format_id, f in downloads.items(): | ||||
|             format_url = f.get('url') | ||||
|             if not format_url: | ||||
|                 continue | ||||
|             # Stat URL generation algorithm is reverse engineered from | ||||
|             # download_*_bundle_*.js | ||||
|             stat_url = update_url_query( | ||||
|                 format_url.replace('/download/', '/statdownload/'), { | ||||
|                     '.rand': int(time.time() * 1000 * random.random()), | ||||
|                 }) | ||||
|             format_id = f.get('encoding_name') or format_id | ||||
|             stat = self._download_json( | ||||
|                 stat_url, video_id, 'Downloading %s JSON' % format_id, | ||||
|                 transform_source=lambda s: s[s.index('{'):s.rindex('}') + 1], | ||||
|                 fatal=False) | ||||
|             if not stat: | ||||
|                 continue | ||||
|             retry_url = stat.get('retry_url') | ||||
|             if not isinstance(retry_url, compat_str): | ||||
|                 continue | ||||
|             formats.append({ | ||||
|                 'url': self._proto_relative_url(retry_url, 'http:'), | ||||
|                 'ext': download_formats.get(format_id), | ||||
|                 'format_id': format_id, | ||||
|                 'format_note': f.get('description'), | ||||
|                 'filesize': parse_filesize(f.get('size_mb')), | ||||
|                 'vcodec': 'none', | ||||
|             }) | ||||
|         self._sort_formats(formats) | ||||
|         download_webpage = self._download_webpage(download_link, video_id, 'Downloading free downloads page') | ||||
|         # We get the dictionary of the track from some javascript code | ||||
|         all_info = self._parse_json(self._search_regex( | ||||
|             r'(?sm)items: (.*?),$', download_webpage, 'items'), video_id) | ||||
|         info = all_info[0] | ||||
|         # We pick mp3-320 for now, until format selection can be easily implemented. | ||||
|         mp3_info = info['downloads']['mp3-320'] | ||||
|         # If we try to use this url it says the link has expired | ||||
|         initial_url = mp3_info['url'] | ||||
|         m_url = re.match( | ||||
|             r'(?P<server>http://(.*?)\.bandcamp\.com)/download/track\?enc=mp3-320&fsig=(?P<fsig>.*?)&id=(?P<id>.*?)&ts=(?P<ts>.*)$', | ||||
|             initial_url) | ||||
|         # We build the url we will use to get the final track url | ||||
|         # This URL is built by Bandcamp in the script download_bundle_*.js | ||||
|         request_url = '%s/statdownload/track?enc=mp3-320&fsig=%s&id=%s&ts=%s&.rand=665028774616&.vrs=1' % (m_url.group('server'), m_url.group('fsig'), video_id, m_url.group('ts')) | ||||
|         final_url_webpage = self._download_webpage(request_url, video_id, 'Requesting download url') | ||||
|         # If we could correctly generate the .rand field the url would be | ||||
|         # in the "download_url" key | ||||
|         final_url = self._proto_relative_url(self._search_regex( | ||||
|             r'"retry_url":"(.+?)"', final_url_webpage, 'final video URL'), 'http:') | ||||
|  | ||||
|         return { | ||||
|             'id': video_id, | ||||
|             'title': title, | ||||
|             'title': info['title'], | ||||
|             'ext': 'mp3', | ||||
|             'vcodec': 'none', | ||||
|             'url': final_url, | ||||
|             'thumbnail': info.get('thumb_url'), | ||||
|             'uploader': info.get('artist'), | ||||
|             'artist': artist, | ||||
|             'track': track, | ||||
|             'formats': formats, | ||||
|         } | ||||
|  | ||||
|  | ||||
| @@ -200,24 +162,6 @@ class BandcampAlbumIE(InfoExtractor): | ||||
|             'uploader_id': 'dotscale', | ||||
|         }, | ||||
|         'playlist_mincount': 7, | ||||
|     }, { | ||||
|         # with escaped quote in title | ||||
|         'url': 'https://jstrecords.bandcamp.com/album/entropy-ep', | ||||
|         'info_dict': { | ||||
|             'title': '"Entropy" EP', | ||||
|             'uploader_id': 'jstrecords', | ||||
|             'id': 'entropy-ep', | ||||
|         }, | ||||
|         'playlist_mincount': 3, | ||||
|     }, { | ||||
|         # not all tracks have songs | ||||
|         'url': 'https://insulters.bandcamp.com/album/we-are-the-plague', | ||||
|         'info_dict': { | ||||
|             'id': 'we-are-the-plague', | ||||
|             'title': 'WE ARE THE PLAGUE', | ||||
|             'uploader_id': 'insulters', | ||||
|         }, | ||||
|         'playlist_count': 2, | ||||
|     }] | ||||
|  | ||||
|     def _real_extract(self, url): | ||||
| @@ -226,21 +170,14 @@ class BandcampAlbumIE(InfoExtractor): | ||||
|         album_id = mobj.group('album_id') | ||||
|         playlist_id = album_id or uploader_id | ||||
|         webpage = self._download_webpage(url, playlist_id) | ||||
|         track_elements = re.findall( | ||||
|             r'(?s)<div[^>]*>(.*?<a[^>]+href="([^"]+?)"[^>]+itemprop="url"[^>]*>.*?)</div>', webpage) | ||||
|         if not track_elements: | ||||
|         tracks_paths = re.findall(r'<a href="(.*?)" itemprop="url">', webpage) | ||||
|         if not tracks_paths: | ||||
|             raise ExtractorError('The page doesn\'t contain any tracks') | ||||
|         # Only tracks with duration info have songs | ||||
|         entries = [ | ||||
|             self.url_result(compat_urlparse.urljoin(url, t_path), ie=BandcampIE.ie_key()) | ||||
|             for elem_content, t_path in track_elements | ||||
|             if self._html_search_meta('duration', elem_content, default=None)] | ||||
|  | ||||
|         title = self._html_search_regex( | ||||
|             r'album_title\s*:\s*"((?:\\.|[^"\\])+?)"', | ||||
|             webpage, 'title', fatal=False) | ||||
|         if title: | ||||
|             title = title.replace(r'\"', '"') | ||||
|             for t_path in tracks_paths] | ||||
|         title = self._search_regex( | ||||
|             r'album_title\s*:\s*"(.*?)"', webpage, 'title', fatal=False) | ||||
|         return { | ||||
|             '_type': 'playlist', | ||||
|             'uploader_id': uploader_id, | ||||
|   | ||||
| @@ -2,23 +2,19 @@ | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| import re | ||||
| import itertools | ||||
|  | ||||
| from .common import InfoExtractor | ||||
| from ..utils import ( | ||||
|     dict_get, | ||||
|     ExtractorError, | ||||
|     float_or_none, | ||||
|     int_or_none, | ||||
|     parse_duration, | ||||
|     parse_iso8601, | ||||
|     try_get, | ||||
|     unescapeHTML, | ||||
| ) | ||||
| from ..compat import ( | ||||
|     compat_etree_fromstring, | ||||
|     compat_HTTPError, | ||||
|     compat_urlparse, | ||||
| ) | ||||
|  | ||||
|  | ||||
| @@ -225,8 +221,6 @@ class BBCCoUkIE(InfoExtractor): | ||||
|         } | ||||
|     ] | ||||
|  | ||||
|     _USP_RE = r'/([^/]+?)\.ism(?:\.hlsv2\.ism)?/[^/]+\.m3u8' | ||||
|  | ||||
|     class MediaSelectionError(Exception): | ||||
|         def __init__(self, id): | ||||
|             self.id = id | ||||
| @@ -235,6 +229,51 @@ class BBCCoUkIE(InfoExtractor): | ||||
|         asx = self._download_xml(connection.get('href'), programme_id, 'Downloading ASX playlist') | ||||
|         return [ref.get('href') for ref in asx.findall('./Entry/ref')] | ||||
|  | ||||
|     def _extract_connection(self, connection, programme_id): | ||||
|         formats = [] | ||||
|         kind = connection.get('kind') | ||||
|         protocol = connection.get('protocol') | ||||
|         supplier = connection.get('supplier') | ||||
|         if protocol == 'http': | ||||
|             href = connection.get('href') | ||||
|             transfer_format = connection.get('transferFormat') | ||||
|             # ASX playlist | ||||
|             if supplier == 'asx': | ||||
|                 for i, ref in enumerate(self._extract_asx_playlist(connection, programme_id)): | ||||
|                     formats.append({ | ||||
|                         'url': ref, | ||||
|                         'format_id': 'ref%s_%s' % (i, supplier), | ||||
|                     }) | ||||
|             # Skip DASH until supported | ||||
|             elif transfer_format == 'dash': | ||||
|                 pass | ||||
|             elif transfer_format == 'hls': | ||||
|                 formats.extend(self._extract_m3u8_formats( | ||||
|                     href, programme_id, ext='mp4', entry_protocol='m3u8_native', | ||||
|                     m3u8_id=supplier, fatal=False)) | ||||
|             # Direct link | ||||
|             else: | ||||
|                 formats.append({ | ||||
|                     'url': href, | ||||
|                     'format_id': supplier or kind or protocol, | ||||
|                 }) | ||||
|         elif protocol == 'rtmp': | ||||
|             application = connection.get('application', 'ondemand') | ||||
|             auth_string = connection.get('authString') | ||||
|             identifier = connection.get('identifier') | ||||
|             server = connection.get('server') | ||||
|             formats.append({ | ||||
|                 'url': '%s://%s/%s?%s' % (protocol, server, application, auth_string), | ||||
|                 'play_path': identifier, | ||||
|                 'app': '%s?%s' % (application, auth_string), | ||||
|                 'page_url': 'http://www.bbc.co.uk', | ||||
|                 'player_url': 'http://www.bbc.co.uk/emp/releases/iplayer/revisions/617463_618125_4/617463_618125_4_emp.swf', | ||||
|                 'rtmp_live': False, | ||||
|                 'ext': 'flv', | ||||
|                 'format_id': supplier, | ||||
|             }) | ||||
|         return formats | ||||
|  | ||||
|     def _extract_items(self, playlist): | ||||
|         return playlist.findall('./{%s}item' % self._EMP_PLAYLIST_NS) | ||||
|  | ||||
| @@ -255,6 +294,46 @@ class BBCCoUkIE(InfoExtractor): | ||||
|     def _extract_connections(self, media): | ||||
|         return self._findall_ns(media, './{%s}connection') | ||||
|  | ||||
|     def _extract_video(self, media, programme_id): | ||||
|         formats = [] | ||||
|         vbr = int_or_none(media.get('bitrate')) | ||||
|         vcodec = media.get('encoding') | ||||
|         service = media.get('service') | ||||
|         width = int_or_none(media.get('width')) | ||||
|         height = int_or_none(media.get('height')) | ||||
|         file_size = int_or_none(media.get('media_file_size')) | ||||
|         for connection in self._extract_connections(media): | ||||
|             conn_formats = self._extract_connection(connection, programme_id) | ||||
|             for format in conn_formats: | ||||
|                 format.update({ | ||||
|                     'width': width, | ||||
|                     'height': height, | ||||
|                     'vbr': vbr, | ||||
|                     'vcodec': vcodec, | ||||
|                     'filesize': file_size, | ||||
|                 }) | ||||
|                 if service: | ||||
|                     format['format_id'] = '%s_%s' % (service, format['format_id']) | ||||
|             formats.extend(conn_formats) | ||||
|         return formats | ||||
|  | ||||
|     def _extract_audio(self, media, programme_id): | ||||
|         formats = [] | ||||
|         abr = int_or_none(media.get('bitrate')) | ||||
|         acodec = media.get('encoding') | ||||
|         service = media.get('service') | ||||
|         for connection in self._extract_connections(media): | ||||
|             conn_formats = self._extract_connection(connection, programme_id) | ||||
|             for format in conn_formats: | ||||
|                 format.update({ | ||||
|                     'format_id': '%s_%s' % (service, format['format_id']), | ||||
|                     'abr': abr, | ||||
|                     'acodec': acodec, | ||||
|                     'vcodec': 'none', | ||||
|                 }) | ||||
|             formats.extend(conn_formats) | ||||
|         return formats | ||||
|  | ||||
|     def _get_subtitles(self, media, programme_id): | ||||
|         subtitles = {} | ||||
|         for connection in self._extract_connections(media): | ||||
| @@ -300,96 +379,13 @@ class BBCCoUkIE(InfoExtractor): | ||||
|     def _process_media_selector(self, media_selection, programme_id): | ||||
|         formats = [] | ||||
|         subtitles = None | ||||
|         urls = [] | ||||
|  | ||||
|         for media in self._extract_medias(media_selection): | ||||
|             kind = media.get('kind') | ||||
|             if kind in ('video', 'audio'): | ||||
|                 bitrate = int_or_none(media.get('bitrate')) | ||||
|                 encoding = media.get('encoding') | ||||
|                 service = media.get('service') | ||||
|                 width = int_or_none(media.get('width')) | ||||
|                 height = int_or_none(media.get('height')) | ||||
|                 file_size = int_or_none(media.get('media_file_size')) | ||||
|                 for connection in self._extract_connections(media): | ||||
|                     href = connection.get('href') | ||||
|                     if href in urls: | ||||
|                         continue | ||||
|                     if href: | ||||
|                         urls.append(href) | ||||
|                     conn_kind = connection.get('kind') | ||||
|                     protocol = connection.get('protocol') | ||||
|                     supplier = connection.get('supplier') | ||||
|                     transfer_format = connection.get('transferFormat') | ||||
|                     format_id = supplier or conn_kind or protocol | ||||
|                     if service: | ||||
|                         format_id = '%s_%s' % (service, format_id) | ||||
|                     # ASX playlist | ||||
|                     if supplier == 'asx': | ||||
|                         for i, ref in enumerate(self._extract_asx_playlist(connection, programme_id)): | ||||
|                             formats.append({ | ||||
|                                 'url': ref, | ||||
|                                 'format_id': 'ref%s_%s' % (i, format_id), | ||||
|                             }) | ||||
|                     elif transfer_format == 'dash': | ||||
|                         formats.extend(self._extract_mpd_formats( | ||||
|                             href, programme_id, mpd_id=format_id, fatal=False)) | ||||
|                     elif transfer_format == 'hls': | ||||
|                         formats.extend(self._extract_m3u8_formats( | ||||
|                             href, programme_id, ext='mp4', entry_protocol='m3u8_native', | ||||
|                             m3u8_id=format_id, fatal=False)) | ||||
|                         if re.search(self._USP_RE, href): | ||||
|                             usp_formats = self._extract_m3u8_formats( | ||||
|                                 re.sub(self._USP_RE, r'/\1.ism/\1.m3u8', href), | ||||
|                                 programme_id, ext='mp4', entry_protocol='m3u8_native', | ||||
|                                 m3u8_id=format_id, fatal=False) | ||||
|                             for f in usp_formats: | ||||
|                                 if f.get('height') and f['height'] > 720: | ||||
|                                     continue | ||||
|                                 formats.append(f) | ||||
|                     elif transfer_format == 'hds': | ||||
|                         formats.extend(self._extract_f4m_formats( | ||||
|                             href, programme_id, f4m_id=format_id, fatal=False)) | ||||
|                     else: | ||||
|                         if not service and not supplier and bitrate: | ||||
|                             format_id += '-%d' % bitrate | ||||
|                         fmt = { | ||||
|                             'format_id': format_id, | ||||
|                             'filesize': file_size, | ||||
|                         } | ||||
|                         if kind == 'video': | ||||
|                             fmt.update({ | ||||
|                                 'width': width, | ||||
|                                 'height': height, | ||||
|                                 'vbr': bitrate, | ||||
|                                 'vcodec': encoding, | ||||
|                             }) | ||||
|                         else: | ||||
|                             fmt.update({ | ||||
|                                 'abr': bitrate, | ||||
|                                 'acodec': encoding, | ||||
|                                 'vcodec': 'none', | ||||
|                             }) | ||||
|                         if protocol == 'http': | ||||
|                             # Direct link | ||||
|                             fmt.update({ | ||||
|                                 'url': href, | ||||
|                             }) | ||||
|                         elif protocol == 'rtmp': | ||||
|                             application = connection.get('application', 'ondemand') | ||||
|                             auth_string = connection.get('authString') | ||||
|                             identifier = connection.get('identifier') | ||||
|                             server = connection.get('server') | ||||
|                             fmt.update({ | ||||
|                                 'url': '%s://%s/%s?%s' % (protocol, server, application, auth_string), | ||||
|                                 'play_path': identifier, | ||||
|                                 'app': '%s?%s' % (application, auth_string), | ||||
|                                 'page_url': 'http://www.bbc.co.uk', | ||||
|                                 'player_url': 'http://www.bbc.co.uk/emp/releases/iplayer/revisions/617463_618125_4/617463_618125_4_emp.swf', | ||||
|                                 'rtmp_live': False, | ||||
|                                 'ext': 'flv', | ||||
|                             }) | ||||
|                         formats.append(fmt) | ||||
|             if kind == 'audio': | ||||
|                 formats.extend(self._extract_audio(media, programme_id)) | ||||
|             elif kind == 'video': | ||||
|                 formats.extend(self._extract_video(media, programme_id)) | ||||
|             elif kind == 'captions': | ||||
|                 subtitles = self.extract_subtitles(media, programme_id) | ||||
|         return formats, subtitles | ||||
| @@ -593,7 +589,7 @@ class BBCIE(BBCCoUkIE): | ||||
|         'info_dict': { | ||||
|             'id': '150615_telabyad_kentin_cogu', | ||||
|             'ext': 'mp4', | ||||
|             'title': "YPG: Tel Abyad'ın tamamı kontrolümüzde", | ||||
|             'title': "Tel Abyad'da IŞİD bayrağı indirildi YPG bayrağı çekildi", | ||||
|             'description': 'md5:33a4805a855c9baf7115fcbde57e7025', | ||||
|             'timestamp': 1434397334, | ||||
|             'upload_date': '20150615', | ||||
| @@ -658,23 +654,6 @@ class BBCIE(BBCCoUkIE): | ||||
|             # rtmp download | ||||
|             'skip_download': True, | ||||
|         } | ||||
|     }, { | ||||
|         # single video embedded with Morph | ||||
|         'url': 'http://www.bbc.co.uk/sport/live/olympics/36895975', | ||||
|         'info_dict': { | ||||
|             'id': 'p041vhd0', | ||||
|             'ext': 'mp4', | ||||
|             'title': "Nigeria v Japan - Men's First Round", | ||||
|             'description': 'Live coverage of the first round from Group B at the Amazonia Arena.', | ||||
|             'duration': 7980, | ||||
|             'uploader': 'BBC Sport', | ||||
|             'uploader_id': 'bbc_sport', | ||||
|         }, | ||||
|         'params': { | ||||
|             # m3u8 download | ||||
|             'skip_download': True, | ||||
|         }, | ||||
|         'skip': 'Georestricted to UK', | ||||
|     }, { | ||||
|         # single video with playlist.sxml URL in playlist param | ||||
|         'url': 'http://www.bbc.com/sport/0/football/33653409', | ||||
| @@ -772,7 +751,7 @@ class BBCIE(BBCCoUkIE): | ||||
|  | ||||
|         webpage = self._download_webpage(url, playlist_id) | ||||
|  | ||||
|         json_ld_info = self._search_json_ld(webpage, playlist_id, default={}) | ||||
|         json_ld_info = self._search_json_ld(webpage, playlist_id, default=None) | ||||
|         timestamp = json_ld_info.get('timestamp') | ||||
|  | ||||
|         playlist_title = json_ld_info.get('title') | ||||
| @@ -841,19 +820,13 @@ class BBCIE(BBCCoUkIE): | ||||
|                         # http://www.bbc.com/turkce/multimedya/2015/10/151010_vid_ankara_patlama_ani) | ||||
|                         playlist = data_playable.get('otherSettings', {}).get('playlist', {}) | ||||
|                         if playlist: | ||||
|                             entry = None | ||||
|                             for key in ('streaming', 'progressiveDownload'): | ||||
|                             for key in ('progressiveDownload', 'streaming'): | ||||
|                                 playlist_url = playlist.get('%sUrl' % key) | ||||
|                                 if not playlist_url: | ||||
|                                     continue | ||||
|                                 try: | ||||
|                                     info = self._extract_from_playlist_sxml( | ||||
|                                         playlist_url, playlist_id, timestamp) | ||||
|                                     if not entry: | ||||
|                                         entry = info | ||||
|                                     else: | ||||
|                                         entry['title'] = info['title'] | ||||
|                                         entry['formats'].extend(info['formats']) | ||||
|                                     entries.append(self._extract_from_playlist_sxml( | ||||
|                                         playlist_url, playlist_id, timestamp)) | ||||
|                                 except Exception as e: | ||||
|                                     # Some playlist URL may fail with 500, at the same time | ||||
|                                     # the other one may work fine (e.g. | ||||
| @@ -861,9 +834,6 @@ class BBCIE(BBCCoUkIE): | ||||
|                                     if isinstance(e.cause, compat_HTTPError) and e.cause.code == 500: | ||||
|                                         continue | ||||
|                                     raise | ||||
|                             if entry: | ||||
|                                 self._sort_formats(entry['formats']) | ||||
|                                 entries.append(entry) | ||||
|  | ||||
|         if entries: | ||||
|             return self.playlist_result(entries, playlist_id, playlist_title, playlist_description) | ||||
| @@ -896,50 +866,6 @@ class BBCIE(BBCCoUkIE): | ||||
|                 'subtitles': subtitles, | ||||
|             } | ||||
|  | ||||
|         # Morph based embed (e.g. http://www.bbc.co.uk/sport/live/olympics/36895975) | ||||
|         # Several setPayload calls may be present, but the video | ||||
|         # always seems to be related to the first one | ||||
|         morph_payload = self._parse_json( | ||||
|             self._search_regex( | ||||
|                 r'Morph\.setPayload\([^,]+,\s*({.+?})\);', | ||||
|                 webpage, 'morph payload', default='{}'), | ||||
|             playlist_id, fatal=False) | ||||
|         if morph_payload: | ||||
|             components = try_get(morph_payload, lambda x: x['body']['components'], list) or [] | ||||
|             for component in components: | ||||
|                 if not isinstance(component, dict): | ||||
|                     continue | ||||
|                 lead_media = try_get(component, lambda x: x['props']['leadMedia'], dict) | ||||
|                 if not lead_media: | ||||
|                     continue | ||||
|                 identifiers = lead_media.get('identifiers') | ||||
|                 if not identifiers or not isinstance(identifiers, dict): | ||||
|                     continue | ||||
|                 programme_id = identifiers.get('vpid') or identifiers.get('playablePid') | ||||
|                 if not programme_id: | ||||
|                     continue | ||||
|                 title = lead_media.get('title') or self._og_search_title(webpage) | ||||
|                 formats, subtitles = self._download_media_selector(programme_id) | ||||
|                 self._sort_formats(formats) | ||||
|                 description = lead_media.get('summary') | ||||
|                 uploader = lead_media.get('masterBrand') | ||||
|                 uploader_id = lead_media.get('mid') | ||||
|                 duration = None | ||||
|                 duration_d = lead_media.get('duration') | ||||
|                 if isinstance(duration_d, dict): | ||||
|                     duration = parse_duration(dict_get( | ||||
|                         duration_d, ('rawDuration', 'formattedDuration', 'spokenDuration'))) | ||||
|                 return { | ||||
|                     'id': programme_id, | ||||
|                     'title': title, | ||||
|                     'description': description, | ||||
|                     'duration': duration, | ||||
|                     'uploader': uploader, | ||||
|                     'uploader_id': uploader_id, | ||||
|                     'formats': formats, | ||||
|                     'subtitles': subtitles, | ||||
|                 } | ||||
|  | ||||
|         def extract_all(pattern): | ||||
|             return list(filter(None, map( | ||||
|                 lambda s: self._parse_json(s, playlist_id, fatal=False), | ||||
| @@ -957,7 +883,7 @@ class BBCIE(BBCCoUkIE): | ||||
|             r'setPlaylist\("(%s)"\)' % EMBED_URL, webpage)) | ||||
|         if entries: | ||||
|             return self.playlist_result( | ||||
|                 [self.url_result(entry_, 'BBCCoUk') for entry_ in entries], | ||||
|                 [self.url_result(entry, 'BBCCoUk') for entry in entries], | ||||
|                 playlist_id, playlist_title, playlist_description) | ||||
|  | ||||
|         # Multiple video article (e.g. http://www.bbc.com/news/world-europe-32668511) | ||||
| @@ -1039,7 +965,7 @@ class BBCIE(BBCCoUkIE): | ||||
|  | ||||
|  | ||||
| class BBCCoUkArticleIE(InfoExtractor): | ||||
|     _VALID_URL = r'https?://(?:www\.)?bbc\.co\.uk/programmes/articles/(?P<id>[a-zA-Z0-9]+)' | ||||
|     _VALID_URL = r'https?://www.bbc.co.uk/programmes/articles/(?P<id>[a-zA-Z0-9]+)' | ||||
|     IE_NAME = 'bbc.co.uk:article' | ||||
|     IE_DESC = 'BBC articles' | ||||
|  | ||||
| @@ -1069,35 +995,19 @@ class BBCCoUkArticleIE(InfoExtractor): | ||||
|  | ||||
|  | ||||
| class BBCCoUkPlaylistBaseIE(InfoExtractor): | ||||
|     def _entries(self, webpage, url, playlist_id): | ||||
|         single_page = 'page' in compat_urlparse.parse_qs( | ||||
|             compat_urlparse.urlparse(url).query) | ||||
|         for page_num in itertools.count(2): | ||||
|             for video_id in re.findall( | ||||
|                     self._VIDEO_ID_TEMPLATE % BBCCoUkIE._ID_REGEX, webpage): | ||||
|                 yield self.url_result( | ||||
|                     self._URL_TEMPLATE % video_id, BBCCoUkIE.ie_key()) | ||||
|             if single_page: | ||||
|                 return | ||||
|             next_page = self._search_regex( | ||||
|                 r'<li[^>]+class=(["\'])pagination_+next\1[^>]*><a[^>]+href=(["\'])(?P<url>(?:(?!\2).)+)\2', | ||||
|                 webpage, 'next page url', default=None, group='url') | ||||
|             if not next_page: | ||||
|                 break | ||||
|             webpage = self._download_webpage( | ||||
|                 compat_urlparse.urljoin(url, next_page), playlist_id, | ||||
|                 'Downloading page %d' % page_num, page_num) | ||||
|  | ||||
|     def _real_extract(self, url): | ||||
|         playlist_id = self._match_id(url) | ||||
|  | ||||
|         webpage = self._download_webpage(url, playlist_id) | ||||
|  | ||||
|         entries = [ | ||||
|             self.url_result(self._URL_TEMPLATE % video_id, BBCCoUkIE.ie_key()) | ||||
|             for video_id in re.findall( | ||||
|                 self._VIDEO_ID_TEMPLATE % BBCCoUkIE._ID_REGEX, webpage)] | ||||
|  | ||||
|         title, description = self._extract_title_and_description(webpage) | ||||
|  | ||||
|         return self.playlist_result( | ||||
|             self._entries(webpage, url, playlist_id), | ||||
|             playlist_id, title, description) | ||||
|         return self.playlist_result(entries, playlist_id, title, description) | ||||
|  | ||||
|  | ||||
| class BBCCoUkIPlayerPlaylistIE(BBCCoUkPlaylistBaseIE): | ||||
| @@ -1146,24 +1056,6 @@ class BBCCoUkPlaylistIE(BBCCoUkPlaylistBaseIE): | ||||
|             'description': 'French thriller serial about a missing teenager.', | ||||
|         }, | ||||
|         'playlist_mincount': 7, | ||||
|     }, { | ||||
|         # multipage playlist, explicit page | ||||
|         'url': 'http://www.bbc.co.uk/programmes/b00mfl7n/clips?page=1', | ||||
|         'info_dict': { | ||||
|             'id': 'b00mfl7n', | ||||
|             'title': 'Frozen Planet - Clips - BBC One', | ||||
|             'description': 'md5:65dcbf591ae628dafe32aa6c4a4a0d8c', | ||||
|         }, | ||||
|         'playlist_mincount': 24, | ||||
|     }, { | ||||
|         # multipage playlist, all pages | ||||
|         'url': 'http://www.bbc.co.uk/programmes/b00mfl7n/clips', | ||||
|         'info_dict': { | ||||
|             'id': 'b00mfl7n', | ||||
|             'title': 'Frozen Planet - Clips - BBC One', | ||||
|             'description': 'md5:65dcbf591ae628dafe32aa6c4a4a0d8c', | ||||
|         }, | ||||
|         'playlist_mincount': 142, | ||||
|     }, { | ||||
|         'url': 'http://www.bbc.co.uk/programmes/b05rcz9v/broadcasts/2016/06', | ||||
|         'only_matching': True, | ||||
|   | ||||
| @@ -1,73 +0,0 @@ | ||||
| # coding: utf-8 | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| from .common import InfoExtractor | ||||
| from ..utils import ( | ||||
|     ExtractorError, | ||||
|     clean_html, | ||||
|     compat_str, | ||||
|     int_or_none, | ||||
|     parse_iso8601, | ||||
|     try_get, | ||||
| ) | ||||
|  | ||||
|  | ||||
| class BeamProLiveIE(InfoExtractor): | ||||
|     IE_NAME = 'Beam:live' | ||||
|     _VALID_URL = r'https?://(?:\w+\.)?beam\.pro/(?P<id>[^/?#&]+)' | ||||
|     _RATINGS = {'family': 0, 'teen': 13, '18+': 18} | ||||
|     _TEST = { | ||||
|         'url': 'http://www.beam.pro/niterhayven', | ||||
|         'info_dict': { | ||||
|             'id': '261562', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'Introducing The Witcher 3 //  The Grind Starts Now!', | ||||
|             'description': 'md5:0b161ac080f15fe05d18a07adb44a74d', | ||||
|             'thumbnail': r're:https://.*\.jpg$', | ||||
|             'timestamp': 1483477281, | ||||
|             'upload_date': '20170103', | ||||
|             'uploader': 'niterhayven', | ||||
|             'uploader_id': '373396', | ||||
|             'age_limit': 18, | ||||
|             'is_live': True, | ||||
|             'view_count': int, | ||||
|         }, | ||||
|         'skip': 'niterhayven is offline', | ||||
|         'params': { | ||||
|             'skip_download': True, | ||||
|         }, | ||||
|     } | ||||
|  | ||||
|     def _real_extract(self, url): | ||||
|         channel_name = self._match_id(url) | ||||
|  | ||||
|         chan = self._download_json( | ||||
|             'https://beam.pro/api/v1/channels/%s' % channel_name, channel_name) | ||||
|  | ||||
|         if chan.get('online') is False: | ||||
|             raise ExtractorError( | ||||
|                 '{0} is offline'.format(channel_name), expected=True) | ||||
|  | ||||
|         channel_id = chan['id'] | ||||
|  | ||||
|         formats = self._extract_m3u8_formats( | ||||
|             'https://beam.pro/api/v1/channels/%s/manifest.m3u8' % channel_id, | ||||
|             channel_name, ext='mp4', m3u8_id='hls', fatal=False) | ||||
|         self._sort_formats(formats) | ||||
|  | ||||
|         user_id = chan.get('userId') or try_get(chan, lambda x: x['user']['id']) | ||||
|  | ||||
|         return { | ||||
|             'id': compat_str(chan.get('id') or channel_name), | ||||
|             'title': self._live_title(chan.get('name') or channel_name), | ||||
|             'description': clean_html(chan.get('description')), | ||||
|             'thumbnail': try_get(chan, lambda x: x['thumbnail']['url'], compat_str), | ||||
|             'timestamp': parse_iso8601(chan.get('updatedAt')), | ||||
|             'uploader': chan.get('token') or try_get( | ||||
|                 chan, lambda x: x['user']['username'], compat_str), | ||||
|             'uploader_id': compat_str(user_id) if user_id else None, | ||||
|             'age_limit': self._RATINGS.get(chan.get('audience')), | ||||
|             'is_live': True, | ||||
|             'view_count': int_or_none(chan.get('viewersTotal')), | ||||
|             'formats': formats, | ||||
|         } | ||||
| @@ -8,10 +8,10 @@ from ..compat import compat_str | ||||
| from ..utils import int_or_none | ||||
|  | ||||
|  | ||||
| class BeatportIE(InfoExtractor): | ||||
|     _VALID_URL = r'https?://(?:www\.|pro\.)?beatport\.com/track/(?P<display_id>[^/]+)/(?P<id>[0-9]+)' | ||||
| class BeatportProIE(InfoExtractor): | ||||
|     _VALID_URL = r'https?://pro\.beatport\.com/track/(?P<display_id>[^/]+)/(?P<id>[0-9]+)' | ||||
|     _TESTS = [{ | ||||
|         'url': 'https://beatport.com/track/synesthesia-original-mix/5379371', | ||||
|         'url': 'https://pro.beatport.com/track/synesthesia-original-mix/5379371', | ||||
|         'md5': 'b3c34d8639a2f6a7f734382358478887', | ||||
|         'info_dict': { | ||||
|             'id': '5379371', | ||||
| @@ -20,7 +20,7 @@ class BeatportIE(InfoExtractor): | ||||
|             'title': 'Froxic - Synesthesia (Original Mix)', | ||||
|         }, | ||||
|     }, { | ||||
|         'url': 'https://beatport.com/track/love-and-war-original-mix/3756896', | ||||
|         'url': 'https://pro.beatport.com/track/love-and-war-original-mix/3756896', | ||||
|         'md5': 'e44c3025dfa38c6577fbaeb43da43514', | ||||
|         'info_dict': { | ||||
|             'id': '3756896', | ||||
| @@ -29,7 +29,7 @@ class BeatportIE(InfoExtractor): | ||||
|             'title': 'Wolfgang Gartner - Love & War (Original Mix)', | ||||
|         }, | ||||
|     }, { | ||||
|         'url': 'https://beatport.com/track/birds-original-mix/4991738', | ||||
|         'url': 'https://pro.beatport.com/track/birds-original-mix/4991738', | ||||
|         'md5': 'a1fd8e8046de3950fd039304c186c05f', | ||||
|         'info_dict': { | ||||
|             'id': '4991738', | ||||
| @@ -46,19 +46,19 @@ class BeegIE(InfoExtractor): | ||||
|                 self._proto_relative_url(cpl_url), video_id, | ||||
|                 'Downloading cpl JS', fatal=False) | ||||
|             if cpl: | ||||
|                 beeg_version = int_or_none(self._search_regex( | ||||
|                     r'beeg_version\s*=\s*([^\b]+)', cpl, | ||||
|                     'beeg version', default=None)) or self._search_regex( | ||||
|                 beeg_version = self._search_regex( | ||||
|                     r'beeg_version\s*=\s*(\d+)', cpl, | ||||
|                     'beeg version', default=None) or self._search_regex( | ||||
|                     r'/(\d+)\.js', cpl_url, 'beeg version', default=None) | ||||
|                 beeg_salt = self._search_regex( | ||||
|                     r'beeg_salt\s*=\s*(["\'])(?P<beeg_salt>.+?)\1', cpl, 'beeg salt', | ||||
|                     r'beeg_salt\s*=\s*(["\'])(?P<beeg_salt>.+?)\1', cpl, 'beeg beeg_salt', | ||||
|                     default=None, group='beeg_salt') | ||||
|  | ||||
|         beeg_version = beeg_version or '2000' | ||||
|         beeg_salt = beeg_salt or 'pmweAkq8lAYKdfWcFCUj0yoVgoPlinamH5UE1CB3H' | ||||
|         beeg_version = beeg_version or '1750' | ||||
|         beeg_salt = beeg_salt or 'MIDtGaw96f0N1kMMAM1DE46EC9pmFr' | ||||
|  | ||||
|         video = self._download_json( | ||||
|             'https://api.beeg.com/api/v6/%s/video/%s' % (beeg_version, video_id), | ||||
|             'http://api.beeg.com/api/v6/%s/video/%s' % (beeg_version, video_id), | ||||
|             video_id) | ||||
|  | ||||
|         def split(o, e): | ||||
|   | ||||
| @@ -1,78 +0,0 @@ | ||||
| # coding: utf-8 | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| import re | ||||
|  | ||||
| from .common import InfoExtractor | ||||
|  | ||||
|  | ||||
| class BellMediaIE(InfoExtractor): | ||||
|     _VALID_URL = r'''(?x)https?://(?:www\.)? | ||||
|         (?P<domain> | ||||
|             (?: | ||||
|                 ctv| | ||||
|                 tsn| | ||||
|                 bnn| | ||||
|                 thecomedynetwork| | ||||
|                 discovery| | ||||
|                 discoveryvelocity| | ||||
|                 sciencechannel| | ||||
|                 investigationdiscovery| | ||||
|                 animalplanet| | ||||
|                 bravo| | ||||
|                 mtv| | ||||
|                 space | ||||
|             )\.ca| | ||||
|             much\.com | ||||
|         )/.*?(?:\bvid=|-vid|~|%7E|/(?:episode)?)(?P<id>[0-9]{6,})''' | ||||
|     _TESTS = [{ | ||||
|         'url': 'http://www.ctv.ca/video/player?vid=706966', | ||||
|         'md5': 'ff2ebbeae0aa2dcc32a830c3fd69b7b0', | ||||
|         'info_dict': { | ||||
|             'id': '706966', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'Larry Day and Richard Jutras on the TIFF red carpet of \'Stonewall\'', | ||||
|             'description': 'etalk catches up with Larry Day and Richard Jutras on the TIFF red carpet of "Stonewall”.', | ||||
|             'upload_date': '20150919', | ||||
|             'timestamp': 1442624700, | ||||
|         }, | ||||
|         'expected_warnings': ['HTTP Error 404'], | ||||
|     }, { | ||||
|         'url': 'http://www.thecomedynetwork.ca/video/player?vid=923582', | ||||
|         'only_matching': True, | ||||
|     }, { | ||||
|         'url': 'http://www.tsn.ca/video/expectations-high-for-milos-raonic-at-us-open~939549', | ||||
|         'only_matching': True, | ||||
|     }, { | ||||
|         'url': 'http://www.bnn.ca/video/berman-s-call-part-two-viewer-questions~939654', | ||||
|         'only_matching': True, | ||||
|     }, { | ||||
|         'url': 'http://www.ctv.ca/YourMorning/Video/S1E6-Monday-August-29-2016-vid938009', | ||||
|         'only_matching': True, | ||||
|     }, { | ||||
|         'url': 'http://www.much.com/shows/atmidnight/episode948007/tuesday-september-13-2016', | ||||
|         'only_matching': True, | ||||
|     }, { | ||||
|         'url': 'http://www.much.com/shows/the-almost-impossible-gameshow/928979/episode-6', | ||||
|         'only_matching': True, | ||||
|     }, { | ||||
|         'url': 'http://www.ctv.ca/DCs-Legends-of-Tomorrow/Video/S2E11-Turncoat-vid1051430', | ||||
|         'only_matching': True, | ||||
|     }] | ||||
|     _DOMAINS = { | ||||
|         'thecomedynetwork': 'comedy', | ||||
|         'discoveryvelocity': 'discvel', | ||||
|         'sciencechannel': 'discsci', | ||||
|         'investigationdiscovery': 'invdisc', | ||||
|         'animalplanet': 'aniplan', | ||||
|     } | ||||
|  | ||||
|     def _real_extract(self, url): | ||||
|         domain, video_id = re.match(self._VALID_URL, url).groups() | ||||
|         domain = domain.split('.')[0] | ||||
|         return { | ||||
|             '_type': 'url_transparent', | ||||
|             'id': video_id, | ||||
|             'url': '9c9media:%s_web:%s' % (self._DOMAINS.get(domain, domain), video_id), | ||||
|             'ie_key': 'NineCNineMedia', | ||||
|         } | ||||
| @@ -2,6 +2,7 @@ from __future__ import unicode_literals | ||||
|  | ||||
| from .mtv import MTVServicesInfoExtractor | ||||
| from ..utils import unified_strdate | ||||
| from ..compat import compat_urllib_parse_urlencode | ||||
|  | ||||
|  | ||||
| class BetIE(MTVServicesInfoExtractor): | ||||
| @@ -17,7 +18,7 @@ class BetIE(MTVServicesInfoExtractor): | ||||
|                 'description': 'President Obama urges persistence in confronting racism and bias.', | ||||
|                 'duration': 1534, | ||||
|                 'upload_date': '20141208', | ||||
|                 'thumbnail': r're:(?i)^https?://.*\.jpg$', | ||||
|                 'thumbnail': 're:(?i)^https?://.*\.jpg$', | ||||
|                 'subtitles': { | ||||
|                     'en': 'mincount:2', | ||||
|                 } | ||||
| @@ -37,7 +38,7 @@ class BetIE(MTVServicesInfoExtractor): | ||||
|                 'description': 'A BET News special.', | ||||
|                 'duration': 1696, | ||||
|                 'upload_date': '20141125', | ||||
|                 'thumbnail': r're:(?i)^https?://.*\.jpg$', | ||||
|                 'thumbnail': 're:(?i)^https?://.*\.jpg$', | ||||
|                 'subtitles': { | ||||
|                     'en': 'mincount:2', | ||||
|                 } | ||||
| @@ -52,9 +53,9 @@ class BetIE(MTVServicesInfoExtractor): | ||||
|     _FEED_URL = "http://feeds.mtvnservices.com/od/feed/bet-mrss-player" | ||||
|  | ||||
|     def _get_feed_query(self, uri): | ||||
|         return { | ||||
|         return compat_urllib_parse_urlencode({ | ||||
|             'uuid': uri, | ||||
|         } | ||||
|         }) | ||||
|  | ||||
|     def _extract_mgid(self, webpage): | ||||
|         return self._search_regex(r'data-uri="([^"]+)', webpage, 'mgid') | ||||
|   | ||||
| @@ -11,6 +11,15 @@ from ..compat import compat_urllib_parse_unquote | ||||
| class BigflixIE(InfoExtractor): | ||||
|     _VALID_URL = r'https?://(?:www\.)?bigflix\.com/.+/(?P<id>[0-9]+)' | ||||
|     _TESTS = [{ | ||||
|         'url': 'http://www.bigflix.com/Hindi-movies/Action-movies/Singham-Returns/16537', | ||||
|         'md5': 'dc1b4aebb46e3a7077ecc0d9f43f61e3', | ||||
|         'info_dict': { | ||||
|             'id': '16537', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'Singham Returns', | ||||
|             'description': 'md5:3d2ba5815f14911d5cc6a501ae0cf65d', | ||||
|         } | ||||
|     }, { | ||||
|         # 2 formats | ||||
|         'url': 'http://www.bigflix.com/Tamil-movies/Drama-movies/Madarasapatinam/16070', | ||||
|         'info_dict': { | ||||
|   | ||||
| @@ -19,7 +19,7 @@ class BildIE(InfoExtractor): | ||||
|             'ext': 'mp4', | ||||
|             'title': 'Das können die  neuen iPads', | ||||
|             'description': 'md5:a4058c4fa2a804ab59c00d7244bbf62f', | ||||
|             'thumbnail': r're:^https?://.*\.jpg$', | ||||
|             'thumbnail': 're:^https?://.*\.jpg$', | ||||
|             'duration': 196, | ||||
|         } | ||||
|     } | ||||
|   | ||||
| @@ -1,153 +1,209 @@ | ||||
| # coding: utf-8 | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| import hashlib | ||||
| import calendar | ||||
| import datetime | ||||
| import re | ||||
|  | ||||
| from .common import InfoExtractor | ||||
| from ..compat import ( | ||||
|     compat_etree_fromstring, | ||||
|     compat_str, | ||||
|     compat_parse_qs, | ||||
|     compat_urlparse, | ||||
|     compat_xml_parse_error, | ||||
| ) | ||||
| from ..utils import ( | ||||
|     ExtractorError, | ||||
|     int_or_none, | ||||
|     float_or_none, | ||||
|     parse_iso8601, | ||||
|     smuggle_url, | ||||
|     strip_jsonp, | ||||
|     unified_timestamp, | ||||
|     unsmuggle_url, | ||||
|     urlencode_postdata, | ||||
|     xpath_text, | ||||
| ) | ||||
|  | ||||
|  | ||||
| class BiliBiliIE(InfoExtractor): | ||||
|     _VALID_URL = r'https?://(?:www\.|bangumi\.|)bilibili\.(?:tv|com)/(?:video/av|anime/(?P<anime_id>\d+)/play#)(?P<id>\d+)' | ||||
|     _VALID_URL = r'https?://www\.bilibili\.(?:tv|com)/video/av(?P<id>\d+)' | ||||
|  | ||||
|     _TESTS = [{ | ||||
|         'url': 'http://www.bilibili.tv/video/av1074402/', | ||||
|         'md5': '9fa226fe2b8a9a4d5a69b4c6a183417e', | ||||
|         'md5': '5f7d29e1a2872f3df0cf76b1f87d3788', | ||||
|         'info_dict': { | ||||
|             'id': '1074402', | ||||
|             'ext': 'mp4', | ||||
|             'id': '1554319', | ||||
|             'ext': 'flv', | ||||
|             'title': '【金坷垃】金泡沫', | ||||
|             'description': 'md5:ce18c2a2d2193f0df2917d270f2e5923', | ||||
|             'duration': 308.315, | ||||
|             'duration': 308.067, | ||||
|             'timestamp': 1398012660, | ||||
|             'upload_date': '20140420', | ||||
|             'thumbnail': r're:^https?://.+\.jpg', | ||||
|             'thumbnail': 're:^https?://.+\.jpg', | ||||
|             'uploader': '菊子桑', | ||||
|             'uploader_id': '156160', | ||||
|         }, | ||||
|     }, { | ||||
|         # Tested in BiliBiliBangumiIE | ||||
|         'url': 'http://bangumi.bilibili.com/anime/1869/play#40062', | ||||
|         'only_matching': True, | ||||
|     }, { | ||||
|         'url': 'http://bangumi.bilibili.com/anime/5802/play#100643', | ||||
|         'md5': '3f721ad1e75030cc06faf73587cfec57', | ||||
|         'url': 'http://www.bilibili.com/video/av1041170/', | ||||
|         'info_dict': { | ||||
|             'id': '100643', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'CHAOS;CHILD', | ||||
|             'description': '如果你是神明,并且能够让妄想成为现实。那你会进行怎么样的妄想?是淫靡的世界?独裁社会?毁灭性的制裁?还是……2015年,涩谷。从6年前发生的大灾害“涩谷地震”之后复兴了的这个街区里新设立的私立高中...', | ||||
|             'id': '1041170', | ||||
|             'title': '【BD1080P】刀语【诸神&异域】', | ||||
|             'description': '这是个神奇的故事~每个人不留弹幕不给走哦~切利哦!~', | ||||
|         }, | ||||
|         'skip': 'Geo-restricted to China', | ||||
|         'playlist_count': 9, | ||||
|     }, { | ||||
|         'url': 'http://www.bilibili.com/video/av4808130/', | ||||
|         'info_dict': { | ||||
|             'id': '4808130', | ||||
|             'title': '【长篇】哆啦A梦443【钉铛】', | ||||
|             'description': '(2016.05.27)来组合客人的脸吧&amp;寻母六千里锭 抱歉,又轮到周日上班现在才到家 封面www.pixiv.net/member_illust.php?mode=medium&amp;illust_id=56912929', | ||||
|         }, | ||||
|         'playlist': [{ | ||||
|             'md5': '55cdadedf3254caaa0d5d27cf20a8f9c', | ||||
|             'info_dict': { | ||||
|                 'id': '4808130_part1', | ||||
|                 'ext': 'flv', | ||||
|                 'title': '【长篇】哆啦A梦443【钉铛】', | ||||
|                 'description': '(2016.05.27)来组合客人的脸吧&amp;寻母六千里锭 抱歉,又轮到周日上班现在才到家 封面www.pixiv.net/member_illust.php?mode=medium&amp;illust_id=56912929', | ||||
|                 'timestamp': 1464564180, | ||||
|                 'upload_date': '20160529', | ||||
|                 'uploader': '喜欢拉面', | ||||
|                 'uploader_id': '151066', | ||||
|             }, | ||||
|         }, { | ||||
|             'md5': '926f9f67d0c482091872fbd8eca7ea3d', | ||||
|             'info_dict': { | ||||
|                 'id': '4808130_part2', | ||||
|                 'ext': 'flv', | ||||
|                 'title': '【长篇】哆啦A梦443【钉铛】', | ||||
|                 'description': '(2016.05.27)来组合客人的脸吧&amp;寻母六千里锭 抱歉,又轮到周日上班现在才到家 封面www.pixiv.net/member_illust.php?mode=medium&amp;illust_id=56912929', | ||||
|                 'timestamp': 1464564180, | ||||
|                 'upload_date': '20160529', | ||||
|                 'uploader': '喜欢拉面', | ||||
|                 'uploader_id': '151066', | ||||
|             }, | ||||
|         }, { | ||||
|             'md5': '4b7b225b968402d7c32348c646f1fd83', | ||||
|             'info_dict': { | ||||
|                 'id': '4808130_part3', | ||||
|                 'ext': 'flv', | ||||
|                 'title': '【长篇】哆啦A梦443【钉铛】', | ||||
|                 'description': '(2016.05.27)来组合客人的脸吧&amp;寻母六千里锭 抱歉,又轮到周日上班现在才到家 封面www.pixiv.net/member_illust.php?mode=medium&amp;illust_id=56912929', | ||||
|                 'timestamp': 1464564180, | ||||
|                 'upload_date': '20160529', | ||||
|                 'uploader': '喜欢拉面', | ||||
|                 'uploader_id': '151066', | ||||
|             }, | ||||
|         }, { | ||||
|             'md5': '7b795e214166501e9141139eea236e91', | ||||
|             'info_dict': { | ||||
|                 'id': '4808130_part4', | ||||
|                 'ext': 'flv', | ||||
|                 'title': '【长篇】哆啦A梦443【钉铛】', | ||||
|                 'description': '(2016.05.27)来组合客人的脸吧&amp;寻母六千里锭 抱歉,又轮到周日上班现在才到家 封面www.pixiv.net/member_illust.php?mode=medium&amp;illust_id=56912929', | ||||
|                 'timestamp': 1464564180, | ||||
|                 'upload_date': '20160529', | ||||
|                 'uploader': '喜欢拉面', | ||||
|                 'uploader_id': '151066', | ||||
|             }, | ||||
|         }], | ||||
|     }, { | ||||
|         # Missing upload time | ||||
|         'url': 'http://www.bilibili.com/video/av1867637/', | ||||
|         'info_dict': { | ||||
|             'id': '2880301', | ||||
|             'ext': 'flv', | ||||
|             'title': '【HDTV】【喜剧】岳父岳母真难当 (2014)【法国票房冠军】', | ||||
|             'description': '一个信奉天主教的法国旧式传统资产阶级家庭中有四个女儿。三个女儿却分别找了阿拉伯、犹太、中国丈夫,老夫老妻唯独期盼剩下未嫁的小女儿能找一个信奉天主教的法国白人,结果没想到小女儿找了一位非裔黑人……【这次应该不会跳帧了】', | ||||
|             'uploader': '黑夜为猫', | ||||
|             'uploader_id': '610729', | ||||
|         }, | ||||
|         'params': { | ||||
|             # Just to test metadata extraction | ||||
|             'skip_download': True, | ||||
|         }, | ||||
|         'expected_warnings': ['upload time'], | ||||
|     }] | ||||
|  | ||||
|     _APP_KEY = '84956560bc028eb7' | ||||
|     _BILIBILI_KEY = '94aba54af9065f71de72f5508f1cd42e' | ||||
|  | ||||
|     def _report_error(self, result): | ||||
|         if 'message' in result: | ||||
|             raise ExtractorError('%s said: %s' % (self.IE_NAME, result['message']), expected=True) | ||||
|         elif 'code' in result: | ||||
|             raise ExtractorError('%s returns error %d' % (self.IE_NAME, result['code']), expected=True) | ||||
|         else: | ||||
|             raise ExtractorError('Can\'t extract Bangumi episode ID') | ||||
|     # BiliBili blocks keys from time to time. The current key is extracted from | ||||
|     # the Android client | ||||
|     # TODO: find the sign algorithm used in the flash player | ||||
|     _APP_KEY = '86385cdc024c0f6c' | ||||
|  | ||||
|     def _real_extract(self, url): | ||||
|         url, smuggled_data = unsmuggle_url(url, {}) | ||||
|  | ||||
|         mobj = re.match(self._VALID_URL, url) | ||||
|         video_id = mobj.group('id') | ||||
|         anime_id = mobj.group('anime_id') | ||||
|  | ||||
|         webpage = self._download_webpage(url, video_id) | ||||
|  | ||||
|         if 'anime/' not in url: | ||||
|             cid = compat_parse_qs(self._search_regex( | ||||
|                 [r'EmbedPlayer\([^)]+,\s*"([^"]+)"\)', | ||||
|                  r'<iframe[^>]+src="https://secure\.bilibili\.com/secure,([^"]+)"'], | ||||
|                 webpage, 'player parameters'))['cid'][0] | ||||
|         params = compat_parse_qs(self._search_regex( | ||||
|             [r'EmbedPlayer\([^)]+,\s*"([^"]+)"\)', | ||||
|              r'<iframe[^>]+src="https://secure\.bilibili\.com/secure,([^"]+)"'], | ||||
|             webpage, 'player parameters')) | ||||
|         cid = params['cid'][0] | ||||
|  | ||||
|         info_xml_str = self._download_webpage( | ||||
|             'http://interface.bilibili.com/v_cdn_play', | ||||
|             cid, query={'appkey': self._APP_KEY, 'cid': cid}, | ||||
|             note='Downloading video info page') | ||||
|  | ||||
|         err_msg = None | ||||
|         durls = None | ||||
|         info_xml = None | ||||
|         try: | ||||
|             info_xml = compat_etree_fromstring(info_xml_str.encode('utf-8')) | ||||
|         except compat_xml_parse_error: | ||||
|             info_json = self._parse_json(info_xml_str, video_id, fatal=False) | ||||
|             err_msg = (info_json or {}).get('error_text') | ||||
|         else: | ||||
|             if 'no_bangumi_tip' not in smuggled_data: | ||||
|                 self.to_screen('Downloading episode %s. To download all videos in anime %s, re-run youtube-dl with %s' % ( | ||||
|                     video_id, anime_id, compat_urlparse.urljoin(url, '//bangumi.bilibili.com/anime/%s' % anime_id))) | ||||
|             headers = { | ||||
|                 'Content-Type': 'application/x-www-form-urlencoded; charset=UTF-8', | ||||
|             } | ||||
|             headers.update(self.geo_verification_headers()) | ||||
|             err_msg = xpath_text(info_xml, './message') | ||||
|  | ||||
|             js = self._download_json( | ||||
|                 'http://bangumi.bilibili.com/web_api/get_source', video_id, | ||||
|                 data=urlencode_postdata({'episode_id': video_id}), | ||||
|                 headers=headers) | ||||
|             if 'result' not in js: | ||||
|                 self._report_error(js) | ||||
|             cid = js['result']['cid'] | ||||
|  | ||||
|         payload = 'appkey=%s&cid=%s&otype=json&quality=2&type=mp4' % (self._APP_KEY, cid) | ||||
|         sign = hashlib.md5((payload + self._BILIBILI_KEY).encode('utf-8')).hexdigest() | ||||
|  | ||||
|         video_info = self._download_json( | ||||
|             'http://interface.bilibili.com/playurl?%s&sign=%s' % (payload, sign), | ||||
|             video_id, note='Downloading video info page', | ||||
|             headers=self.geo_verification_headers()) | ||||
|  | ||||
|         if 'durl' not in video_info: | ||||
|             self._report_error(video_info) | ||||
|         if info_xml is not None: | ||||
|             durls = info_xml.findall('./durl') | ||||
|         if not durls: | ||||
|             if err_msg: | ||||
|                 raise ExtractorError('%s said: %s' % (self.IE_NAME, err_msg), expected=True) | ||||
|             else: | ||||
|                 raise ExtractorError('No videos found!') | ||||
|  | ||||
|         entries = [] | ||||
|  | ||||
|         for idx, durl in enumerate(video_info['durl']): | ||||
|         for durl in durls: | ||||
|             size = xpath_text(durl, ['./filesize', './size']) | ||||
|             formats = [{ | ||||
|                 'url': durl['url'], | ||||
|                 'filesize': int_or_none(durl['size']), | ||||
|                 'url': durl.find('./url').text, | ||||
|                 'filesize': int_or_none(size), | ||||
|             }] | ||||
|             for backup_url in durl.get('backup_url', []): | ||||
|             for backup_url in durl.findall('./backup_url/url'): | ||||
|                 formats.append({ | ||||
|                     'url': backup_url, | ||||
|                     'url': backup_url.text, | ||||
|                     # backup URLs have lower priorities | ||||
|                     'preference': -2 if 'hd.mp4' in backup_url else -3, | ||||
|                     'preference': -2 if 'hd.mp4' in backup_url.text else -3, | ||||
|                 }) | ||||
|  | ||||
|             self._sort_formats(formats) | ||||
|  | ||||
|             entries.append({ | ||||
|                 'id': '%s_part%s' % (video_id, idx), | ||||
|                 'duration': float_or_none(durl.get('length'), 1000), | ||||
|                 'id': '%s_part%s' % (cid, xpath_text(durl, './order')), | ||||
|                 'duration': int_or_none(xpath_text(durl, './length'), 1000), | ||||
|                 'formats': formats, | ||||
|             }) | ||||
|  | ||||
|         title = self._html_search_regex('<h1[^>]+title="([^"]+)">', webpage, 'title') | ||||
|         description = self._html_search_meta('description', webpage) | ||||
|         timestamp = unified_timestamp(self._html_search_regex( | ||||
|             r'<time[^>]+datetime="([^"]+)"', webpage, 'upload time', default=None)) | ||||
|         thumbnail = self._html_search_meta(['og:image', 'thumbnailUrl'], webpage) | ||||
|         datetime_str = self._html_search_regex( | ||||
|             r'<time[^>]+datetime="([^"]+)"', webpage, 'upload time', fatal=False) | ||||
|         timestamp = None | ||||
|         if datetime_str: | ||||
|             timestamp = calendar.timegm(datetime.datetime.strptime(datetime_str, '%Y-%m-%dT%H:%M').timetuple()) | ||||
|  | ||||
|         # TODO 'view_count' requires deobfuscating Javascript | ||||
|         info = { | ||||
|             'id': video_id, | ||||
|             'id': compat_str(cid), | ||||
|             'title': title, | ||||
|             'description': description, | ||||
|             'timestamp': timestamp, | ||||
|             'thumbnail': thumbnail, | ||||
|             'duration': float_or_none(video_info.get('timelength'), scale=1000), | ||||
|             'thumbnail': self._html_search_meta('thumbnailUrl', webpage), | ||||
|             'duration': float_or_none(xpath_text(info_xml, './timelength'), scale=1000), | ||||
|         } | ||||
|  | ||||
|         uploader_mobj = re.search( | ||||
|             r'<a[^>]+href="(?:https?:)?//space\.bilibili\.com/(?P<id>\d+)"[^>]+title="(?P<name>[^"]+)"', | ||||
|             r'<a[^>]+href="https?://space\.bilibili\.com/(?P<id>\d+)"[^>]+title="(?P<name>[^"]+)"', | ||||
|             webpage) | ||||
|         if uploader_mobj: | ||||
|             info.update({ | ||||
| @@ -171,70 +227,3 @@ class BiliBiliIE(InfoExtractor): | ||||
|                 'description': description, | ||||
|                 'entries': entries, | ||||
|             } | ||||
|  | ||||
|  | ||||
| class BiliBiliBangumiIE(InfoExtractor): | ||||
|     _VALID_URL = r'https?://bangumi\.bilibili\.com/anime/(?P<id>\d+)' | ||||
|  | ||||
|     IE_NAME = 'bangumi.bilibili.com' | ||||
|     IE_DESC = 'BiliBili番剧' | ||||
|  | ||||
|     _TESTS = [{ | ||||
|         'url': 'http://bangumi.bilibili.com/anime/1869', | ||||
|         'info_dict': { | ||||
|             'id': '1869', | ||||
|             'title': '混沌武士', | ||||
|             'description': 'md5:6a9622b911565794c11f25f81d6a97d2', | ||||
|         }, | ||||
|         'playlist_count': 26, | ||||
|     }, { | ||||
|         'url': 'http://bangumi.bilibili.com/anime/1869', | ||||
|         'info_dict': { | ||||
|             'id': '1869', | ||||
|             'title': '混沌武士', | ||||
|             'description': 'md5:6a9622b911565794c11f25f81d6a97d2', | ||||
|         }, | ||||
|         'playlist': [{ | ||||
|             'md5': '91da8621454dd58316851c27c68b0c13', | ||||
|             'info_dict': { | ||||
|                 'id': '40062', | ||||
|                 'ext': 'mp4', | ||||
|                 'title': '混沌武士', | ||||
|                 'description': '故事发生在日本的江户时代。风是一个小酒馆的打工女。一日,酒馆里来了一群恶霸,虽然他们的举动令风十分不满,但是毕竟风只是一届女流,无法对他们采取什么行动,只能在心里嘟哝。这时,酒家里又进来了个“不良份子...', | ||||
|                 'timestamp': 1414538739, | ||||
|                 'upload_date': '20141028', | ||||
|                 'episode': '疾风怒涛 Tempestuous Temperaments', | ||||
|                 'episode_number': 1, | ||||
|             }, | ||||
|         }], | ||||
|         'params': { | ||||
|             'playlist_items': '1', | ||||
|         }, | ||||
|     }] | ||||
|  | ||||
|     @classmethod | ||||
|     def suitable(cls, url): | ||||
|         return False if BiliBiliIE.suitable(url) else super(BiliBiliBangumiIE, cls).suitable(url) | ||||
|  | ||||
|     def _real_extract(self, url): | ||||
|         bangumi_id = self._match_id(url) | ||||
|  | ||||
|         # Sometimes this API returns a JSONP response | ||||
|         season_info = self._download_json( | ||||
|             'http://bangumi.bilibili.com/jsonp/seasoninfo/%s.ver' % bangumi_id, | ||||
|             bangumi_id, transform_source=strip_jsonp)['result'] | ||||
|  | ||||
|         entries = [{ | ||||
|             '_type': 'url_transparent', | ||||
|             'url': smuggle_url(episode['webplay_url'], {'no_bangumi_tip': 1}), | ||||
|             'ie_key': BiliBiliIE.ie_key(), | ||||
|             'timestamp': parse_iso8601(episode.get('update_time'), delimiter=' '), | ||||
|             'episode': episode.get('index_title'), | ||||
|             'episode_number': int_or_none(episode.get('index')), | ||||
|         } for episode in season_info['episodes']] | ||||
|  | ||||
|         entries = sorted(entries, key=lambda entry: entry.get('episode_number')) | ||||
|  | ||||
|         return self.playlist_result( | ||||
|             entries, bangumi_id, | ||||
|             season_info.get('bangumi_title'), season_info.get('evaluate')) | ||||
|   | ||||
| @@ -19,7 +19,7 @@ class BioBioChileTVIE(InfoExtractor): | ||||
|             'id': 'sobre-camaras-y-camarillas-parlamentarias', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'Sobre Cámaras y camarillas parlamentarias', | ||||
|             'thumbnail': r're:^https?://.*\.jpg$', | ||||
|             'thumbnail': 're:^https?://.*\.jpg$', | ||||
|             'uploader': 'Fernando Atria', | ||||
|         }, | ||||
|         'skip': 'URL expired and redirected to http://www.biobiochile.cl/portada/bbtv/index.html', | ||||
| @@ -31,7 +31,7 @@ class BioBioChileTVIE(InfoExtractor): | ||||
|             'id': 'natalia-valdebenito-repasa-a-diputado-hasbun-paso-a-la-categoria-de-hablar-brutalidades', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'Natalia Valdebenito repasa a diputado Hasbún: Pasó a la categoría de hablar brutalidades', | ||||
|             'thumbnail': r're:^https?://.*\.jpg$', | ||||
|             'thumbnail': 're:^https?://.*\.jpg$', | ||||
|             'uploader': 'Piangella Obrador', | ||||
|         }, | ||||
|         'params': { | ||||
|   | ||||
| @@ -24,8 +24,7 @@ class BIQLEIE(InfoExtractor): | ||||
|             'ext': 'mp4', | ||||
|             'title': 'Ребенок в шоке от автоматической мойки', | ||||
|             'uploader': 'Dmitry Kotov', | ||||
|         }, | ||||
|         'skip': ' This video was marked as adult.  Embedding adult videos on external sites is prohibited.', | ||||
|         } | ||||
|     }] | ||||
|  | ||||
|     def _real_extract(self, url): | ||||
|   | ||||
| @@ -1,4 +1,3 @@ | ||||
| # coding: utf-8 | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| import re | ||||
| @@ -21,22 +20,6 @@ class BloombergIE(InfoExtractor): | ||||
|         'params': { | ||||
|             'format': 'best[format_id^=hds]', | ||||
|         }, | ||||
|     }, { | ||||
|         # video ID in BPlayer(...) | ||||
|         'url': 'http://www.bloomberg.com/features/2016-hello-world-new-zealand/', | ||||
|         'info_dict': { | ||||
|             'id': '938c7e72-3f25-4ddb-8b85-a9be731baa74', | ||||
|             'ext': 'flv', | ||||
|             'title': 'Meet the Real-Life Tech Wizards of Middle Earth', | ||||
|             'description': 'Hello World, Episode 1: New Zealand’s freaky AI babies, robot exoskeletons, and a virtual you.', | ||||
|         }, | ||||
|         'params': { | ||||
|             'format': 'best[format_id^=hds]', | ||||
|         }, | ||||
|     }, { | ||||
|         # data-bmmrid= | ||||
|         'url': 'https://www.bloomberg.com/politics/articles/2017-02-08/le-pen-aide-briefed-french-central-banker-on-plan-to-print-money', | ||||
|         'only_matching': True, | ||||
|     }, { | ||||
|         'url': 'http://www.bloomberg.com/news/articles/2015-11-12/five-strange-things-that-have-been-happening-in-financial-markets', | ||||
|         'only_matching': True, | ||||
| @@ -49,14 +32,8 @@ class BloombergIE(InfoExtractor): | ||||
|         name = self._match_id(url) | ||||
|         webpage = self._download_webpage(url, name) | ||||
|         video_id = self._search_regex( | ||||
|             (r'["\']bmmrId["\']\s*:\s*(["\'])(?P<id>(?:(?!\1).)+)\1', | ||||
|              r'videoId\s*:\s*(["\'])(?P<id>(?:(?!\1).)+)\1', | ||||
|              r'data-bmmrid=(["\'])(?P<id>(?:(?!\1).)+)\1'), | ||||
|             webpage, 'id', group='id', default=None) | ||||
|         if not video_id: | ||||
|             bplayer_data = self._parse_json(self._search_regex( | ||||
|                 r'BPlayer\(null,\s*({[^;]+})\);', webpage, 'id'), name) | ||||
|             video_id = bplayer_data['id'] | ||||
|             r'["\']bmmrId["\']\s*:\s*(["\'])(?P<url>.+?)\1', | ||||
|             webpage, 'id', group='url') | ||||
|         title = re.sub(': Video$', '', self._og_search_title(webpage)) | ||||
|  | ||||
|         embed_info = self._download_json( | ||||
|   | ||||
| @@ -12,7 +12,7 @@ from ..utils import ( | ||||
|  | ||||
| class BpbIE(InfoExtractor): | ||||
|     IE_DESC = 'Bundeszentrale für politische Bildung' | ||||
|     _VALID_URL = r'https?://(?:www\.)?bpb\.de/mediathek/(?P<id>[0-9]+)/' | ||||
|     _VALID_URL = r'https?://www\.bpb\.de/mediathek/(?P<id>[0-9]+)/' | ||||
|  | ||||
|     _TEST = { | ||||
|         'url': 'http://www.bpb.de/mediathek/297/joachim-gauck-zu-1989-und-die-erinnerung-an-die-ddr', | ||||
|   | ||||
| @@ -1,74 +1,31 @@ | ||||
| # coding: utf-8 | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| from .adobepass import AdobePassIE | ||||
| from ..utils import ( | ||||
|     smuggle_url, | ||||
|     update_url_query, | ||||
|     int_or_none, | ||||
| ) | ||||
| from .common import InfoExtractor | ||||
| from ..utils import smuggle_url | ||||
|  | ||||
|  | ||||
| class BravoTVIE(AdobePassIE): | ||||
|     _VALID_URL = r'https?://(?:www\.)?bravotv\.com/(?:[^/]+/)+(?P<id>[^/?#]+)' | ||||
|     _TESTS = [{ | ||||
| class BravoTVIE(InfoExtractor): | ||||
|     _VALID_URL = r'https?://(?:www\.)?bravotv\.com/(?:[^/]+/)+videos/(?P<id>[^/?]+)' | ||||
|     _TEST = { | ||||
|         'url': 'http://www.bravotv.com/last-chance-kitchen/season-5/videos/lck-ep-12-fishy-finale', | ||||
|         'md5': '9086d0b7ef0ea2aabc4781d75f4e5863', | ||||
|         'md5': 'd60cdf68904e854fac669bd26cccf801', | ||||
|         'info_dict': { | ||||
|             'id': 'zHyk1_HU_mPy', | ||||
|             'id': 'LitrBdX64qLn', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'LCK Ep 12: Fishy Finale', | ||||
|             'description': 'S13/E12: Two eliminated chefs have just 12 minutes to cook up a delicious fish dish.', | ||||
|             'title': 'Last Chance Kitchen Returns', | ||||
|             'description': 'S13: Last Chance Kitchen Returns for Top Chef Season 13', | ||||
|             'timestamp': 1448926740, | ||||
|             'upload_date': '20151130', | ||||
|             'uploader': 'NBCU-BRAV', | ||||
|             'upload_date': '20160302', | ||||
|             'timestamp': 1456945320, | ||||
|         } | ||||
|     }, { | ||||
|         'url': 'http://www.bravotv.com/below-deck/season-3/ep-14-reunion-part-1', | ||||
|         'only_matching': True, | ||||
|     }] | ||||
|     } | ||||
|  | ||||
|     def _real_extract(self, url): | ||||
|         display_id = self._match_id(url) | ||||
|         webpage = self._download_webpage(url, display_id) | ||||
|         settings = self._parse_json(self._search_regex( | ||||
|             r'jQuery\.extend\(Drupal\.settings\s*,\s*({.+?})\);', webpage, 'drupal settings'), | ||||
|             display_id) | ||||
|         info = {} | ||||
|         query = { | ||||
|             'mbr': 'true', | ||||
|         } | ||||
|         account_pid, release_pid = [None] * 2 | ||||
|         tve = settings.get('sharedTVE') | ||||
|         if tve: | ||||
|             query['manifest'] = 'm3u' | ||||
|             account_pid = 'HNK2IC' | ||||
|             release_pid = tve['release_pid'] | ||||
|             if tve.get('entitlement') == 'auth': | ||||
|                 adobe_pass = settings.get('adobePass', {}) | ||||
|                 resource = self._get_mvpd_resource( | ||||
|                     adobe_pass.get('adobePassResourceId', 'bravo'), | ||||
|                     tve['title'], release_pid, tve.get('rating')) | ||||
|                 query['auth'] = self._extract_mvpd_auth( | ||||
|                     url, release_pid, adobe_pass.get('adobePassRequestorId', 'bravo'), resource) | ||||
|         else: | ||||
|             shared_playlist = settings['shared_playlist'] | ||||
|             account_pid = shared_playlist['account_pid'] | ||||
|             metadata = shared_playlist['video_metadata'][shared_playlist['default_clip']] | ||||
|             release_pid = metadata['release_pid'] | ||||
|             info.update({ | ||||
|                 'title': metadata['title'], | ||||
|                 'description': metadata.get('description'), | ||||
|                 'season_number': int_or_none(metadata.get('season_num')), | ||||
|                 'episode_number': int_or_none(metadata.get('episode_num')), | ||||
|             }) | ||||
|             query['switch'] = 'progressive' | ||||
|         info.update({ | ||||
|             '_type': 'url_transparent', | ||||
|             'id': release_pid, | ||||
|             'url': smuggle_url(update_url_query( | ||||
|                 'http://link.theplatform.com/s/%s/%s' % (account_pid, release_pid), | ||||
|                 query), {'force_smil_url': True}), | ||||
|             'ie_key': 'ThePlatform', | ||||
|         }) | ||||
|         return info | ||||
|         video_id = self._match_id(url) | ||||
|         webpage = self._download_webpage(url, video_id) | ||||
|         account_pid = self._search_regex(r'"account_pid"\s*:\s*"([^"]+)"', webpage, 'account pid') | ||||
|         release_pid = self._search_regex(r'"release_pid"\s*:\s*"([^"]+)"', webpage, 'release pid') | ||||
|         return self.url_result(smuggle_url( | ||||
|             'http://link.theplatform.com/s/%s/%s?mbr=true&switch=progressive' % (account_pid, release_pid), | ||||
|             {'force_smil_url': True}), 'ThePlatform', release_pid) | ||||
|   | ||||
| @@ -1,9 +1,9 @@ | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| import re | ||||
| import json | ||||
|  | ||||
| from .common import InfoExtractor | ||||
| from ..compat import compat_str | ||||
| from ..utils import ( | ||||
|     int_or_none, | ||||
|     parse_age_limit, | ||||
| @@ -11,7 +11,7 @@ from ..utils import ( | ||||
|  | ||||
|  | ||||
| class BreakIE(InfoExtractor): | ||||
|     _VALID_URL = r'https?://(?:www\.)?(?P<site>break|screenjunkies)\.com/video/(?P<display_id>[^/]+?)(?:-(?P<id>\d+))?(?:[/?#&]|$)' | ||||
|     _VALID_URL = r'https?://(?:www\.)?break\.com/video/(?:[^/]+/)*.+-(?P<id>\d+)' | ||||
|     _TESTS = [{ | ||||
|         'url': 'http://www.break.com/video/when-girls-act-like-guys-2468056', | ||||
|         'info_dict': { | ||||
| @@ -20,124 +20,45 @@ class BreakIE(InfoExtractor): | ||||
|             'title': 'When Girls Act Like D-Bags', | ||||
|             'age_limit': 13, | ||||
|         } | ||||
|     }, { | ||||
|         'url': 'http://www.screenjunkies.com/video/best-quentin-tarantino-movie-2841915', | ||||
|         'md5': '5c2b686bec3d43de42bde9ec047536b0', | ||||
|         'info_dict': { | ||||
|             'id': '2841915', | ||||
|             'display_id': 'best-quentin-tarantino-movie', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'Best Quentin Tarantino Movie', | ||||
|             'thumbnail': r're:^https?://.*\.jpg', | ||||
|             'duration': 3671, | ||||
|             'age_limit': 13, | ||||
|             'tags': list, | ||||
|         }, | ||||
|     }, { | ||||
|         'url': 'http://www.screenjunkies.com/video/honest-trailers-the-dark-knight', | ||||
|         'info_dict': { | ||||
|             'id': '2348808', | ||||
|             'display_id': 'honest-trailers-the-dark-knight', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'Honest Trailers - The Dark Knight', | ||||
|             'thumbnail': r're:^https?://.*\.(?:jpg|png)', | ||||
|             'age_limit': 10, | ||||
|             'tags': list, | ||||
|         }, | ||||
|     }, { | ||||
|         # requires subscription but worked around | ||||
|         'url': 'http://www.screenjunkies.com/video/knocking-dead-ep-1-the-show-so-far-3003285', | ||||
|         'info_dict': { | ||||
|             'id': '3003285', | ||||
|             'display_id': 'knocking-dead-ep-1-the-show-so-far', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'State of The Dead Recap: Knocking Dead Pilot', | ||||
|             'thumbnail': r're:^https?://.*\.jpg', | ||||
|             'duration': 3307, | ||||
|             'age_limit': 13, | ||||
|             'tags': list, | ||||
|         }, | ||||
|     }, { | ||||
|         'url': 'http://www.break.com/video/ugc/baby-flex-2773063', | ||||
|         'only_matching': True, | ||||
|     }] | ||||
|  | ||||
|     _DEFAULT_BITRATES = (48, 150, 320, 496, 864, 2240, 3264) | ||||
|  | ||||
|     def _real_extract(self, url): | ||||
|         site, display_id, video_id = re.match(self._VALID_URL, url).groups() | ||||
|  | ||||
|         if not video_id: | ||||
|             webpage = self._download_webpage(url, display_id) | ||||
|             video_id = self._search_regex( | ||||
|                 (r'src=["\']/embed/(\d+)', r'data-video-content-id=["\'](\d+)'), | ||||
|                 webpage, 'video id') | ||||
|  | ||||
|         video_id = self._match_id(url) | ||||
|         webpage = self._download_webpage( | ||||
|             'http://www.%s.com/embed/%s' % (site, video_id), | ||||
|             display_id, 'Downloading video embed page') | ||||
|         embed_vars = self._parse_json( | ||||
|             self._search_regex( | ||||
|                 r'(?s)embedVars\s*=\s*({.+?})\s*</script>', webpage, 'embed vars'), | ||||
|             display_id) | ||||
|             'http://www.break.com/embed/%s' % video_id, video_id) | ||||
|         info = json.loads(self._search_regex( | ||||
|             r'var embedVars = ({.*})\s*?</script>', | ||||
|             webpage, 'info json', flags=re.DOTALL)) | ||||
|  | ||||
|         youtube_id = embed_vars.get('youtubeId') | ||||
|         youtube_id = info.get('youtubeId') | ||||
|         if youtube_id: | ||||
|             return self.url_result(youtube_id, 'Youtube') | ||||
|  | ||||
|         title = embed_vars['contentName'] | ||||
|         formats = [{ | ||||
|             'url': media['uri'] + '?' + info['AuthToken'], | ||||
|             'tbr': media['bitRate'], | ||||
|             'width': media['width'], | ||||
|             'height': media['height'], | ||||
|         } for media in info['media'] if media.get('mediaPurpose') == 'play'] | ||||
|  | ||||
|         formats = [] | ||||
|         bitrates = [] | ||||
|         for f in embed_vars.get('media', []): | ||||
|             if not f.get('uri') or f.get('mediaPurpose') != 'play': | ||||
|                 continue | ||||
|             bitrate = int_or_none(f.get('bitRate')) | ||||
|             if bitrate: | ||||
|                 bitrates.append(bitrate) | ||||
|         if not formats: | ||||
|             formats.append({ | ||||
|                 'url': f['uri'], | ||||
|                 'format_id': 'http-%d' % bitrate if bitrate else 'http', | ||||
|                 'width': int_or_none(f.get('width')), | ||||
|                 'height': int_or_none(f.get('height')), | ||||
|                 'tbr': bitrate, | ||||
|                 'format': 'mp4', | ||||
|                 'url': info['videoUri'] | ||||
|             }) | ||||
|  | ||||
|         if not bitrates: | ||||
|             # When subscriptionLevel > 0, i.e. plus subscription is required | ||||
|             # media list will be empty. However, hds and hls uris are still | ||||
|             # available. We can grab them assuming bitrates to be default. | ||||
|             bitrates = self._DEFAULT_BITRATES | ||||
|  | ||||
|         auth_token = embed_vars.get('AuthToken') | ||||
|  | ||||
|         def construct_manifest_url(base_url, ext): | ||||
|             pieces = [base_url] | ||||
|             pieces.extend([compat_str(b) for b in bitrates]) | ||||
|             pieces.append('_kbps.mp4.%s?%s' % (ext, auth_token)) | ||||
|             return ','.join(pieces) | ||||
|  | ||||
|         if bitrates and auth_token: | ||||
|             hds_url = embed_vars.get('hdsUri') | ||||
|             if hds_url: | ||||
|                 formats.extend(self._extract_f4m_formats( | ||||
|                     construct_manifest_url(hds_url, 'f4m'), | ||||
|                     display_id, f4m_id='hds', fatal=False)) | ||||
|             hls_url = embed_vars.get('hlsUri') | ||||
|             if hls_url: | ||||
|                 formats.extend(self._extract_m3u8_formats( | ||||
|                     construct_manifest_url(hls_url, 'm3u8'), | ||||
|                     display_id, 'mp4', entry_protocol='m3u8_native', m3u8_id='hls', fatal=False)) | ||||
|         self._sort_formats(formats) | ||||
|  | ||||
|         duration = int_or_none(info.get('videoLengthInSeconds')) | ||||
|         age_limit = parse_age_limit(info.get('audienceRating')) | ||||
|  | ||||
|         return { | ||||
|             'id': video_id, | ||||
|             'display_id': display_id, | ||||
|             'title': title, | ||||
|             'thumbnail': embed_vars.get('thumbUri'), | ||||
|             'duration': int_or_none(embed_vars.get('videoLengthInSeconds')) or None, | ||||
|             'age_limit': parse_age_limit(embed_vars.get('audienceRating')), | ||||
|             'tags': embed_vars.get('tags', '').split(','), | ||||
|             'title': info['contentName'], | ||||
|             'thumbnail': info['thumbUri'], | ||||
|             'duration': duration, | ||||
|             'age_limit': age_limit, | ||||
|             'formats': formats, | ||||
|         } | ||||
|   | ||||
| @@ -1,4 +1,4 @@ | ||||
| # coding: utf-8 | ||||
| # encoding: utf-8 | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| import re | ||||
| @@ -179,7 +179,7 @@ class BrightcoveLegacyIE(InfoExtractor): | ||||
|  | ||||
|         params = {} | ||||
|  | ||||
|         playerID = find_param('playerID') or find_param('playerId') | ||||
|         playerID = find_param('playerID') | ||||
|         if playerID is None: | ||||
|             raise ExtractorError('Cannot find player ID') | ||||
|         params['playerID'] = playerID | ||||
| @@ -204,7 +204,7 @@ class BrightcoveLegacyIE(InfoExtractor): | ||||
|         #   // build Brightcove <object /> XML | ||||
|         # } | ||||
|         m = re.search( | ||||
|             r'''(?x)customBC\.createVideo\( | ||||
|             r'''(?x)customBC.\createVideo\( | ||||
|                 .*?                                                  # skipping width and height | ||||
|                 ["\'](?P<playerID>\d+)["\']\s*,\s*                   # playerID | ||||
|                 ["\'](?P<playerKey>AQ[^"\']{48})[^"\']*["\']\s*,\s*  # playerKey begins with AQ and is 50 characters | ||||
| @@ -232,16 +232,13 @@ class BrightcoveLegacyIE(InfoExtractor): | ||||
|         """Return a list of all Brightcove URLs from the webpage """ | ||||
|  | ||||
|         url_m = re.search( | ||||
|             r'''(?x) | ||||
|                 <meta\s+ | ||||
|                     (?:property|itemprop)=([\'"])(?:og:video|embedURL)\1[^>]+ | ||||
|                     content=([\'"])(?P<url>https?://(?:secure|c)\.brightcove.com/(?:(?!\2).)+)\2 | ||||
|             ''', webpage) | ||||
|             r'<meta\s+property=[\'"]og:video[\'"]\s+content=[\'"](https?://(?:secure|c)\.brightcove.com/[^\'"]+)[\'"]', | ||||
|             webpage) | ||||
|         if url_m: | ||||
|             url = unescapeHTML(url_m.group('url')) | ||||
|             url = unescapeHTML(url_m.group(1)) | ||||
|             # Some sites don't add it, we can't download with this url, for example: | ||||
|             # http://www.ktvu.com/videos/news/raw-video-caltrain-releases-video-of-man-almost/vCTZdY/ | ||||
|             if 'playerKey' in url or 'videoId' in url or 'idVideo' in url: | ||||
|             if 'playerKey' in url or 'videoId' in url: | ||||
|                 return [url] | ||||
|  | ||||
|         matches = re.findall( | ||||
| @@ -262,7 +259,7 @@ class BrightcoveLegacyIE(InfoExtractor): | ||||
|         url, smuggled_data = unsmuggle_url(url, {}) | ||||
|  | ||||
|         # Change the 'videoId' and others field to '@videoPlayer' | ||||
|         url = re.sub(r'(?<=[?&])(videoI(d|D)|idVideo|bctid)', '%40videoPlayer', url) | ||||
|         url = re.sub(r'(?<=[?&])(videoI(d|D)|bctid)', '%40videoPlayer', url) | ||||
|         # Change bckey (used by bcove.me urls) to playerKey | ||||
|         url = re.sub(r'(?<=[?&])bckey', 'playerKey', url) | ||||
|         mobj = re.match(self._VALID_URL, url) | ||||
| @@ -551,7 +548,7 @@ class BrightcoveNewIE(InfoExtractor): | ||||
|             container = source.get('container') | ||||
|             ext = mimetype2ext(source.get('type')) | ||||
|             src = source.get('src') | ||||
|             if ext == 'ism' or container == 'WVM': | ||||
|             if ext == 'ism': | ||||
|                 continue | ||||
|             elif ext == 'm3u8' or container == 'M2TS': | ||||
|                 if not src: | ||||
| @@ -624,21 +621,15 @@ class BrightcoveNewIE(InfoExtractor): | ||||
|                     'url': text_track['src'], | ||||
|                 }) | ||||
|  | ||||
|         is_live = False | ||||
|         duration = float_or_none(json_data.get('duration'), 1000) | ||||
|         if duration and duration < 0: | ||||
|             is_live = True | ||||
|  | ||||
|         return { | ||||
|             'id': video_id, | ||||
|             'title': self._live_title(title) if is_live else title, | ||||
|             'title': title, | ||||
|             'description': clean_html(json_data.get('description')), | ||||
|             'thumbnail': json_data.get('thumbnail') or json_data.get('poster'), | ||||
|             'duration': duration, | ||||
|             'duration': float_or_none(json_data.get('duration'), 1000), | ||||
|             'timestamp': parse_iso8601(json_data.get('published_at')), | ||||
|             'uploader_id': account_id, | ||||
|             'formats': formats, | ||||
|             'subtitles': subtitles, | ||||
|             'tags': json_data.get('tags', []), | ||||
|             'is_live': is_live, | ||||
|         } | ||||
|   | ||||
| @@ -1,5 +1,6 @@ | ||||
| from __future__ import unicode_literals | ||||
|  | ||||
| import json | ||||
| import re | ||||
|  | ||||
| from .common import InfoExtractor | ||||
| @@ -7,63 +8,17 @@ from ..utils import ExtractorError | ||||
|  | ||||
|  | ||||
| class BYUtvIE(InfoExtractor): | ||||
|     _VALID_URL = r'https?://(?:www\.)?byutv\.org/watch/(?!event/)(?P<id>[0-9a-f-]+)(?:/(?P<display_id>[^/?#&]+))?' | ||||
|     _TESTS = [{ | ||||
|         'url': 'http://www.byutv.org/watch/6587b9a3-89d2-42a6-a7f7-fd2f81840a7d/studio-c-season-5-episode-5', | ||||
|         'info_dict': { | ||||
|             'id': '6587b9a3-89d2-42a6-a7f7-fd2f81840a7d', | ||||
|             'display_id': 'studio-c-season-5-episode-5', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'Season 5 Episode 5', | ||||
|             'description': 'md5:e07269172baff037f8e8bf9956bc9747', | ||||
|             'thumbnail': r're:^https?://.*\.jpg$', | ||||
|             'duration': 1486.486, | ||||
|         }, | ||||
|         'params': { | ||||
|             'skip_download': True, | ||||
|         }, | ||||
|         'add_ie': ['Ooyala'], | ||||
|     }, { | ||||
|         'url': 'http://www.byutv.org/watch/6587b9a3-89d2-42a6-a7f7-fd2f81840a7d', | ||||
|         'only_matching': True, | ||||
|     }] | ||||
|  | ||||
|     def _real_extract(self, url): | ||||
|         mobj = re.match(self._VALID_URL, url) | ||||
|         video_id = mobj.group('id') | ||||
|         display_id = mobj.group('display_id') or video_id | ||||
|  | ||||
|         webpage = self._download_webpage(url, display_id) | ||||
|         episode_code = self._search_regex( | ||||
|             r'(?s)episode:(.*?\}),\s*\n', webpage, 'episode information') | ||||
|  | ||||
|         ep = self._parse_json( | ||||
|             episode_code, display_id, transform_source=lambda s: | ||||
|             re.sub(r'(\n\s+)([a-zA-Z]+):\s+\'(.*?)\'', r'\1"\2": "\3"', s)) | ||||
|  | ||||
|         if ep['providerType'] != 'Ooyala': | ||||
|             raise ExtractorError('Unsupported provider %s' % ep['provider']) | ||||
|  | ||||
|         return { | ||||
|             '_type': 'url_transparent', | ||||
|             'ie_key': 'Ooyala', | ||||
|             'url': 'ooyala:%s' % ep['providerId'], | ||||
|             'id': video_id, | ||||
|             'display_id': display_id, | ||||
|             'title': ep['title'], | ||||
|             'description': ep.get('description'), | ||||
|             'thumbnail': ep.get('imageThumbnail'), | ||||
|         } | ||||
|  | ||||
|  | ||||
| class BYUtvEventIE(InfoExtractor): | ||||
|     _VALID_URL = r'https?://(?:www\.)?byutv\.org/watch/event/(?P<id>[0-9a-f-]+)' | ||||
|     _VALID_URL = r'^https?://(?:www\.)?byutv.org/watch/[0-9a-f-]+/(?P<video_id>[^/?#]+)' | ||||
|     _TEST = { | ||||
|         'url': 'http://www.byutv.org/watch/event/29941b9b-8bf6-48d2-aebf-7a87add9e34b', | ||||
|         'url': 'http://www.byutv.org/watch/6587b9a3-89d2-42a6-a7f7-fd2f81840a7d/studio-c-season-5-episode-5', | ||||
|         'md5': '05850eb8c749e2ee05ad5a1c34668493', | ||||
|         'info_dict': { | ||||
|             'id': '29941b9b-8bf6-48d2-aebf-7a87add9e34b', | ||||
|             'id': 'studio-c-season-5-episode-5', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'Toledo vs. BYU (9/30/16)', | ||||
|             'description': 'md5:e07269172baff037f8e8bf9956bc9747', | ||||
|             'title': 'Season 5 Episode 5', | ||||
|             'thumbnail': 're:^https?://.*\.jpg$', | ||||
|             'duration': 1486.486, | ||||
|         }, | ||||
|         'params': { | ||||
|             'skip_download': True, | ||||
| @@ -72,22 +27,25 @@ class BYUtvEventIE(InfoExtractor): | ||||
|     } | ||||
|  | ||||
|     def _real_extract(self, url): | ||||
|         video_id = self._match_id(url) | ||||
|         mobj = re.match(self._VALID_URL, url) | ||||
|         video_id = mobj.group('video_id') | ||||
|  | ||||
|         webpage = self._download_webpage(url, video_id) | ||||
|         episode_code = self._search_regex( | ||||
|             r'(?s)episode:(.*?\}),\s*\n', webpage, 'episode information') | ||||
|         episode_json = re.sub( | ||||
|             r'(\n\s+)([a-zA-Z]+):\s+\'(.*?)\'', r'\1"\2": "\3"', episode_code) | ||||
|         ep = json.loads(episode_json) | ||||
|  | ||||
|         ooyala_id = self._search_regex( | ||||
|             r'providerId\s*:\s*(["\'])(?P<id>(?:(?!\1).)+)\1', | ||||
|             webpage, 'ooyala id', group='id') | ||||
|  | ||||
|         title = self._search_regex( | ||||
|             r'class=["\']description["\'][^>]*>\s*<h1>([^<]+)</h1>', webpage, | ||||
|             'title').strip() | ||||
|  | ||||
|         return { | ||||
|             '_type': 'url_transparent', | ||||
|             'ie_key': 'Ooyala', | ||||
|             'url': 'ooyala:%s' % ooyala_id, | ||||
|             'id': video_id, | ||||
|             'title': title, | ||||
|         } | ||||
|         if ep['providerType'] == 'Ooyala': | ||||
|             return { | ||||
|                 '_type': 'url_transparent', | ||||
|                 'ie_key': 'Ooyala', | ||||
|                 'url': 'ooyala:%s' % ep['providerId'], | ||||
|                 'id': video_id, | ||||
|                 'title': ep['title'], | ||||
|                 'description': ep.get('description'), | ||||
|                 'thumbnail': ep.get('imageThumbnail'), | ||||
|             } | ||||
|         else: | ||||
|             raise ExtractorError('Unsupported provider %s' % ep['provider']) | ||||
|   | ||||
| @@ -26,7 +26,7 @@ class CamdemyIE(InfoExtractor): | ||||
|             'id': '5181', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'Ch1-1 Introduction, Signals (02-23-2012)', | ||||
|             'thumbnail': r're:^https?://.*\.jpg$', | ||||
|             'thumbnail': 're:^https?://.*\.jpg$', | ||||
|             'creator': 'ss11spring', | ||||
|             'duration': 1591, | ||||
|             'upload_date': '20130114', | ||||
| @@ -41,7 +41,7 @@ class CamdemyIE(InfoExtractor): | ||||
|             'id': '13885', | ||||
|             'ext': 'mp4', | ||||
|             'title': 'EverCam + Camdemy QuickStart', | ||||
|             'thumbnail': r're:^https?://.*\.jpg$', | ||||
|             'thumbnail': 're:^https?://.*\.jpg$', | ||||
|             'description': 'md5:2a9f989c2b153a2342acee579c6e7db6', | ||||
|             'creator': 'evercam', | ||||
|             'duration': 318, | ||||
| @@ -112,7 +112,7 @@ class CamdemyIE(InfoExtractor): | ||||
|  | ||||
|  | ||||
| class CamdemyFolderIE(InfoExtractor): | ||||
|     _VALID_URL = r'https?://(?:www\.)?camdemy\.com/folder/(?P<id>\d+)' | ||||
|     _VALID_URL = r'https?://www.camdemy.com/folder/(?P<id>\d+)' | ||||
|     _TESTS = [{ | ||||
|         # links with trailing slash | ||||
|         'url': 'http://www.camdemy.com/folder/450', | ||||
|   | ||||
Some files were not shown because too many files have changed in this diff.
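
Note on the `CamdemyFolderIE` hunk above: the two `_VALID_URL` variants differ only in whether the dots in the host name are escaped and whether `www.` is optional. The sketch below is not part of the diff. It copies the two patterns and shows how they behave. The helper `folder_id` and the two extra URLs are hypothetical and exist only for illustration; the first URL is the one from the extractor's own test.

```python
import re

# Patterns copied from the CamdemyFolderIE hunk in this diff.
ESCAPED = r'https?://(?:www\.)?camdemy\.com/folder/(?P<id>\d+)'
UNESCAPED = r'https?://www.camdemy.com/folder/(?P<id>\d+)'

# URL taken from the extractor's test case.
GOOD_URL = 'http://www.camdemy.com/folder/450'
# Hypothetical URLs, made up for illustration only.
NO_WWW_URL = 'http://camdemy.com/folder/450'
LOOKALIKE_URL = 'http://wwwXcamdemyYcom/folder/450'


def folder_id(pattern, url):
    """Return the captured folder id, or None when the URL does not match."""
    mobj = re.match(pattern, url)
    return mobj.group('id') if mobj else None


if __name__ == '__main__':
    for name, pattern in (('escaped', ESCAPED), ('unescaped', UNESCAPED)):
        for url in (GOOD_URL, NO_WWW_URL, LOOKALIKE_URL):
            print(name, url, folder_id(pattern, url))
```

An unescaped `.` matches any character, so the unescaped variant accepts the lookalike host while rejecting the legitimate URL without `www.`; the escaped variant does the opposite.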