Welcome, guest | Sign In | My Account | Store | Cart

Youtube Playlist Parser/Extractor (Python recipe) by p@ntut$
ActiveState Code (http://code.activestate.com/recipes/578284/)

Parse/extract URLs from playlist in youtube (User's videos not favorites) Link: http://pantuts.com/2013/02/16/youparse-extract-urls-from-youtube/

      # Name: youParse.py
# Version: 1.3
# Author: pantuts
# Description: Parse URLs in Youtube User's Playlist (Video Playlist not Favorites)
# Use python3 and later
# Agreement: You can use, modify, or redistribute this tool under
# the terms of GNU General Public License (GPLv3).
# This tool is for educational purposes only. Any damage you make will not affect the author.
# Usage: python3 youParse.py youtubeURLhere

import re
import urllib.request
import urllib.error
import sys
import time

def crawl(url):
    sTUBE = ''
    cPL = ''
    amp = 0
    final_url = []
    
    if 'list=' in url:
        eq = url.index('=') + 1
        cPL = url[eq:]
        if '&' in url:
            amp = url.index('&')
            cPL = url[eq:amp]
            
    else:
        print('Incorrect Playlist.')
        exit(1)
    
    try:
        yTUBE = urllib.request.urlopen(url).read()
        sTUBE = str(yTUBE)
    except urllib.error.URLError as e:
        print(e.reason)
    
    tmp_mat = re.compile(r'watch\?v=\S+?list=' + cPL)
    mat = re.findall(tmp_mat, sTUBE)

    if mat:
        
        if mat[0] == mat[1]:
            mat.remove(mat[0]) #if there is duplicate, remove
            
        for PL in mat:
            yPL = str(PL)
            if '&' in yPL:
                yPL_amp = yPL.index('&')
            final_url.append('http://www.youtube.com/' + yPL[:yPL_amp])
                
        i = 0
        while i < len(mat):
            sys.stdout.write(final_url[i] + '\n')
            time.sleep(0.04)
            i = i + 1
        
    else:
        print('No videos found.')
        exit(1)
        
if len(sys.argv) < 2 or len(sys.argv) > 2:
    print('USAGE: python3 youParse.py YOUTUBEurl')
    exit(1)
    
else:
    url = sys.argv[1]
    if 'http' not in url:
        url = 'http://' + url
    crawl(url)

      

Tags: extractor, playlist, youtube

Created by p@ntut$ on Fri, 12 Oct 2012 (GPL3)

◄	Python recipes (4591)	►
◄	p@ntut$'s recipes (7)	►

Required Modules

(none specified)

Other Information and Tasks

Licensed under the GPL 3
Viewed 10990 times
Revision 18 (updated 9 years ago)

Accounts

Code Recipes

Feedback & Information

ActiveState

© 2024 ActiveState Software Inc. All rights reserved. ActiveState®, Komodo®, ActiveState Perl Dev Kit®, ActiveState Tcl Dev Kit®, ActivePerl®, ActivePython®, and ActiveTcl® are registered trademarks of ActiveState. All other marks are property of their respective owners.

Youtube Playlist Parser/Extractor (Python recipe) by p@ntut$ ActiveState Code (http://code.activestate.com/recipes/578284/)