Welcome, guest | Sign In | My Account | Store | Cart

Python script to find linux distros details from distrowatch (Python recipe) by Emil george james
ActiveState Code (http://code.activestate.com/recipes/579038/)

this script is a simlpe python script to find linux distros details from distrowatch using beautifulsoup,urllib2 modules.The script finds distros distribution details from distrowatch.com when the distribution name is called as argument.

      from bs4 import BeautifulSoup
from mechanize import Browser
import urllib2 
import sys,re


if len(sys.argv) == 0:
    print "\nSyntax: python %s 'distribution title'" % (sys.argv[0])
    exit()
else :
     distribution = '+'.join(sys.argv[1].split())

try:
  br = Browser()
  br.open("http://distrowatch.com/table.php?distribution="+distribution)
  br.response().read()
  print br.title()
  url = br.geturl()

  content = urllib2.urlopen(url).read()
except urllib2.URLError :
       print "Unable to connect to internet !! OR  not connected to internet !!"
else :
     soup=BeautifulSoup(content)

try :
   title = soup.find("h1").contents[0].strip()
   print "DISTRIBUTION:",title
   ul = soup.findAll("ul")
   li = soup.ul.findAll("li")
   
   for i in li:
       print("{} {}.".format(i.b.text,"".join([a.text for a in i.findAll("a")])))
except:
    print("Link not found Distribution name ERROR")
   
    


    
  
  
  

      

for output and more information visit https://emilgeorgejames.wordpress.com/2015/03/25/python-script-to-find-linux-distros-details-from-distrowatch-3/

Tags: beautifulsoup, internet, module, python, url, web

1 comment

Vatay Világi Norbert 8 years, 3 months ago # | flag

No parser was explicitly specified, so I'm using the best available HTML parser for this system ("lxml"). This usually isn't a problem, but if you run this code on another system, or in a different virtual environment, it may use a different parser and behave differently.

To get rid of this warning, change this (line 24):

 soup=BeautifulSoup(content)

to this:

 soup=BeautifulSoup(content, "lxml")

Created by Emil george james on Thu, 26 Mar 2015 (MIT)

◄	Python recipes (4591)	►
◄	Emil george james's recipes (5)	►

Required Modules

Other Information and Tasks

Licensed under the MIT License
Viewed 9042 times
Revision 1

Accounts

Code Recipes

Feedback & Information

ActiveState

© 2024 ActiveState Software Inc. All rights reserved. ActiveState®, Komodo®, ActiveState Perl Dev Kit®, ActiveState Tcl Dev Kit®, ActivePerl®, ActivePython®, and ActiveTcl® are registered trademarks of ActiveState. All other marks are property of their respective owners.

Python script to find linux distros details from distrowatch (Python recipe) by Emil george james ActiveState Code (http://code.activestate.com/recipes/579038/)