Popular recipes tagged "html" but not "crawler"http://code.activestate.com/recipes/tags/html-crawler/2015-03-07T20:22:54-08:00ActiveState Code RecipesConvert HTML to PDF with the PDFcrowd API (Python)
2015-03-07T20:22:54-08:00Vasudev Ramhttp://code.activestate.com/recipes/users/4173351/http://code.activestate.com/recipes/579032-convert-html-to-pdf-with-the-pdfcrowd-api/
<p style="color: grey">
Python
recipe 579032
by <a href="/recipes/users/4173351/">Vasudev Ram</a>
(<a href="/recipes/tags/api/">api</a>, <a href="/recipes/tags/html/">html</a>, <a href="/recipes/tags/pdf/">pdf</a>, <a href="/recipes/tags/pdfcrowd/">pdfcrowd</a>).
</p>
<p>This recipe shows how to use Python and the PDFcrowd API to convert HTML content to PDF. The HTML input can come from a remote URL, a local HTML file, or a string containing HTML.</p>
Composing a POSTable HTTP request with multipart/form-data Content-Type to simulate a form/file upload. (Python)
2014-03-08T17:34:38-08:00István Pásztorhttp://code.activestate.com/recipes/users/4189380/http://code.activestate.com/recipes/578846-composing-a-postable-http-request-with-multipartfo/
<p style="color: grey">
Python
recipe 578846
by <a href="/recipes/users/4189380/">István Pásztor</a>
(<a href="/recipes/tags/field/">field</a>, <a href="/recipes/tags/file/">file</a>, <a href="/recipes/tags/form/">form</a>, <a href="/recipes/tags/html/">html</a>, <a href="/recipes/tags/httpclient/">httpclient</a>, <a href="/recipes/tags/mime/">mime</a>, <a href="/recipes/tags/multipart/">multipart</a>, <a href="/recipes/tags/post/">post</a>, <a href="/recipes/tags/upload/">upload</a>, <a href="/recipes/tags/web/">web</a>).
Revision 5.
</p>
<p>This code is useful if you are using a http client and you want to simulate a request similar to that of a browser that submits a form containing several input fields (including file upload fields). I've used this with python 2.x.</p>
Pretty and Stated HTMLParsers (Python)
2013-12-14T00:28:36-08:00Ádám Szieberthhttp://code.activestate.com/recipes/users/4188745/http://code.activestate.com/recipes/578787-pretty-and-stated-htmlparsers/
<p style="color: grey">
Python
recipe 578787
by <a href="/recipes/users/4188745/">Ádám Szieberth</a>
(<a href="/recipes/tags/html/">html</a>, <a href="/recipes/tags/htmlparser/">htmlparser</a>, <a href="/recipes/tags/state/">state</a>).
Revision 2.
</p>
<p>Extensions of html.parser.HTMLParser().</p>
<p>PrettyHTMLParser() does not splits data into chuncks by HTML entities.
StatedHTMLParser() can have many state-dependent handlers which helps parsing HTML pages alot.</p>
Python HTML Stripper (Python)
2013-04-08T13:58:00-07:00Granning Stolinehttp://code.activestate.com/recipes/users/4186069/http://code.activestate.com/recipes/578511-python-html-stripper/
<p style="color: grey">
Python
recipe 578511
by <a href="/recipes/users/4186069/">Granning Stoline</a>
(<a href="/recipes/tags/html/">html</a>, <a href="/recipes/tags/python/">python</a>, <a href="/recipes/tags/stripper/">stripper</a>).
</p>
<p>Python HTML Stripper</p>
Easy to use, easy to read, python based HTML generation (Python)
2013-06-21T14:47:21-07:00Pavloshttp://code.activestate.com/recipes/users/4185038/http://code.activestate.com/recipes/578436-easy-to-use-easy-to-read-python-based-html-generat/
<p style="color: grey">
Python
recipe 578436
by <a href="/recipes/users/4185038/">Pavlos</a>
(<a href="/recipes/tags/html/">html</a>, <a href="/recipes/tags/template/">template</a>, <a href="/recipes/tags/text/">text</a>).
Revision 4.
</p>
<p>I was looking for a simple way to generate HTML directly in python that does not require learning a new template 'language' nor requires the installation of a big complex package. Closest thing I found was James Casbon's attempt(https://gist.github.com/1461441). This is my version of the same idea. </p>
<p>(2013-04-21) added some simplifications and support for switching off string interpolation. Added to github:</p>
<p><a href="https://github.com/pavlos-christoforou/web" rel="nofollow">https://github.com/pavlos-christoforou/web</a></p>
Komodo JS Macro -- Escape HTML Special Characters in current file (JavaScript)
2012-05-25T20:04:49-07:00Keegan Brownhttp://code.activestate.com/recipes/users/4182206/http://code.activestate.com/recipes/578146-komodo-js-macro-escape-html-special-characters-in-/
<p style="color: grey">
JavaScript
recipe 578146
by <a href="/recipes/users/4182206/">Keegan Brown</a>
(<a href="/recipes/tags/characters/">characters</a>, <a href="/recipes/tags/html/">html</a>, <a href="/recipes/tags/macro/">macro</a>, <a href="/recipes/tags/special/">special</a>, <a href="/recipes/tags/xhtml/">xhtml</a>).
</p>
<p>Escapes all Special Characters to their HTML Special Character equivalent.</p>
Safe HTML string and unicode (Python)
2012-01-10T08:14:14-08:00Garel Alexhttp://code.activestate.com/recipes/users/2757636/http://code.activestate.com/recipes/578008-safe-html-string-and-unicode/
<p style="color: grey">
Python
recipe 578008
by <a href="/recipes/users/2757636/">Garel Alex</a>
(<a href="/recipes/tags/html/">html</a>, <a href="/recipes/tags/security/">security</a>, <a href="/recipes/tags/web/">web</a>).
Revision 2.
</p>
<p>As you display message on a web page, you have to sanitize input data coming from users to avoid <a href="https://en.wikipedia.org/wiki/Cross-site_scripting">XSS</a>. Here is a small recipe where we can use a special class for our string to be sure we get safe all the way long.</p>
Show all the telecommuting jobs from the Python Job Board (Python)
2011-12-09T07:38:28-08:00Victor Yanghttp://code.activestate.com/recipes/users/627255/http://code.activestate.com/recipes/577979-show-all-the-telecommuting-jobs-from-the-python-jo/
<p style="color: grey">
Python
recipe 577979
by <a href="/recipes/users/627255/">Victor Yang</a>
(<a href="/recipes/tags/html/">html</a>, <a href="/recipes/tags/network/">network</a>, <a href="/recipes/tags/screenscrape/">screenscrape</a>).
</p>
<p>It is running as a cronjob on a VPS(Virutal Private Server). The output html can be served by any web server. </p>
ActiveState recipe statistics (Python)
2011-06-02T14:52:50-07:00Kaan Ozturkhttp://code.activestate.com/recipes/users/4178179/http://code.activestate.com/recipes/577732-activestate-recipe-statistics/
<p style="color: grey">
Python
recipe 577732
by <a href="/recipes/users/4178179/">Kaan Ozturk</a>
(<a href="/recipes/tags/html/">html</a>, <a href="/recipes/tags/regular_expressions/">regular_expressions</a>, <a href="/recipes/tags/statistics/">statistics</a>, <a href="/recipes/tags/urllib2/">urllib2</a>, <a href="/recipes/tags/web/">web</a>).
Revision 2.
</p>
<p>Downloads "All Recipe Authors" pages in ActiveState, uses regular expressions to parse author name and number of their recipes on each page. Finally, it displays the recipe submission distribution (the count of how many authors have submitted how many recipes each).</p>
IFrame Example (HTML)
2011-04-21T02:11:57-07:00FB36http://code.activestate.com/recipes/users/4172570/http://code.activestate.com/recipes/577671-iframe-example/
<p style="color: grey">
HTML
recipe 577671
by <a href="/recipes/users/4172570/">FB36</a>
(<a href="/recipes/tags/html/">html</a>, <a href="/recipes/tags/iframe/">iframe</a>).
</p>
<p>IFrame demo. It opens 4 sub-browser windows inside the webpage.</p>
webcheck: site to csv (Python)
2011-03-09T06:37:08-08:00Jervis Whitleyhttp://code.activestate.com/recipes/users/4169341/http://code.activestate.com/recipes/577602-webcheck-site-to-csv/
<p style="color: grey">
Python
recipe 577602
by <a href="/recipes/users/4169341/">Jervis Whitley</a>
(<a href="/recipes/tags/html/">html</a>, <a href="/recipes/tags/linkcheck/">linkcheck</a>, <a href="/recipes/tags/sitemap/">sitemap</a>, <a href="/recipes/tags/webcheck/">webcheck</a>).
Revision 3.
</p>
<p>An extension to Arthur de Jong's excellent webcheck tool (a website link checker) (<a href="http://arthurdejong.org/webcheck" rel="nofollow">http://arthurdejong.org/webcheck</a>) that will read in the resultant webcheck.dat file and create a csv formatted file.</p>
Random URL (Python)
2010-09-12T22:23:09-07:00FB36http://code.activestate.com/recipes/users/4172570/http://code.activestate.com/recipes/577389-random-url/
<p style="color: grey">
Python
recipe 577389
by <a href="/recipes/users/4172570/">FB36</a>
(<a href="/recipes/tags/html/">html</a>, <a href="/recipes/tags/http/">http</a>, <a href="/recipes/tags/url/">url</a>, <a href="/recipes/tags/urllib2/">urllib2</a>, <a href="/recipes/tags/web/">web</a>).
</p>
<p>Finds and displays a random webpage from the Internet.
(Warning: It may take a while!)</p>
Website Text Search (Python)
2010-09-11T17:32:01-07:00FB36http://code.activestate.com/recipes/users/4172570/http://code.activestate.com/recipes/577388-website-text-search/
<p style="color: grey">
Python
recipe 577388
by <a href="/recipes/users/4172570/">FB36</a>
(<a href="/recipes/tags/html/">html</a>, <a href="/recipes/tags/http/">http</a>, <a href="/recipes/tags/url/">url</a>, <a href="/recipes/tags/urllib2/">urllib2</a>, <a href="/recipes/tags/web/">web</a>).
Revision 2.
</p>
<p>Searches a website recursively for the given text string and prints all URLs containing it.</p>
Image Downloader (Python)
2014-02-24T03:49:51-08:00FB36http://code.activestate.com/recipes/users/4172570/http://code.activestate.com/recipes/577385-image-downloader/
<p style="color: grey">
Python
recipe 577385
by <a href="/recipes/users/4172570/">FB36</a>
(<a href="/recipes/tags/html/">html</a>, <a href="/recipes/tags/http/">http</a>, <a href="/recipes/tags/url/">url</a>, <a href="/recipes/tags/urllib2/">urllib2</a>, <a href="/recipes/tags/web/">web</a>).
Revision 4.
</p>
<p>Finds and downloads all images from any given URL.</p>
<p>Important note:</p>
<p>If your download location path has spaces then put quotes around it!</p>
Website Mapper (Python)
2010-09-23T01:23:04-07:00FB36http://code.activestate.com/recipes/users/4172570/http://code.activestate.com/recipes/577392-website-mapper/
<p style="color: grey">
Python
recipe 577392
by <a href="/recipes/users/4172570/">FB36</a>
(<a href="/recipes/tags/html/">html</a>, <a href="/recipes/tags/http/">http</a>, <a href="/recipes/tags/url/">url</a>, <a href="/recipes/tags/web/">web</a>).
Revision 3.
</p>
<p>Prints the tree graph of the given URL. </p>
Komodo Macro - Copy to clipboard code selection in HTML format (JavaScript)
2010-04-14T07:17:59-07:00Davide Ficanohttp://code.activestate.com/recipes/users/4166571/http://code.activestate.com/recipes/577193-komodo-macro-copy-to-clipboard-code-selection-in-h/
<p style="color: grey">
JavaScript
recipe 577193
by <a href="/recipes/users/4166571/">Davide Ficano</a>
(<a href="/recipes/tags/clipboard/">clipboard</a>, <a href="/recipes/tags/html/">html</a>, <a href="/recipes/tags/komodo/">komodo</a>, <a href="/recipes/tags/macro/">macro</a>).
</p>
<p>This macro makes easy to paste on documents (MS Office, Open Office, iWorks Pages) code snippets with syntax highlight and/or line numbers.</p>
<p>This macro copies the selected code text or whole document if no selection is present on clipboard in HTML format.</p>
<p>On OSX you need at least Komodo 6 because previous versions don't support data flavors</p>
Userfriendly Webpage Template (Python)
2010-05-04T07:38:03-07:00david.gaarenstroomhttp://code.activestate.com/recipes/users/4168848/http://code.activestate.com/recipes/577203-userfriendly-webpage-template/
<p style="color: grey">
Python
recipe 577203
by <a href="/recipes/users/4168848/">david.gaarenstroom</a>
(<a href="/recipes/tags/cgi/">cgi</a>, <a href="/recipes/tags/html/">html</a>, <a href="/recipes/tags/httpserver/">httpserver</a>, <a href="/recipes/tags/mvc/">mvc</a>, <a href="/recipes/tags/template/">template</a>, <a href="/recipes/tags/web/">web</a>, <a href="/recipes/tags/webdesign/">webdesign</a>, <a href="/recipes/tags/webpagetemplate/">webpagetemplate</a>).
Revision 5.
</p>
<p>User friendly template class targeted towards Web-page usage and optimized for speed and efficiency.</p>
<p>Tags can be inserted in a template HTML file in a non-intrusive way, by using specially formatted comment strings. Therefore, the template-file can be viewed in a browser, even with prototype data embedded in it, which will later be replaced by dynamic content.
Also, webdesigners can continue to work on the template and upload it without further
modification.</p>
Convert text/enriched MIME to text/html (Python)
2009-06-09T15:08:40-07:00Jack Trainorhttp://code.activestate.com/recipes/users/4076953/http://code.activestate.com/recipes/576800-convert-textenriched-mime-to-texthtml/
<p style="color: grey">
Python
recipe 576800
by <a href="/recipes/users/4076953/">Jack Trainor</a>
(<a href="/recipes/tags/email/">email</a>, <a href="/recipes/tags/html/">html</a>, <a href="/recipes/tags/mime/">mime</a>, <a href="/recipes/tags/text_enriched/">text_enriched</a>).
</p>
<p>Converts text stream in text/enriched MIME format from file or stdin to text/html output to file or stdout.</p>
CommentEditor: HTML editor for online comments (Python)
2009-06-17T12:10:55-07:00Jack Trainorhttp://code.activestate.com/recipes/users/4076953/http://code.activestate.com/recipes/576814-commenteditor-html-editor-for-online-comments/
<p style="color: grey">
Python
recipe 576814
by <a href="/recipes/users/4076953/">Jack Trainor</a>
(<a href="/recipes/tags/editor/">editor</a>, <a href="/recipes/tags/html/">html</a>, <a href="/recipes/tags/wxwidgets/">wxwidgets</a>).
Revision 3.
</p>
<p>Edit online comments with easy addition of HTML tags for bold, italics, underlining, blockquote and anchored links. Then check your work with the preview feature.</p>
<p>Requires wxWidgets.</p>
Serve static web content from within a gzipped tarball to save space using CherryPy (Python)
2009-03-31T18:24:06-07:00Dan McDougallhttp://code.activestate.com/recipes/users/4169722/http://code.activestate.com/recipes/576706-serve-static-web-content-from-within-a-gzipped-tar/
<p style="color: grey">
Python
recipe 576706
by <a href="/recipes/users/4169722/">Dan McDougall</a>
(<a href="/recipes/tags/cherrypy/">cherrypy</a>, <a href="/recipes/tags/compression/">compression</a>, <a href="/recipes/tags/embedded/">embedded</a>, <a href="/recipes/tags/gzip/">gzip</a>, <a href="/recipes/tags/html/">html</a>, <a href="/recipes/tags/http/">http</a>, <a href="/recipes/tags/network/">network</a>, <a href="/recipes/tags/routes/">routes</a>, <a href="/recipes/tags/web/">web</a>, <a href="/recipes/tags/web_server/">web_server</a>).
</p>
<p>This code lets you store all of your static website content inside a gzipped tarball while transparently serving it to HTTP clients on-demand. Perfect for embedded systems where space is limited.</p>