Welcome to Linux Knowledge Base and Tutorial
"The place where you learn linux"
Kitty Hooch

 Create an AccountHome | Submit News | Your Account  

Tutorial Menu
Linux Tutorial Home
Table of Contents

· Introduction to Operating Systems
· Linux Basics
· Working with the System
· Shells and Utilities
· Editing Files
· Basic Administration
· The Operating System
· The X Windowing System
· The Computer Itself
· Networking
· System Monitoring
· Solving Problems
· Security
· Installing and Upgrading
· Linux and Windows

Glossary
MoreInfo
Man Pages
Linux Topics
Test Your Knowledge

Site Menu
Site Map
FAQ
Copyright Info
Terms of Use
Privacy Info
Disclaimer
WorkBoard
Thanks
Donations
Advertising
Masthead / Impressum
Your Account

Communication
Feedback
Forums
Private Messages
Surveys

Features
HOWTOs
News Archive
Submit News
Topics
User Articles
Web Links

Google
Google


The Web
linux-tutorial.info

Who's Online
There are currently, 217 guest(s) and 0 member(s) that are online.

You are an Anonymous user. You can register for free by clicking here

  

html2text



SYNOPSIS

       html2text -help
       html2text -version
       html2text  [  -unparse  |  -check  ]  [ -debug-scanner ] [
       -debug-parser ] [ -rcfile path ]  [  -style  (  compact  |
       pretty ) ] [ -width width ] [ -o output-file ] [ -nobs ] [
       input-uri ...  ]


DESCRIPTION

       html2text reads HTML 3.2 documents  from  the  input-uris,
       formats  each  into a stream of plain text characters (ISO
       8859-1) and writes the result to standard output (or  into
       output-file, if the -o command line option is used).

       Documents  that  are  specified by an URI that begins with
       "http:" (RFC 1738) are retrieved with the Hypertext Trans­
       fer  Protocol (RFC 1945). URIs that begin with "file:" and
       URIs that do not contain a colon specify local files.  All
       other URIs are invalid.

       If  no  input-uris  are  specified  on  the  command line,
       html2text reads from standard input. A dash as the  input-
       uri is an alternate way to specify standard input.

       html2text  understands  all  HTML  3.2 constructs, but can
       render only part of them due to  the  limitations  of  the
       text  output format. However, the program attempts to pro­
       vide good substitutes for the elements it  cannot  render.
       It also accepts syntactically incorrect input and attempts
       to interpret it "reasonably".

       The way in that html2text formats the  HTML  documents  is
       controlled  by formatting properties read from an RC file.
       html2text attempts to read $HOME/.html2textrc (or the file
       specified  by  the  -rcfile  command line option); if that
       file  cannot  be  read,   html2text   attempts   to   read
       /etc/html2textrc.  If no RC file can be read (or if the RC
       file does not override all  formatting  properties),  then
       "reasonable"  defaults  are assumed. The RC file format is
       described in the html2textrc(5) manual page.


OPTIONS

       -help  Print command line summary and exit.

       -version
              Print program version and exit.

       -unparse
              This option is for diagnostic purposes: Instead  of
              formatting the parsed document, generate HTML code,
              that is guaranteed to be syntactically correct.  If
              html2text  has  problems  parsing  a  syntactically
              While scanning the HTML document, html2text reports
              on each lexical token scanned. This option  is  for
              diagnostic purposes.

       -debug-parser
              While scanning the HTML document, html2text reports
              on the tokens being shifted, rules  being  applied,
              etc. This option is for diagnostic purposes.

       -rcfile path
              Attempt  to  read  the file specified in path as RC
              file.

       -style ( compact | pretty )
              Style pretty changes some of the default values  of
              the  formatting  parameters documented in html2tex­
              trc(5).  To find out which and how  the  formatting
              parameter  defaults  are  changed,  check  the file
              "pretty.style". If this option  is  omitted,  style
              compact is assumed as default.

       -width width
              By  default,  html2text  formats the HTML documents
              for a screen width of 79 characters. If redirecting
              the  output  into a file, or if your terminal has a
              width other than 80 characters, or if you just want
              to  get  an  idea  how  html2text  deals with large
              tables and different terminal widths, you may  want
              to specify a different width.

       -o output-file
              Write the output to output-file instead of standard
              output. A dash as the output-file is  an  alternate
              way to specify the standard output.

       -nobs  By  default,  html2text  renders underlined letters
              with sequences  like  "underscore-backspace-charac­
              ter"   and   boldface   letters   like  "character-
              backspace-character", which  works  fine  when  the
              output  is piped into more(1), less(1), or similar.
              For other applications,  or  when  redirecting  the
              output into a file, it may be desirable not to ren­
              der  character  attributes  with   such   backspace
              sequences, which can be specified with this command
              line option.


FILES

       /etc/html2textrc
              System wide parser configuration file.

       $HOME/.html2textrc
              Personal parser configuration file,  overrides  the


RESTRICTIONS

       html2text  provides  only  a  basic  implementation of the
       Hypertext Transfer Protocol (HTTP). It requires  the  com­
       plete and exactly matching URI to be given as argument and
       will not follow redirections (HTTP 301/ 307).


AUTHOR

       html2text was written up to version 1.2.2 by  Arno  Unkrig
       <arno@unkrig.de> for GMRS Software GmbH, Unterschleißheim.

       Current maintainer and primary download location is:
       Martin Bayer <mbayer@zedat.fu-berlin.de>
       http://userpage.fu-berlin.de/~mbayer/tools/html2text.html


SEE ALSO

       html2textrc(5), less(1), more(1)

                            2001-10-05               html2text(1)
  
Show your Support for the Linux Tutorial

Purchase one of the products from our new online shop. For each product you purchase, the Linux Tutorial gets a portion of the proceeds to help keep us going.


Login
Nickname

Password

Security Code
Security Code
Type Security Code


Don't have an account yet? You can create one. As a registered user you have some advantages like theme manager, comments configuration and post comments with your name.

Help if you can!


Amazon Wish List

Did You Know?
The Linux Tutorial welcomes your suggestions and ideas.


Friends



Tell a Friend About Us

Bookmark and Share



Web site powered by PHP-Nuke

Is this information useful? At the very least you can help by spreading the word to your favorite newsgroups, mailing lists and forums.
All logos and trademarks in this site are property of their respective owner. The comments are property of their posters. Articles are the property of their respective owners. Unless otherwise stated in the body of the article, article content (C) 1994-2013 by James Mohr. All rights reserved. The stylized page/paper, as well as the terms "The Linux Tutorial", "The Linux Server Tutorial", "The Linux Knowledge Base and Tutorial" and "The place where you learn Linux" are service marks of James Mohr. All rights reserved.
The Linux Knowledge Base and Tutorial may contain links to sites on the Internet, which are owned and operated by third parties. The Linux Tutorial is not responsible for the content of any such third-party site. By viewing/utilizing this web site, you have agreed to our disclaimer, terms of use and privacy policy. Use of automated download software ("harvesters") such as wget, httrack, etc. causes the site to quickly exceed its bandwidth limitation and are therefore expressly prohibited. For more details on this, take a look here

PHP-Nuke Copyright © 2004 by Francisco Burzi. This is free software, and you may redistribute it under the GPL. PHP-Nuke comes with absolutely no warranty, for details, see the license.
Page Generation: 0.04 Seconds