CiTR Repository

sgrep - a tool to search a file for a structured pattern


 
Object_Id               : 21
Title                   : sgrep - a tool to search a file for a structured pattern
Node_Type               : Production
Format                  : HTML
Address                 :  http://www.cs.helsinki.fi/~jjaakkol/sgrep.html
Keywords                : grep
                          pattern
                          tool
Abstract                : Sgrep (structured grep) is a tool for searching text files and
                          filtering text streams for structured criteria. Sgrep implements
                          a query language based on so called region expressions.
                          
                          Like grep, sgrep can be used for any kind of text 
                          files. However it
                          is most useful for text files containing some kind of structured text.
                          A file containing structured text could be defined as a file, which
                          obeys some syntax. Examples of structured text files are 
                          SGML, HTML, C, Tex 
                          and mail files.
                          
                          (see below for more information.)
                          
                          
                          
                          
Date_Entry_Created      : 08/08/96
Entry_Created_By        : Paul Foster
Date_Entry_Last_Modified: 18/09/96
Entry_Last_Modified_By  : jane
Asset_Qualification     : looks promising
Other_Opinions          : 
Contact_Person          : Paul Foster (within CiTR); Jani Jaakkola ( mailto:jjaakkol@cs.helsinki.fi)
IP_or_Copyright_Owner   : University of Helsinki, Finland
Support_re_Ownership    : see:  http://www.cs.helsinki.fi/~jjaakkol/sgrep.html
Date_Item_Created       : 
Usage_Conditions        : Distributed under the GNU General Public Licence.
Support_re_Usage_Condns : see:  http://www.cs.helsinki.fi/~jjaakkol/sgrep.html

                          
                          (should also find the GNU licence conditions!)
Authors_and_Contributors: Jani Jaakkola ( mailto:jjaakkol@cs.helsinki.fi)

                          Pekka Kilpel╠inen ( mailto:kilpelai@cs.helsinki.fi)

Projects_Used_In        : none
Obtained_From           :  ftp://ftp.cs.helsinki.fi/pub/Software/Local/Sgrep/
Derived_From            : 
Related_Items           : 
Code_Type               : simple tool
Programming_Language    : C
Portations              : SunOS 5.4 sparc
                          Linux 1.3.85 alpha
                          Linux 1.2.13 intel, a.out binaries
                          Linux 1.2.13 intel, elf binaries
                          HP-UX 9000/735
                          OSF1 alpha
                          
                          It has been reported to run also on
                                  SGI/Irix 5.2
Source_Location         : /home/foster/tools/sgrep-0.99.tar.g
Binaries_Location       : 
Issues_Bug_etc_List_Locn: 
Additional_Info         : From the announcement (where from?):
                          
                          If you have ever wondered how to:
                          
                           o Locate only TITLE and H1 .. H9 elements from HTML documents
                           o Remove all  tags from an HTML document
                           o Rename all B elements to STRONG elements
                           o Find 
                          out how many FIG elements there are under SUBPARA
                             elements but not under PARA elements in your SGML file
                           o Print out the TITLE elements from a set of HTML documents
                             in which word 'SGML' is mentioned more than 12 times, or
                             
                          which contain word SGML inside H1 or H2 @ ╔▄elements.
                           o Find out mail senders of mail messages from a set of mail
                             files, which contain word 'SGML' in the subject line, do
                             not contain 'HTML' in the body of the mail, are sent in 
                          year
                             1996 and are not sent from address flame@hot.com
                          
                          then sgrep is a tool for you.
                          
                          Sgrep is distributed under the GNU General Public License.
                          
                          We have put up some WWW-pages on sgrep at
                          
                           
                           http://www.cs.helsinki.fi/~jjaakkol/sgrep.html