Object_Id : 21 Title : sgrep - a tool to search a file for a structured pattern Node_Type : Production Format : HTML Address : http://www.cs.helsinki.fi/~jjaakkol/sgrep.html Keywords : grep pattern tool Abstract : Sgrep (structured grep) is a tool for searching text files and filtering text streams for structured criteria. Sgrep implements a query language based on so called region expressions. Like grep, sgrep can be used for any kind of text files. However it is most useful for text files containing some kind of structured text. A file containing structured text could be defined as a file, which obeys some syntax. Examples of structured text files are SGML, HTML, C, Tex and mail files. (see below for more information.) Date_Entry_Created : 08/08/96 Entry_Created_By : Paul Foster Date_Entry_Last_Modified: 18/09/96 Entry_Last_Modified_By : jane Asset_Qualification : looks promising Other_Opinions : Contact_Person : Paul Foster (within CiTR); Jani Jaakkola ( mailto:jjaakkol@cs.helsinki.fi) IP_or_Copyright_Owner : University of Helsinki, Finland Support_re_Ownership : see: http://www.cs.helsinki.fi/~jjaakkol/sgrep.html Date_Item_Created : Usage_Conditions : Distributed under the GNU General Public Licence. Support_re_Usage_Condns : see: http://www.cs.helsinki.fi/~jjaakkol/sgrep.html (should also find the GNU licence conditions!) Authors_and_Contributors: Jani Jaakkola ( mailto:jjaakkol@cs.helsinki.fi) Pekka KilpelÌinen ( mailto:kilpelai@cs.helsinki.fi) Projects_Used_In : none Obtained_From : ftp://ftp.cs.helsinki.fi/pub/Software/Local/Sgrep/ Derived_From : Related_Items : Code_Type : simple tool Programming_Language : C Portations : SunOS 5.4 sparc Linux 1.3.85 alpha Linux 1.2.13 intel, a.out binaries Linux 1.2.13 intel, elf binaries HP-UX 9000/735 OSF1 alpha It has been reported to run also on SGI/Irix 5.2 Source_Location : /home/foster/tools/sgrep-0.99.tar.g Binaries_Location : Issues_Bug_etc_List_Locn: Additional_Info : From the announcement (where from?): If you have ever wondered how to: o Locate only TITLE and H1 .. H9 elements from HTML documents o Remove all tags from an HTML document o Rename all B elements to STRONG elements o Find out how many FIG elements there are under SUBPARA elements but not under PARA elements in your SGML file o Print out the TITLE elements from a set of HTML documents in which word 'SGML' is mentioned more than 12 times, or which contain word SGML inside H1 or H2 @ ÉÜelements. o Find out mail senders of mail messages from a set of mail files, which contain word 'SGML' in the subject line, do not contain 'HTML' in the body of the mail, are sent in year 1996 and are not sent from address flame@hot.com then sgrep is a tool for you. Sgrep is distributed under the GNU General Public License. We have put up some WWW-pages on sgrep at http://www.cs.helsinki.fi/~jjaakkol/sgrep.html