[//000000001]: # (fileutil \- file utilities) [//000000002]: # (Generated from file 'fileutil\.man' by tcllib/doctools with format 'markdown') [//000000003]: # (fileutil\(n\) 1\.16\.1 tcllib "file utilities")
[ Main Table Of Contents | Table Of Contents | Keyword Index | Categories | Modules | Applications ]
# NAME fileutil \- Procedures implementing some file utilities # Table Of Contents - [Table Of Contents](#toc) - [Synopsis](#synopsis) - [Description](#section1) - [Warnings and Incompatibilities](#section2) - [Bugs, Ideas, Feedback](#section3) - [Keywords](#keywords) - [Category](#category) # SYNOPSIS package require Tcl 8 package require fileutil ?1\.16\.1? [__::fileutil::lexnormalize__ *path*](#1) [__::fileutil::fullnormalize__ *path*](#2) [__::fileutil::test__ *path* *codes* ?*msgvar*? ?*label*?](#3) [__::fileutil::cat__ \(?*options*? *file*\)\.\.\.](#4) [__::fileutil::writeFile__ ?*options*? *file* *data*](#5) [__::fileutil::appendToFile__ ?*options*? *file* *data*](#6) [__::fileutil::insertIntoFile__ ?*options*? *file* *at* *data*](#7) [__::fileutil::removeFromFile__ ?*options*? *file* *at* *n*](#8) [__::fileutil::replaceInFile__ ?*options*? *file* *at* *n* *data*](#9) [__::fileutil::updateInPlace__ ?*options*? *file* *cmd*](#10) [__::fileutil::fileType__ *filename*](#11) [__::fileutil::find__ ?*basedir* ?*filtercmd*??](#12) [__::fileutil::findByPattern__ *basedir* ?__\-regexp__|__\-glob__? ?__\-\-__? *patterns*](#13) [__::fileutil::foreachLine__ *var filename cmd*](#14) [__::fileutil::grep__ *pattern* ?*files*?](#15) [__::fileutil::install__ ?__\-m__ *mode*? *source* *destination*](#16) [__::fileutil::stripN__ *path* *n*](#17) [__::fileutil::stripPwd__ *path*](#18) [__::fileutil::stripPath__ *prefix* *path*](#19) [__::fileutil::jail__ *jail* *path*](#20) [__::fileutil::touch__ ?__\-a__? ?__\-c__? ?__\-m__? ?__\-r__ *ref\_file*? ?__\-t__ *time*? *filename* ?*\.\.\.*?](#21) [__::fileutil::tempdir__](#22) [__::fileutil::tempdir__ *path*](#23) [__::fileutil::tempdirReset__](#24) [__::fileutil::tempfile__ ?*prefix*?](#25) [__::fileutil::maketempdir__ ?__\-prefix__ *str*? ?__\-suffix__ *str*? ?__\-dir__ *str*?](#26) [__::fileutil::relative__ *base* *dst*](#27) [__::fileutil::relativeUrl__ *base* *dst*](#28) # DESCRIPTION This package provides implementations of standard unix utilities\. - __::fileutil::lexnormalize__ *path* This command performs purely lexical normalization on the *path* and returns the changed path as its result\. Symbolic links in the path are *not* resolved\. Examples: fileutil::lexnormalize /foo/./bar => /foo/bar fileutil::lexnormalize /foo/../bar => /bar - __::fileutil::fullnormalize__ *path* This command resolves all symbolic links in the *path* and returns the changed path as its result\. In contrast to the builtin __file normalize__ this command resolves a symbolic link in the last element of the path as well\. - __::fileutil::test__ *path* *codes* ?*msgvar*? ?*label*? A command for the testing of several properties of a *path*\. The properties to test for are specified in *codes*, either as a list of keywords describing the properties, or as a string where each letter is a shorthand for a property to test\. The recognized keywords, shorthands, and associated properties are shown in the list below\. The tests are executed in the order given to the command\. The result of the command is a boolean value\. It will be true if and only if the *path* passes all the specified tests\. In the case of the *path* not passing one or more test the first failing test will leave a message in the variable referenced by *msgvar*, if such is specified\. The message will be prefixed with *label*, if it is specified\. *Note* that the variabled referenced by *msgvar* is not touched at all if all the tests pass\. * *r*ead __file readable__ * *w*rite __file writable__ * *e*xists __file exists__ * e*x*ec __file executable__ * *f*ile __file isfile__ * *d*ir __file isdirectory__ - __::fileutil::cat__ \(?*options*? *file*\)\.\.\. A tcl implementation of the UNIX __[cat](\.\./\.\./\.\./\.\./index\.md\#cat)__ command\. Returns the contents of the specified file\(s\)\. The arguments are files to read, with interspersed options configuring the process\. If there are problems reading any of the files, an error will occur, and no data will be returned\. The options accepted are __\-encoding__, __\-translation__, __\-eofchar__, and __\-\-__\. With the exception of the last all options take a single value as argument, as specified by the tcl builtin command __fconfigure__\. The __\-\-__ has to be used to terminate option processing before a file if that file's name begins with a dash\. Each file can have its own set of options coming before it, and for anything not specified directly the defaults are inherited from the options of the previous file\. The first file inherits the system default for unspecified options\. - __::fileutil::writeFile__ ?*options*? *file* *data* The command replaces the current contents of the specified *file* with *data*, with the process configured by the options\. The command accepts the same options as __::fileutil::cat__\. The specification of a non\-existent file is legal and causes the command to create the file \(and all required but missing directories\)\. - __::fileutil::appendToFile__ ?*options*? *file* *data* This command is like __::fileutil::writeFile__, except that the previous contents of *file* are not replaced, but appended to\. The command accepts the same options as __::fileutil::cat__ - __::fileutil::insertIntoFile__ ?*options*? *file* *at* *data* This comment is similar to __::fileutil::appendToFile__, except that the new data is not appended at the end, but inserted at a specified location within the file\. In further contrast this command has to be given the path to an existing file\. It will not create a missing file, but throw an error instead\. The specified location *at* has to be an integer number in the range __0__ \.\.\. \[file size *file*\]\. __0__ will cause insertion of the new data before the first character of the existing content, whereas \[file size *file*\] causes insertion after the last character of the existing content, i\.e\. appending\. The command accepts the same options as __::fileutil::cat__\. - __::fileutil::removeFromFile__ ?*options*? *file* *at* *n* This command is the complement to __::fileutil::insertIntoFile__, removing *n* characters from the *file*, starting at location *at*\. The specified location *at* has to be an integer number in the range __0__ \.\.\. \[file size *file*\] \- *n*\. __0__ will cause the removal of the new data to start with the first character of the existing content, whereas \[file size *file*\] \- *n* causes the removal of the tail of the existing content, i\.e\. the truncation of the file\. The command accepts the same options as __::fileutil::cat__\. - __::fileutil::replaceInFile__ ?*options*? *file* *at* *n* *data* This command is a combination of __::fileutil::removeFromFile__ and __::fileutil::insertIntoFile__\. It first removes the part of the contents specified by the arguments *at* and *n*, and then inserts *data* at the given location, effectively replacing the removed by content with *data*\. All constraints imposed on *at* and *n* by __::fileutil::removeFromFile__ and __::fileutil::insertIntoFile__ are obeyed\. The command accepts the same options as __::fileutil::cat__\. - __::fileutil::updateInPlace__ ?*options*? *file* *cmd* This command can be seen as the generic core functionality of __::fileutil::replaceInFile__\. It first reads the contents of the specified *file*, then runs the command prefix *cmd* with that data appended to it, and at last writes the result of that invokation back as the new contents of the file\. If the executed command throws an error the *file* is not changed\. The command accepts the same options as __::fileutil::cat__\. - __::fileutil::fileType__ *filename* An implementation of the UNIX __[file](\.\./\.\./\.\./\.\./index\.md\#file)__ command, which uses various heuristics to guess the type of a file\. Returns a list specifying as much type information as can be determined about the file, from most general \(eg, "binary" or "text"\) to most specific \(eg, "gif"\)\. For example, the return value for a GIF file would be "binary graphic gif"\. The command will detect the following types of files: directory, empty, binary, text, script \(with interpreter\), executable elf, executable dos, executable ne, executable pe, graphic gif, graphic jpeg, graphic png, graphic tiff, graphic bitmap, html, xml \(with doctype if available\), message pgp, binary pdf, text ps, text eps, binary gravity\_wave\_data\_frame, compressed bzip, compressed gzip, compressed zip, compressed tar, audio wave, audio mpeg, and link\. It further detects doctools, doctoc, and docidx documentation files, and tklib diagrams\. - __::fileutil::find__ ?*basedir* ?*filtercmd*?? An implementation of the unix command __[find](\.\./\.\./\.\./\.\./index\.md\#find)__\. Adapted from the Tcler's Wiki\. Takes at most two arguments, the path to the directory to start searching from and a command to use to evaluate interest in each file\. The path defaults to "\.", i\.e\. the current directory\. The command defaults to the empty string, which means that all files are of interest\. The command takes care *not* to lose itself in infinite loops upon encountering circular link structures\. The result of the command is a list containing the paths to the interesting files\. The *filtercmd*, if specified, is interpreted as a command prefix and one argument is added to it, the name of the file or directory find is currently looking at\. Note that this name is *not* fully qualified\. It has to be joined it with the result of __pwd__ to get an absolute filename\. The result of *filtercmd* is a boolean value that indicates if the current file should be included in the list of interesting files\. Example: # find .tcl files package require fileutil proc is_tcl {name} {return [string match *.tcl $name]} set tcl_files [fileutil::find . is_tcl] - __::fileutil::findByPattern__ *basedir* ?__\-regexp__|__\-glob__? ?__\-\-__? *patterns* This command is based upon the __TclX__ command __recursive\_glob__, except that it doesn't allow recursion over more than one directory at a time\. It uses __::fileutil::find__ internally and is thus able to and does follow symbolic links, something the __TclX__ command does not do\. First argument is the directory to start the search in, second argument is a list of *patterns*\. The command returns a list of all files reachable through *basedir* whose names match at least one of the patterns\. The options before the pattern\-list determine the style of matching, either regexp or glob\. glob\-style matching is the default if no options are given\. Usage of the option __\-\-__ stops option processing\. This allows the use of a leading '\-' in the patterns\. - __::fileutil::foreachLine__ *var filename cmd* The command reads the file *filename* and executes the script *cmd* for every line in the file\. During the execution of the script the variable *var* is set to the contents of the current line\. The return value of this command is the result of the last invocation of the script *cmd* or the empty string if the file was empty\. - __::fileutil::grep__ *pattern* ?*files*? Implementation of __[grep](\.\./\.\./\.\./\.\./index\.md\#grep)__\. Adapted from the Tcler's Wiki\. The first argument defines the *pattern* to search for\. This is followed by a list of *files* to search through\. The list is optional and __stdin__ will be used if it is missing\. The result of the procedures is a list containing the matches\. Each match is a single element of the list and contains filename, number and contents of the matching line, separated by a colons\. - __::fileutil::install__ ?__\-m__ *mode*? *source* *destination* The __install__ command is similar in functionality to the __install__ command found on many unix systems, or the shell script distributed with many source distributions \(unix/install\-sh in the Tcl sources, for example\)\. It copies *source*, which can be either a file or directory to *destination*, which should be a directory, unless *source* is also a single file\. The ?\-m? option lets the user specify a unix\-style mode \(either octal or symbolic \- see __file attributes__\. - __::fileutil::stripN__ *path* *n* Removes the first *n* elements from the specified *path* and returns the modified path\. If *n* is greater than the number of components in *path* an empty string is returned\. The number of components in a given path may be determined by performing __llength__ on the list returned by __file split__\. - __::fileutil::stripPwd__ *path* If, and only if the *path* is inside of the directory returned by \[__pwd__\] \(or the current working directory itself\) it is made relative to that directory\. In other words, the current working directory is stripped from the *path*\. The possibly modified path is returned as the result of the command\. If the current working directory itself was specified for *path* the result is the string "__\.__"\. - __::fileutil::stripPath__ *prefix* *path* If, and only of the *path* is inside of the directory "prefix" \(or the prefix directory itself\) it is made relative to that directory\. In other words, the prefix directory is stripped from the *path*\. The possibly modified path is returned as the result of the command\. If the prefix directory itself was specified for *path* the result is the string "__\.__"\. - __::fileutil::jail__ *jail* *path* This command ensures that the *path* is not escaping the directory *jail*\. It always returns an absolute path derived from *path* which is within *jail*\. If *path* is an absolute path and already within *jail* it is returned unmodified\. An absolute path outside of *jail* is stripped of its root element and then put into the *jail* by prefixing it with it\. The same happens if *path* is relative, except that nothing is stripped of it\. Before adding the *jail* prefix the *path* is lexically normalized to prevent the caller from using __\.\.__ segments in *path* to escape the jail\. - __::fileutil::touch__ ?__\-a__? ?__\-c__? ?__\-m__? ?__\-r__ *ref\_file*? ?__\-t__ *time*? *filename* ?*\.\.\.*? Implementation of __[touch](\.\./\.\./\.\./\.\./index\.md\#touch)__\. Alter the atime and mtime of the specified files\. If __\-c__, do not create files if they do not already exist\. If __\-r__, use the atime and mtime from *ref\_file*\. If __\-t__, use the integer clock value *time*\. It is illegal to specify both __\-r__ and __\-t__\. If __\-a__, only change the atime\. If __\-m__, only change the mtime\. *This command is not available for Tcl versions less than 8\.3\.* - __::fileutil::tempdir__ The command returns the path of a directory where the caller can place temporary files, such as "/tmp" on Unix systems\. The algorithm we use to find the correct directory is as follows: 1. The directory set by an invokation of __::fileutil::tempdir__ with an argument\. If this is present it is tried exclusively and none of the following item are tried\. 1. The directory named in the TMPDIR environment variable\. 1. The directory named in the TEMP environment variable\. 1. The directory named in the TMP environment variable\. 1. A platform specific location: * Windows "C:\\TEMP", "C:\\TMP", "\\TEMP", and "\\TMP" are tried in that order\. * \(classic\) Macintosh The TRASH\_FOLDER environment variable is used\. This is most likely not correct\. * Unix The directories "/tmp", "/var/tmp", and "/usr/tmp" are tried in that order\. The algorithm utilized is mainly that used in the Python standard library\. The exception is the first item, the ability to have the search overridden by a user\-specified directory\. - __::fileutil::tempdir__ *path* In this mode the command sets the *path* as the first and only directory to try as a temp\. directory\. See the previous item for the use of the set directory\. The command returns the empty string\. - __::fileutil::tempdirReset__ Invoking this command clears the information set by the last call of \[__::fileutil::tempdir__ *path*\]\. See the last item too\. - __::fileutil::tempfile__ ?*prefix*? The command generates a temporary file name suitable for writing to, and the associated file\. The file name will be unique, and the file will be writable and contained in the appropriate system specific temp directory\. The name of the file will be returned as the result of the command\. The code was taken from [http://wiki\.tcl\.tk/772](http://wiki\.tcl\.tk/772), attributed to Igor Volobouev and anon\. - __::fileutil::maketempdir__ ?__\-prefix__ *str*? ?__\-suffix__ *str*? ?__\-dir__ *str*? The command generates a temporary directory suitable for writing to\. The directory name will be unique, and the directory will be writable and contained in the appropriate system specific temp directory\. The name of the directory will be returned as the result of the command\. The three options can used to tweak the behaviour of the command: * __\-prefix__ str The initial, fixed part of the directory name\. Defaults to __tmp__ if not specified\. * __\-suffix__ str The fixed tail of the directory\. Defaults to the empty string if not specified\. * __\-dir__ str The directory to place the new directory into\. Defaults to the result of __fileutil::tempdir__ if not specified\. The initial code for this was supplied by [Miguel Martinez Lopez](mailto:aplicacionamedida@gmail\.com)\. - __::fileutil::relative__ *base* *dst* This command takes two directory paths, both either absolute or relative and computes the path of *dst* relative to *base*\. This relative path is returned as the result of the command\. As implied in the previous sentence, the command is not able to compute this relationship between the arguments if one of the paths is absolute and the other relative\. *Note:* The processing done by this command is purely lexical\. Symbolic links are *not* taken into account\. - __::fileutil::relativeUrl__ *base* *dst* This command takes two file paths, both either absolute or relative and computes the path of *dst* relative to *base*, as seen from inside of the *base*\. This is the algorithm how a browser resolves a relative link found in the currently shown file\. The computed relative path is returned as the result of the command\. As implied in the previous sentence, the command is not able to compute this relationship between the arguments if one of the paths is absolute and the other relative\. *Note:* The processing done by this command is purely lexical\. Symbolic links are *not* taken into account\. # Warnings and Incompatibilities - __1\.14\.9__ In this version __fileutil::find__'s broken system for handling symlinks was replaced with one working correctly and properly enumerating all the legal non\-cyclic paths under a base directory\. While correct this means that certain pathological directory hierarchies with cross\-linked sym\-links will now take about O\(n\*\*2\) time to enumerate whereas the original broken code managed O\(n\) due to its brokenness\. A concrete example and extreme case is the "/sys" hierarchy under Linux where some hundred devices exist under both "/sys/devices" and "/sys/class" with the two sub\-hierarchies linking to the other, generating millions of legal paths to enumerate\. The structure, reduced to three devices, roughly looks like /sys/class/tty/tty0 --> ../../dev/tty0 /sys/class/tty/tty1 --> ../../dev/tty1 /sys/class/tty/tty2 --> ../../dev/tty1 /sys/dev/tty0/bus /sys/dev/tty0/subsystem --> ../../class/tty /sys/dev/tty1/bus /sys/dev/tty1/subsystem --> ../../class/tty /sys/dev/tty2/bus /sys/dev/tty2/subsystem --> ../../class/tty The command __fileutil::find__ currently has no way to escape this\. When having to handle such a pathological hierarchy It is recommended to switch to package __fileutil::traverse__ and the same\-named command it provides, and then use the __\-prefilter__ option to prevent the traverser from following symbolic links, like so: package require fileutil::traverse proc NoLinks {fileName} { if {[string equal [file type $fileName] link]} { return 0 } return 1 } fileutil::traverse T /sys/devices -prefilter NoLinks T foreach p { puts $p } T destroy # Bugs, Ideas, Feedback This document, and the package it describes, will undoubtedly contain bugs and other problems\. Please report such in the category *fileutil* of the [Tcllib Trackers](http://core\.tcl\.tk/tcllib/reportlist)\. Please also report any ideas for enhancements you may have for either package and/or documentation\. When proposing code changes, please provide *unified diffs*, i\.e the output of __diff \-u__\. Note further that *attachments* are strongly preferred over inlined patches\. Attachments can be made by going to the __Edit__ form of the ticket immediately after its creation, and then using the left\-most button in the secondary navigation bar\. # KEYWORDS [cat](\.\./\.\./\.\./\.\./index\.md\#cat), [file utilities](\.\./\.\./\.\./\.\./index\.md\#file\_utilities), [grep](\.\./\.\./\.\./\.\./index\.md\#grep), [temp file](\.\./\.\./\.\./\.\./index\.md\#temp\_file), [test](\.\./\.\./\.\./\.\./index\.md\#test), [touch](\.\./\.\./\.\./\.\./index\.md\#touch), [type](\.\./\.\./\.\./\.\./index\.md\#type) # CATEGORY Programming tools