[//000000001]: # (fileutil\_traverse \- file utilities) [//000000002]: # (Generated from file 'traverse\.man' by tcllib/doctools with format 'markdown') [//000000003]: # (fileutil\_traverse\(n\) 0\.6 tcllib "file utilities")
[ Main Table Of Contents | Table Of Contents | Keyword Index | Categories | Modules | Applications ]
# NAME fileutil\_traverse \- Iterative directory traversal # Table Of Contents - [Table Of Contents](#toc) - [Synopsis](#synopsis) - [Description](#section1) - [OPTIONS](#section2) - [Warnings and Incompatibilities](#section3) - [Bugs, Ideas, Feedback](#section4) - [Keywords](#keywords) - [Category](#category) # SYNOPSIS package require Tcl 8\.3 package require fileutil::traverse ?0\.6? package require fileutil package require control [__::fileutil::traverse__ ?*objectName*? *path* ?*option* *value*\.\.\.?](#1) [__$traverser__ __command__ ?*arg arg \.\.\.*?](#2) [__$traverser__ __files__](#3) [__$traverser__ __foreach__ *filevar* *script*](#4) [__$traverser__ __next__ *filevar*](#5) # DESCRIPTION This package provides objects for the programmable traversal of directory hierarchies\. The main command exported by the package is: - __::fileutil::traverse__ ?*objectName*? *path* ?*option* *value*\.\.\.? The command creates a new traversal object with an associated global Tcl command whose name is *objectName*\. This command may be used to invoke various operations on the traverser\. If the string __%AUTO%__ is used as the *objectName* then a unique name will be generated by the package itself\. Regarding the recognized options see section [OPTIONS](#section2)\. Note that all these options can be set only during the creation of the traversal object\. Changing them later is not possible and causes errors to be thrown if attempted\. The object command has the following general form: * __$traverser__ __command__ ?*arg arg \.\.\.*? *Command* and its *arg*uments determine the exact behavior of the object\. The following commands are possible for traversal objects: - __$traverser__ __files__ This method is the most highlevel one provided by traversal objects\. When invoked it returns a list containing the names of all files and directories matching the current configuration of the traverser\. - __$traverser__ __foreach__ *filevar* *script* The highlevel __files__ method \(see above\) is based on this mid\-level method\. When invoked it finds all files and directories matching per the current configuration and executes the *script* for each path\. The current path under consideration is stored in the variable named by *filevar*\. Both variable and script live / are executed in the context of the caller of the method\. In the method __files__ the script simply saves the found paths into the list to return\. - __$traverser__ __next__ *filevar* This is the lowest possible interface to the traverser, the core all higher methods are built on\. When invoked it returns a boolean value indicating whether it found a path matching the current configuration \(__True__\), or not \(__False__\)\. If a path was found it is stored into the variable named by *filevar*, in the context of the caller\. The __foreach__ method simply calls this method in a loop until it returned __False__\. This method is exposed so that we are also able to incrementally traverse a directory hierarchy in an event\-based manner\. Note that the traverser does follow symbolic links, except when doing so would cause it to enter a link\-cycle\. In other words, the command takes care to *not* lose itself in infinite loops upon encountering circular link structures\. Note that even links which are not followed will still appear in the result\. # OPTIONS - __\-prefilter__ command\_prefix This callback is executed for directories\. Its result determines if the traverser recurses into the directory or not\. The default is to always recurse into all directories\. The callback is invoked with a single argument, the *absolute* path of the directory, and has to return a boolean value, __True__ when the directory passes the filter, and __False__ if not\. - __\-filter__ command\_prefix This callback is executed for all paths\. Its result determines if the current path is a valid result, and returned by __next__\. The default is to accept all paths as valid\. The callback is invoked with a single argument, the *absolute* path to check, and has to return a boolean value, __True__ when the path passes the filter, and __False__ if not\. - __\-errorcmd__ command\_prefix This callback is executed for all paths the traverser has trouble with\. Like being unable to change into them, get their status, etc\. The default is to ignore any such problems\. The callback is invoked with a two arguments, the *absolute* path for which the error occured, and the error message\. Errors thrown by the filter callbacks are handled through this callback too\. Errors thrown by the error callback itself are not caught and ignored, but allowed to pass to the caller, i\.e\. however invoked the __next__\. Any other results from the callback are ignored\. # Warnings and Incompatibilities - __0\.4\.4__ In this version the traverser's broken system for handling symlinks was replaced with one working correctly and properly enumerating all the legal non\-cyclic paths under a base directory\. While correct this means that certain pathological directory hierarchies with cross\-linked sym\-links will now take about O\(n\*\*2\) time to enumerate whereas the original broken code managed O\(n\) due to its brokenness\. A concrete example and extreme case is the "/sys" hierarchy under Linux where some hundred devices exist under both "/sys/devices" and "/sys/class" with the two sub\-hierarchies linking to the other, generating millions of legal paths to enumerate\. The structure, reduced to three devices, roughly looks like /sys/class/tty/tty0 --> ../../dev/tty0 /sys/class/tty/tty1 --> ../../dev/tty1 /sys/class/tty/tty2 --> ../../dev/tty1 /sys/dev/tty0/bus /sys/dev/tty0/subsystem --> ../../class/tty /sys/dev/tty1/bus /sys/dev/tty1/subsystem --> ../../class/tty /sys/dev/tty2/bus /sys/dev/tty2/subsystem --> ../../class/tty When having to handle such a pathological hierarchy it is recommended to use the __\-prefilter__ option to prevent the traverser from following symbolic links, like so: package require fileutil::traverse proc NoLinks {fileName} { if {[string equal [file type $fileName] link]} { return 0 } return 1 } fileutil::traverse T /sys/devices -prefilter NoLinks T foreach p { puts $p } T destroy # Bugs, Ideas, Feedback This document, and the package it describes, will undoubtedly contain bugs and other problems\. Please report such in the category *fileutil* of the [Tcllib Trackers](http://core\.tcl\.tk/tcllib/reportlist)\. Please also report any ideas for enhancements you may have for either package and/or documentation\. When proposing code changes, please provide *unified diffs*, i\.e the output of __diff \-u__\. Note further that *attachments* are strongly preferred over inlined patches\. Attachments can be made by going to the __Edit__ form of the ticket immediately after its creation, and then using the left\-most button in the secondary navigation bar\. # KEYWORDS [directory traversal](\.\./\.\./\.\./\.\./index\.md\#directory\_traversal), [traversal](\.\./\.\./\.\./\.\./index\.md\#traversal) # CATEGORY Programming tools