![]() | Home | Products | Services | Support | Contact us |
This page lets you control how WIC will search for images. The following options are available.

This option lets you specify a web address (URL) to a web site or a page within a web site. Images found on that web site will be saved to your new collection. If you want to collect images from one web site, a web site address such as: http://www.datafire.com
This option lets you specify a web address (URL) to a web site or to an image, and you can include incrementing alpha or numeric values. For example the following URLs are in sequence:
http://www.test.com/001/images.html http://www.test.com/002/images.html http://www.test.com/003/images.html
To have WIC crawl these pages, enter the following URL.
http://www.test.com/[001-003]/images.html
Here is another example, this time, the following URLs are for images in sequence:
http://www.test.com/images/example_a.jpg http://www.test.com/images/example_b.jpg http://www.test.com/images/example_c.jpg
To have WIC download these images, enter the URL
http://www.test.com/images/example_[a-c].jpg
The following options can be placed within the "[]" brackets:
Numerical: (you can pad as many leading zeros as needed) [0-999999999] [01-992] [000032-000039] Alpha: (you can start and end at any letter of the alphabet) [a-z] [A-Z] [a-Z] Multiple: (you can combine as many sequence indicators as needed) http://www.test.com/[02-18]/images/[A-Z]/example_[d-x].jpg
This option lets you specify keywords to seach the entire web for related images. You can specify one or more keywords based on the images you would like to find. You can use standard search engine tokens around keywords such as:
"Pitbull puppies" (exact phrase) +Puppies -pitbull (include / exclude) +Puppies +Pitbull (match any)
This is the path on your computer where you would like to store the downloaded images. You can type in the path in the directory field or click the browse button to select a directory.

This option controls how images are saved on your computer. When this option is disabled, or unchecked, images are stored in the specified directory. Otherwise, if this option is enabled, subfolders are created in the specified directory to mirror the layout of the web site images came from.
For example, if this option is enabled, and WIC finds images at the following sites:
http://www.somesite.com/images/beach/index.html http://www.somesite.com/images/picnic/index.html The following folders will be created within your specified directory: www.somesite.com/images/beach/ www.somesite.com/images/picnic/
This is where you can specify a username and password to automatically login to membership based sites. This feature works with basic authentication, simple web based forms, and AVS membership sites.

A web site that uses basic authentication will always prompt you for a username and password using a popup window from your browser. For these sites, use the login name and password fields.
A web site that simple web based forms will have a username and password form on the web page and it does NOT contain an image with a number or phrase to type in as a passcode. For these sites, use the login name and password fields.
A web site that uses AVS (adult verification system) membership such as Adult Check, Adult Pass and others will have a single ID field. For these sites, use the AVS field.
This is where you control how WIC follows links found on the web.

The Page links option controls how WIC processes links to web pages. The Image links option controls how WIC processes links to images. The Link depth option is used in combination with the Page Links - Follow all links to web sites setting. It controls how many web sites to follow based on the originating link.
This option restricts WIC from following web links. This is useful when you need to collect images from one web page an no others. This option is not useful for keyword based searches.
This option allows WIC to process only the links belong to the main web site you are searching. For example, if you are searching www.datafire.com and WIC finds a link to an external web site, such as www.java.com, then WIC will ignore that link. This option is not useful for keyword based searches.
This option allows WIC to collect images only on the web site you are searching. For example, if you are searching www.datafire.com and an image on the site is at www.java.com, the image will be skipped.
This option will allow WIC to collect images from all web links found. If you are searching by keywords, this is the best option to use. For example, if you are searching www.datafire.com and an image on the site is at www.java.com, the image will be downloaded.
This option is used in combination with the Page Links - Follow all links to web sites setting. The depth is the maximum number of chained URLs that WIC will follow based on domains.
For example, if you are searching www.datafire.com, and you selected for Page Links - Follow all links to web sites setting the following may happen:
WIC searches www.datafire.com and finds a link to www.java.com WIC searches www.java.com and finds a link to www.netbeans.org The link at www.netbeans.org is a link depth of 2 because it is 2 links underneath the original search URL of www.datafire.com.
Because web sites may link to other external web sites, WIC can end up processing an infinite number of web sites. Using the link depth option will restrict WIC from following every web link and focus more on the top level sites of related interest.
These are the options for processing duplicate images. An image is a duplicate if there is already an image of the same file name downloaded to the target folder.

This option will force WIC to download and overwrite duplicate images
This is where you control the option of image file size filtering. This option allows you to set a minimum and or a maximum file size in kilobytes (KB). When images are downloading, they are compared to these options and if the file size does not meet the minimum or maximum settings, the image is not downloaded.

This is the smallest allowable file size in KB. Images smaller than the number here will not download. If this field is blank, this option is ignored.
This is where you control the filter of including and excluding web pages by matching keywords or regular expressions (regex) against web page URLs.

When a keyword or regex matches any URL, that web page is either included or excluded in the image search. Using this filter can speed up the image collecting process by skipping past pages that have nothing to do with your search interest. It is possible to use both the include and exclude list together for a more powerful search.
Keywords are not case sensitive and should be separated with a semicolon ; character.
This is a list of keywords separated by the ';' character. All URLs that contain any of the keywords in the list will be included in the search. All other URLs will be rejected.
This is a list of keywords separated by the ';' character. All URLs that contain any of the keywords in the list will be excluded from the search.
For example, assume that you have
A URL Include List of: puppies;bulldogs A URL Exclude List of: thumbnails;kittens
The following URLs would be included:
http://www.test.com/puppies/index.html http://www.test.com/puppies/bulldogs/index.html
The following URLs would be excluded:
http://www.test.com/dogs/index.html http://www.test.com/puppies/thumbnails/index.html http://www.test.com/puppies/bulldogs/kittens.html
Regular expressions (regex) are special text that describes a pattern to match within text or in this case within a URL.
For example, assume that you have
A URL Include List of: .*?$(?<!\.aspx)
This regex pattern will match all URLs that do not contain .aspx. The following URLs would be included:
http://www.test.com/puppies/index.html http://www.test.com/puppies/bulldogs/index.html
The following URLs would be excluded:
http://www.test.com/dogs/index.aspx http://www.test.com/puppies/getimage.aspx?id=25
This is where you control the filter of including and excluding images by matching keywords or regular expressions (regex) against image URLs.

When a keyword or regex matches any URL, that image is either downloaded or skipped. It is possible to use both the include and exclude list together for a more powerful search.
Keywords are not case sensitive and should be separated with a semicolon ; character.
This is a list of keywords separated by the ';' character. All image URLs that contain any of the keywords in the list will be included in the search. All other image URLs will be rejected.
This is a list of keywords separated by the ';' character. All image URLs that contain any of the keywords in the list will be excluded from the search.
For example, assume that you have
A URL Include List of: puppies;bulldogs A URL Exclude List of: thumbnails;kittens
The following URLs would be included:
http://www.test.com/puppies/mutts.jpg http://www.test.com/puppies/bulldogs/aaa01.jpg
The following URLs would be excluded:
http://www.test.com/dogs/pitbulls_kittens.jpg http://www.test.com/puppies/thumbnails/mutts.jpg
Regular expressions (regex) are special text that describes a pattern to match within text or in this case within a URL.
For example, assume that you have
A URL Include List of: .*?$(?<!\.aspx)
This regex pattern will match all URLs that do not contain .aspx. The following URLs would be included:
http://www.test.com/puppies/pugs.png http://www.test.com/puppies/bulldogs/newborn.jpg
The following URLs would be excluded:
http://www.test.com/dogs/index.aspx http://www.test.com/puppies/getimage.aspx?id=25
This is where you control the filter of including or excluding images based on the width and height of the image. This page includes a size list for specific image dimensions to filter, and it includes a minimum and maximum dimension filter. As images are downloaded to your computer, they are compared to the options on this page to determine if the images should be downloaded or skipped.

This option will turn the filter on or off. If you disable the filter, all entries in the filter list are saved but ignored.
This option will include all images that have the exact width and height dimensions specified in the list. This option results in a very limited set of results.
This option will exclude all images that have the exact width and height dimensions specified in the list. This option results in a clean and more specific set of results. You can use this option to filter out common advertisements and more.
You can specify a minimum width, height, both or none. For example if you set a minimum width of 50 and a minimum height of 50, all images smaller than 50x50 are skipped. You can leave the heigth field empty and just specify a minimum width of 50 so that an image of 60x10 is downloaded. If you leave both width and hight fields empty, this filter is disabled.
You can specify a maximum width, height, both or none. For example if you set a maximum width of 500 and a maximum height of 500, all images larger than 500x500 are skipped. You can leave the heigth field empty and just specify a maximum width of 500 so that an image of 60x600 is downloaded. If you leave both width and hight fields empty, this filter is disabled.
This is where you control the filter of including or excluding images based on the file types.

In most cases you would not use this filter, however if for example you do not want to download any .bmp images, this filter will take care of skipping .bmp images. The case of the keywords is ignored, so you can enter .JPG or .jpg and have the same result.
This option will turn the filter on or off. If you disable the filter, all entries in the filter list are saved but ignored.
This option will include all images that contain one of the file extensions in the list. This option results in a very limited set of results.
This is where you control any specific HTTP headers sent to the web sites as WIC is searching.

This is an advanced feature which is not needed in most cases. There are no disable or enable switches for this option, therefore whatever entries you set here are sent to all web page and image requests.
For example, if you specify the header of User-Agent as Mozilla, all requests to web pages use the header entry - User-Agent Mozilla.