HTTrack Website Copier release history: -------------------------------------- This file lists all changes and fixes that have been made for HTTrack. To do & Known bugs: (fixed soon) - Some interface problems the Windows release 3.20-2 + New: HTTPS support (SSL) + New: ipv6 support + New: 'longdesc' added + New: new file 'new.txt' generated for transfer status reports + New: ISO9660 compatibility option + New: empty mirror/update detection improved + New: Update hack now recognizes "imported" files + New: Option to disable ipv4/ipv6 + New: Filters now recognize patterns like -https://* + Fixed: The engine should be now fully reentrant + Fixed: Fixes for alpha and other 64-bit systems + Fixed: Files downloaded twice if not found in cache + Fixed: ftp problems with 2xx responses + Fixed: ftp problems with multiple lines responses + Fixed: ftp %20 not escaped anymore + Fixed: ftp RETR with quotes problems + Fixed: now tolerent to empty header responses + Fixed: hts-log closed + Fixed: Compressed pages during updates + Fixed: Crash when receiving empty compressed pages + Fixed: Random crashes in 'spider' mode + Fixed: bcopy/bzero not used anymore.. + Fixed: various code cleanups + Fixed: Better UTF8 detection + Fixed: External links now work with https and ftp + Fixed: Top index.html corrupted or missing + Fixed: URL list crashes + Fixed: Random crashes with large sites due to bogus naming handler + Fixed: Freezes on some robots.txt files + Fixed: Compressed files not stored + Fixed: SVG fixes + Fixed: Raw HTML responses + Fixed: 406 error workaround + Fixed: Crashes due to binary files with bogus HTML type (not parsed anymore) + Fixed: External https and ftp links broken, relative https links broken + Fixed: Automatic resizing of filter stack + Fixed: Various ampersand (&) elements added + Fixed: https with proxy temporary workaround (direct connection) + Fixed: "base href" with absolute uris + Fixed: stack frame too large on some systems + Fixed: random bad requests due to bogus authentication + Shell: Several fixes, including registration type problems + Shell: "template files not found" fixed 3.16-2 + Fixed: Zlib v1.4 + Fixed: Gzipped files now downloaded without problems (HTTP compression bug) + Fixed: Ending spaces in URLs now handled correctly + Fixed: META-HTTP bug + Shell: Type registration done only once 3.15 + Fixed: Bogus HTTP-referer with protected sites + Fixed: Fatal IO/socket error with large sites (handles not closed) + Fixed: K4 option now works + Fixed: --continue+URL(s) now clears previous URLs + Fixed: Parsing bug with 'www.foo.com?query' + Shell: 'Soft cancel' documented + Shell: 'Kx' options added 3.10 + Fixed: Broken pipes on Linux version + Fixed: Commandline version bug with gzipped files + Fixed: Crash when reaching compressed error pages + Fixed: Bogus html-escaped characters in query strings + Fixed: Files skipped (bogus anticipating system) + Fixed: Crash when showing stats (div by zero) + Fixed: Problems with URLs/redirects containing spaces or quotes + Fixed: Slash added when ~ detected + Fixed: Ugly VT terminal + New: Faster and cleaner mirror interrupt 3.09 + Fixed: Several problems with javascript parsing + Fixed: Elements after onXXX not parsed + New: Source update wrapper + New: Style url() and @import parsed + Shell: Word database and maximum number of links + Shell: Option changes taken in account immediately + Shell: Cleaner installer (registry keys) 3.08 + New: HTTP compression is now supported + New: Faster response analysis + Fixed: External page in html if cgi + Fixed: Mix between CR and CR/LF for comments + Fixed: Top index corrupted + Shell: Better refresh during parsing + Shell: DLL error 3.07 + Fixed: Random crashes with HTTP redirects + New: New rate limiter (should be sharper) + New: Code cleaned up, new htscore.c/httrack.c files 3.06 + Fixed: Redirect to https/mailto now supported + New: Top index/top dir for Un*x version + New: Sources more modular (.so) + New: Quicktime targetX= tags + New: HTTP 100 partially supported 3.05 + Fixed: Non-scannable tag parameters ("id","name",..) + Fixed: Java classes not found when using "." as separator + Fixed: Java classes not found when missing .class 3.04 + Fixed: URLs with starting spaces + Fixed: bogus URLs when using "base href" + Shell: --assume and -%e options included + New: Documentation updated a little 3.03 + New: Parser optimizations, 10 times faster now! + New: New --assume option to speed up cgi tests + New: Option to avoid Username/password storage for external pages + New: Query string kept for local URIs + Fixed: RFC2396 compliant URLs accepted (//foo and http:foo) + Fixed: foo@foo.com not considered as URL anymore + Fixed: Space encoded into %20 in URIs + Fixed: "Unable to save file" bug + Fixed: Corrupted top index.html + Fixed: Cookies disabled with --get + Fixed: Cache bug for error pages 3.02 + Fixed: Pages without title recorded in top index + Fixed: Error with Content-type+Content-disposition + Fixed: backblue.gif/external.html files not purged anymore + Fixed: Encoding problems with files containing %2F or other characters + Fixed: Write error reported for HTML files + New: hts-stop.lock file to pause the engine + New: New install system using InnoSetup 3.01 + New: HTTP real media files captured + Fixed: Bogus statitics + Fixed: Minor fixes 3.00 + New: New interface, with MANY improvements! + New: Better parsing (enhanced javascript parsing, sharper HTML parsing) + New: Faster and more efficient background download system + New: ETag properly handled + New: Optional URL list + New: Optionnal config file + New: New structure options + New: New filters options (size filters) + New: Better password site handling + New: Traffic control to avoid server overload + New: Setuid and Chroot for Unix release + New: limited 64-bit handling + New: .js files are now parsed + New: Single hts-log.txt file, error level + New: New top index.html design + New: "Update hack" option to prevent unnecessary updates + New: Default language sent for mirrors + New: Searchable index + Fixed: Bogus ftp routines (Linux version) + Fixed: Bug that caused to mirror a complete site from a subdir + Fixed: Bug that caused restart to be very slow + Fixed: Bug that caused loops on several query-string pages (?foo=/) + Fixed: Corrupted cache bug + Fixed: Random broken links (pages not downloaded) + Fixed: Shared links problems + Fixed: Bogus URLs with commas (,) + Fixed: Bogus / and \ mixed + Fixed: Bogus addresses with multiple @ + Fixed: Bogus links with %2E and %2F + Fixed: Bogus empty links + Fixed: "Unexpected backing error" bug fixed + Fixed: Files with incorrect size no more accepted + Fixed: Top index.html created even for untitled pages + Fixed: Bogus N100 option (unable to save file) + Fixed: Deadlock when using many hosts in URLs + Fixed: Password stored internally to avoid access errors + Fixed: Fixed /nul DOS limit + Fixed: Bogus -* filter (nothing mirrored) + Fixed: .shtml now renamed into .html + Fixed: Content-disposition without "" + Fixed: External html page for /foo links + Fixed: Username/password % compliant + Fixed: Javascript parser sometimes failed with " and ' mixed + Fixed: Some Range: bugs when regeting complete files + Fixed: Range: problems with html files + Fixed: HTTP/1.1 407 and 416 messages now handled + Fixed: Bogus timestamp + Fixed: Null chars in HTML bug + Fixed: Error pages cache bug + Fixed: Connect error/site moved do not delete everything anymore! + Fixed: Bogus garbage ../ in relative URL + Shell: New transfer rate estimation + Shell: Fixed crash when using verbose wizard + Shell: dynamic lang.h for easier translation updates + Shell: Fixed some options not passed to the engine + Fixed: A lots of minor fixes! 2.2 Note: 3.00 alpha major bug fixes are included in the 2.2 2.02 + New: Cache system improved, compatible with all platforms + New: Update process improved (accurate date) + New: Remote timestamp for files + New: ETag (HTTP/1.1) supported + Shell: Portugese interface available + Fixed: Bug with links containing commas + Fixed: 'file://' bug with proxy + New: Engine a little bit faster + Shell: Some bugs fixed in the interface 2.01 + New: ftp through proxy finally supported! + New: Sources cleaned up + New: Again some new marvelous options + New: Speed improved (links caught during parsing, faster "fast update") + New: Tool to catch "submit" URL (forms or complex javascript links) + Shell: German interface available + Shell: Dutch interface available + Shell: Polish interface available + Fixed: Level 1 bug fixed + Fixed: Still some parsing/structure problems + Fixed: Referer now sent to server + Fixed: Cookies did not work properly + Fixed: Problems with redirect pages + New: Better javascript parsing + Fixed: Problems with URL-parameters (foo.cgi?param=2&choice=1) + Fixed: Problems with ftp + New: ftp transfers are now in passive mode (firewall compliant) 2.00 -- The First Free Software Release of HTTrack! + New: HTTrack sources (command line), now free software, are given + Shell: Interface rewritten! + New: Documentation rewritten + Shell: Drag&Drop abilities + Shell: More URL informations + Shell: Fixed: Remote access problems + Fixed: Loop problems on some sites causing crashes + Fixed: URL encoding problems + Fixed: Some file access problems for ../ + Fixed: Some fixes for updating a mirror + Shell: Crazy progress bar fixed + Fixed: Form action are rewritten so that cgi on form can work from an offline mirror + Fixed: Crashes after continuing an "hand-interrupted" mirror + Fixed: Bogus files with some servers (chunk bug) 1.30 + Shell: Interface improved + New: robots.txt are followed by default + New: Parsing speed improved on big (>10,000 links) sites with an hash table + New: Mirror Link mode (mirror all links in a page) + New: Cookies are now understood + New: No external pages option (replace external html/gif by default files) + New: Command line version improved, background on Unix releases + Fixed: Problems with javascript parsing + Fixed: Username/password not set to lowercase anymore + Fixed: Problems with base href + New: Links in level 1 html files now patched + New: Expurge now deletes unused folders + New: Option -V executes shell command for every new file + Shell: Primary filter now works 1.24 + Fixed: Ftp protocol bogus (with login/pass) + Fixed: Cache problems (corrupted files) + New: Expurge old files for updates + New: "Updated" messages for mirror updates + Shell: Autodial/hangup option to RAS + Fixed: index.html were not created sometimes + Shell: Fixed: Random crashes with the interface + Shell: Fixed: Filters profile not saved + Fixed: Various (and numerous) fixes 1.23 + Shell: Interface improved + Shell: Multiple setups + Shell: Redefine options + Shell: Continue interrupted mirror improved 1.22 + Fixed: Parsing up/down did not work well + Fixed: Several files not catched, bugs fixes + Fixed: Problems with classes (1.21) + New: Transfer rate can be limited (-A option) + Shell: Smooth refresh + New: ftp basic protocol a little bit improved 1.21 + Fixed: Several java classes were not parsed + Fixed: Some folders without ending / ignored + Fixed: Crashes due to content-type too long 1.20 + Shell: documentation! + Fixed: Some problems with 'host cancel' system after timeouts (crashes) + New: Get only end of files if possible (file partially get) + New: New cache system (only HTML stored) + New: User-defined structure possible + New: Also available: french interface + Fixed: Random crashes (div by 0/illegal instruction) with null size files + New: Limited ftp protocol (files only), e.g. +ftp://* now works + Fixed: Some connect problems with several servers or proxies + New: New option, save html error report by default + Shell: Browse and see log files at the end of a mirror + New: Proxy authentication (ex: guest:star@myproxy.com:8080) + Shell: Interface improved (especially during mirror) + Fixed: Ambiguous files are renamed (asp,cgi->html/gif..) + Shell: New test link mode option + New: Site authentication (ex: guest:star@www.myweb.com/index.html) + Fixed: Minor bugs fixed + Shell: See log files during a mirror + Fixed: Some problems using CGI (different names now) + Fixed: Go down/up/both options and filters + Fixed: "Store html first" did not work + New: -F option ("Browser ID") disguise HTTrack into a browser + New: New filter system + Shell: New "Save as default" options + Fixed: "Build options" did NOT work properly! (files overwritten or missing) + Fixed: User agent ID fixed + Shell: Skip options + Shell: Better interface control during mirrors + Shell: InstallShield and Help files + Fixed: Some external links were not filtered sometimes + Fixed: Mirror crash at the end 1.16b + Shell: Really *stupid* bug fixed causing WinHTTrack to be slooow + Fixed: Crash if the first page has no title fixed + Fixed: Bogus options like "Just scan" saved empty files + Fixed: Forbid all links (*) with manual accept did not work + Shell: Filters interface improved 1.16: + New : Java Classes and subclasses are now retrieved! + New: Better JavaScripts parsing + New: Option: Abandon slowest hosts if timeout/transfer too slow + Shell: Interface improved 1.15b + Fixed: Some bugs fixed 1.15: + Shell: Interface improved + New: Robot improved (some files through javascript are now detected!) + New: Improved wild cards (for example, +www.*.com/*.zip) + New: 'config' file to configurate proxy, path.. only once 1.11 + New: Wait for specific time (begin transfer at specific hour) + New: Time limit option (stops transfer after x seconds) + Shell: Interface improved for an easy use 1.10e + Fixed: Maps were not correctly managed (stupid bug) 1.10d: + Fixed: Bogus index.html fixed 1.10c + Shell: "Time out" field needed "transfer rate" field 1.10b + Fixed: Better memory management 1.10 + New: "Transfer rate out" option added (abandon slowests sites) + New: "Deaf" hosts do not freeze HTTrack any more + Fixed: Again problems with code/codebase tags + New: Broken links detection improved 1.04 + Fixed:Some links were not correctly read (pages with "codebase" tags) + Shell: Interface improved 1.03 (No changes for the command-line robot) + Shell: Big bug fixed! (VERY slow transfer rates..) 1.02 + Fixed: Some java files were not correctly transfered + New: Speed has been improved + Fixed: Log file more accurate + Shell: Interface has been improved 1.01 + Fixed: Structure check error in some cases 1.00 -- The 1.00, Yeah! + New: base and codebase are now scanned 0.998 beta-2 + Fixed: Multiple name bug (files having the same name in the same directory) with -O option fixed 0.997 beta-2 + Fixed: Filenames with '%' were not correctly named + Fixed: Bug detected in 0.996: several files are not written on disk!! 0.996 beta-2 + New: -O option (path for mirror and log) + New: Unmodified file time/date are not changed during an update 0.99 beta-2 + New: User-agent field + New: Shortcuts (--spider etc.) + New: Links not retrieved are now rebuilt absolutly + New: The 'g' option (juste get files in current directory) has been added + New: Primary links analyste has been improved + Fixed: "304" bug fixed 0.25 beta-2 + Fixed: Freeze during several mirrors fixed! + New: More 'N' options (filenames type) 0.24 beta-2 + Fixed: Restart/Update with cache did not work (really not..) + Fixed: Wild cards now work properly (e.g. +www.abc.com* do works) + New: The 'n' option (get non-html files near a link) has been added! 0.23 beta-2 + Fixed: The 'M' option (site size) did not work + Fixed: Files larger than 65Kb were not correctly written older beta + Many, many bugs fixed