{"id":10428,"date":"2021-09-28T09:59:56","date_gmt":"2021-09-28T09:59:56","guid":{"rendered":"https:\/\/linuxways.net\/?p=10428"},"modified":"2021-09-28T09:59:56","modified_gmt":"2021-09-28T09:59:56","slug":"understanding-httrack-advanced-configurations-on-ubuntu","status":"publish","type":"post","link":"https:\/\/linuxways.net\/de\/ubuntu\/understanding-httrack-advanced-configurations-on-ubuntu\/","title":{"rendered":"Understanding HTTrack Advanced Configurations on Ubuntu"},"content":{"rendered":"<h2>Introduction<\/h2>\n<p>HTTrack is a unique piece of software to extract static pages from the web. In this guide, I am going to walk you through advanced configurations on Ubuntu 20.04 LTS. I will show you how to use various settings of HTTrack to extract any particular page for development purposes. HTTrack has enormous benefits for web developers to maintain a clean echo-system of their web applications. It helps them to mitigate any front-end problems. I am using the Ubuntu 20.04 LTS version for this guide.<\/p>\n<h2>Installing HTTrack<\/h2>\n<p>If you haven\u2019t installed HTTrack, then open the command-line interface to apply the following commands.<\/p>\n<pre><strong>$ sudo apt install httrack webhttrack<\/strong><\/pre>\n<p>HTTrack is only available as a web app for Linux operating systems. It can be used as standalone software on Mac and Windows, but it is not the case for us.<\/p>\n<h2>Running HTTrack<\/h2>\n<p>Once installed you will run it via the command line as it is the only option you have.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"879\" height=\"220\" class=\"wp-image-10429\" src=\"http:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-481.png\" srcset=\"https:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-481.png 879w, https:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-481-300x75.png 300w, https:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-481-768x192.png 768w\" sizes=\"auto, (max-width: 879px) 100vw, 879px\" \/><\/p>\n<p>When you run HTTrack then it will look something like this:<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"769\" height=\"459\" class=\"wp-image-10430\" src=\"http:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-482.png\" srcset=\"https:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-482.png 769w, https:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-482-300x179.png 300w, https:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-482-501x300.png 501w\" sizes=\"auto, (max-width: 769px) 100vw, 769px\" \/><\/p>\n<p>Now is the time to work with the advanced configurations of HTTrack.<\/p>\n<h2>Configure HTTrack on Ubuntu<\/h2>\n<h3><strong>STEP 1. Select a language<\/strong><\/h3>\n<p>HTTrack prompts you to select a language first. If English is the default language then you do not need to worry about it. Otherwise, select an appropriate language and move ahead.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"640\" height=\"415\" class=\"wp-image-10431\" src=\"http:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-483.png\" srcset=\"https:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-483.png 640w, https:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-483-300x195.png 300w\" sizes=\"auto, (max-width: 640px) 100vw, 640px\" \/><\/p>\n<h3><strong>Step 2. Enter project details<\/strong><\/h3>\n<p>Now I am going to add project details. The data comes from LinuxWays.Net as shown below.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"702\" height=\"365\" class=\"wp-image-10432\" src=\"http:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-484.png\" srcset=\"https:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-484.png 702w, https:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-484-300x156.png 300w\" sizes=\"auto, (max-width: 702px) 100vw, 702px\" \/><\/p>\n<h3><strong>Step 3.<\/strong> <strong>Select Action and Add URLs<\/strong><\/h3>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"575\" height=\"545\" class=\"wp-image-10433\" src=\"http:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-485.png\" srcset=\"https:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-485.png 575w, https:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-485-300x284.png 300w\" sizes=\"auto, (max-width: 575px) 100vw, 575px\" \/><\/p>\n<p>Now I am going to select an action out of the given list and add URLs as shown above. It depends on what I want to achieve. Here is how each of the actions is different than one another.<\/p>\n<p><strong>Download web site(s)<\/strong> This option will copy a full website and will help you to browse it locally.<\/p>\n<p><strong>Download web site(s) + questions <\/strong>This action will do the same as the previous one, but it will also download any URL which works with a query string.<\/p>\n<p><strong>Get individual files<\/strong> This will download all files separately. It means <strong>.css<\/strong>, <strong>.html<\/strong>, and the rest of the available files on the server.<\/p>\n<p><strong>Download all sites in pages (multiple mirrors)<\/strong> This downloads all the sites available on a single server at once.<\/p>\n<p><strong>Test links in pages (bookmark test)<\/strong> Depending on what we want to test on our website, this action will help us to test links on a particular page.<\/p>\n<p>The remaining two configurations are supposed to continue an interrupted action.<\/p>\n<h3><strong>Step 4. My Test Case<\/strong><\/h3>\n<p>In my test case, I am going to select <strong>Get Individual Files<\/strong>. Here is how it looks now.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"533\" height=\"534\" class=\"wp-image-10434\" src=\"http:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-486.png\" srcset=\"https:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-486.png 533w, https:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-486-300x300.png 300w, https:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-486-150x150.png 150w\" sizes=\"auto, (max-width: 533px) 100vw, 533px\" \/><\/p>\n<p>I will input a URL here which is <a href=\"https:\/\/linuxways.net\/de\/\">https:\/\/linuxways.net<\/a>.<\/p>\n<h3><strong>Step 5. Enter URL<\/strong><\/h3>\n<p>Now I will help you with URL and credentials. Add required details as shown below.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"607\" height=\"406\" class=\"wp-image-10435\" src=\"http:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-487.png\" srcset=\"https:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-487.png 607w, https:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-487-300x201.png 300w\" sizes=\"auto, (max-width: 607px) 100vw, 607px\" \/><\/p>\n<h3><strong>Step 6. Add Settings<\/strong><\/h3>\n<p>Click <strong>OK<\/strong> to add settings and set any options as required as shown below.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"533\" height=\"540\" class=\"wp-image-10436\" src=\"http:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-488.png\" srcset=\"https:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-488.png 533w, https:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-488-296x300.png 296w\" sizes=\"auto, (max-width: 533px) 100vw, 533px\" \/><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"570\" height=\"480\" class=\"wp-image-10437\" src=\"http:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-489.png\" srcset=\"https:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-489.png 570w, https:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-489-300x253.png 300w\" sizes=\"auto, (max-width: 570px) 100vw, 570px\" \/><\/p>\n<h3><strong>Step 7. Last Step \u2013 Get Ready to Mirror<\/strong><\/h3>\n<p>In this step, I am ready to mirror my selected website. However, for the test case, I will save the settings and exit.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"562\" height=\"573\" class=\"wp-image-10438\" src=\"http:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-490.png\" srcset=\"https:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-490.png 562w, https:\/\/linuxways.net\/wp-content\/uploads\/2021\/09\/word-image-490-294x300.png 294w\" sizes=\"auto, (max-width: 562px) 100vw, 562px\" \/><\/p>\n<h2>Conclusion<\/h2>\n<p>In this article, I walked you through every aspect of HTTrack settings. Now you are ready to mirror any website using HTTrack on Ubuntu 20.04 Linux distribution. In case of any issue, do not hesitate to reach us.<\/p>","protected":false},"excerpt":{"rendered":"<p>Introduction HTTrack is a unique piece of software to extract static pages from the web. In this guide, I am going to walk you through advanced configurations on&hellip;<\/p>","protected":false},"author":1,"featured_media":10618,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[2],"tags":[543,31],"class_list":["post-10428","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ubuntu","tag-httrack","tag-ubuntu"],"_links":{"self":[{"href":"https:\/\/linuxways.net\/de\/wp-json\/wp\/v2\/posts\/10428","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/linuxways.net\/de\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/linuxways.net\/de\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/linuxways.net\/de\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/linuxways.net\/de\/wp-json\/wp\/v2\/comments?post=10428"}],"version-history":[{"count":0,"href":"https:\/\/linuxways.net\/de\/wp-json\/wp\/v2\/posts\/10428\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/linuxways.net\/de\/wp-json\/wp\/v2\/media\/10618"}],"wp:attachment":[{"href":"https:\/\/linuxways.net\/de\/wp-json\/wp\/v2\/media?parent=10428"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/linuxways.net\/de\/wp-json\/wp\/v2\/categories?post=10428"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/linuxways.net\/de\/wp-json\/wp\/v2\/tags?post=10428"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}