Translating a website: Tool for downloading hundreds of files and counting words
Thread poster: Rajan Chopra
Sebastian Witte
Sebastian Witte  Identity Verified
Germany
Local time: 06:08
Member (2004)
English to German
+ ...
A really good piece of freeware Dec 5, 2010

http://www.softpedia.com/get/Internet/Offline-Browsers/HTTrack-Website-Copier.shtml

I have successfully used it to download entire sites and would say it is not that difficult to handle.


 
Pablo Bouvier
Pablo Bouvier  Identity Verified
Local time: 06:08
German to Spanish
+ ...
Translating a website: Tool for downloading hundreds of files and counting words Dec 6, 2010

langclinic wrote:

Hi friends,

A translation agency wants me to translate a website that contains dozens of links, and every link leads to many more links and sub-links. If I download the files one by one, it will take a great deal of time. Is there any method to download all files available on a website (Word files, PDF files, HTML pages, scanned pages, etc.) in a quick and convenient manner?

Secondly, counting the words in MS Word files is no problem, as I can go to Tools and check the word count, but is there any tool to count words in PDF files, HTML pages, etc.? Counting them manually would take a lot of time.

Thanks in advance for your precious help.

Regards,

Chopra


Although I agree with what colleagues have already said, perhaps you should give Aquino WebBudget a try to see whether it could be one of your options...
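For the word-count part of the question, here is a rough sketch in Python of how the words in downloaded HTML pages could be counted with the standard library only. It is not how WebBudget or any other tool mentioned in this thread works; the names `VisibleTextCounter`, `count_words_in_html` and `count_words_in_folder` are made up for illustration, and the definition of a "word" (letters or digits, optionally joined by an apostrophe or hyphen) is an assumption that will differ from what a CAT tool reports. Text in attributes such as `alt` or `title` is not counted.

```python
import re
from html.parser import HTMLParser
from pathlib import Path

# Assumption: a "word" is a run of letters/digits, optionally joined by ' or -
WORD = re.compile(r"\w+(?:['’-]\w+)*")


class VisibleTextCounter(HTMLParser):
    """Counts words in text nodes, skipping <script> and <style> content."""

    SKIPPED = {"script", "style"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.words = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.words += len(WORD.findall(data))


def count_words_in_html(html_text):
    """Return the number of words in the text content of one HTML document."""
    parser = VisibleTextCounter()
    parser.feed(html_text)
    parser.close()
    return parser.words


def count_words_in_folder(folder):
    """Sum the word counts of all .html/.htm files below a folder."""
    total = 0
    for path in Path(folder).rglob("*"):
        if path.is_file() and path.suffix.lower() in {".html", ".htm"}:
            text = path.read_text(encoding="utf-8", errors="replace")
            total += count_words_in_html(text)
    return total
```

Calling `count_words_in_folder("downloaded_site")` on a hypothetical folder of saved pages gives a first estimate only. PDF and scanned pages need a different approach, and repeated menus and footers will inflate the total.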




[Edited at 2010-12-06 00:35 GMT]


 
Heiko Schmieder (X)
Heiko Schmieder (X)  Identity Verified
Germany
Local time: 06:08
English to German
+ ...
HTML, CMS > what does the client need? Dec 6, 2010

Hello,

I just want to add some thoughts on the topic of website translation. It does not always make sense to 'download' the whole site, as you don't know what system is behind it. If the client uses a CMS, the translated HTML, PHP or other files might not be useful, because he or she has to transfer the content manually anyway. So my first question would be: do you work with a CMS? If they do use a CMS, you might next ask whether it is possible to export the content, e.g. via XML or any other format.

Website translations are not quite trivial in terms of technology, so I would clarify this matter before downloading the whole site. With a CMS there won't be any Word files or drafts of any kind, because it's all written directly into the CMS...

Besides the technology, there can also be the issue of translating PHP with Trados TagEditor, which can be a problem because TagEditor in its standard configuration adds or deletes some code, which makes the PHP useless (at least that was my experience).



Good luck and best,


Heiko


 
Ramon Somoza
Ramon Somoza  Identity Verified
Spain
Local time: 06:08
Dutch to Spanish
+ ...
A word of warning Dec 6, 2010

The agency does not seem very competent, just telling you to "translate the website".

You should keep in mind that the owner could update the website while you're translating it. Apart from the fact that you would be trying to translate a moving target, you could also receive a complaint that you have not translated everything.

In addition, if the website is for example dynamic and you download the pages, the translation will be useless to the end customer if you translate the individual pages, as such pages do not exist on the customer's site and they will not be able to easily match the text to the pages. Same if it is a CMS.

Now, how to proceed:

- If the website is static (plain HTML, PDFs, etc.), request that they send you the full package. The whole site is hosted on a server, so getting the full set is not a problem; the webmaster should be able to do it easily. If the resulting set of files is too big for e-mail, let them zip or rar it and use a service such as yousendit to forward you the full set. You can return it in the same way.

- If the site is dynamic (say, with a database), let them send you the pages with static text AND an export of the database (say, in Excel). Return exactly what they have sent you, so that they can import it again.

- If the site is based on a CMS (Content Management System), ask for either an export or access to the CMS, if it allows side-by-side languages.
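The "return exactly what they have sent you" point for database exports can be sketched as a small check run before delivery. This is only an illustration under assumptions: it supposes the export was saved as CSV rather than Excel, that it has an `id` column identifying each row and a `text` column holding the translatable content, and the function names are hypothetical. Real exports will have other column names.

```python
import csv
import io


def load_export(csv_text):
    """Parse a CSV export into (column names, {row id: row dict})."""
    reader = csv.DictReader(io.StringIO(csv_text))
    rows = {row["id"]: row for row in reader}
    return reader.fieldnames, rows


def round_trip_problems(original_csv, translated_csv, text_column="text"):
    """List structural differences that could break the client's re-import."""
    orig_cols, orig_rows = load_export(original_csv)
    new_cols, new_rows = load_export(translated_csv)
    problems = []
    if orig_cols != new_cols:
        problems.append(f"columns changed: {orig_cols} -> {new_cols}")
    for row_id in sorted(orig_rows.keys() - new_rows.keys()):
        problems.append(f"row {row_id} is missing")
    for row_id in sorted(new_rows.keys() - orig_rows.keys()):
        problems.append(f"row {row_id} was not in the original")
    for row_id in sorted(orig_rows.keys() & new_rows.keys()):
        for col in orig_cols:
            # Only the translatable column is allowed to differ.
            if col != text_column and orig_rows[row_id][col] != new_rows[row_id].get(col):
                problems.append(f"row {row_id}: column {col!r} was modified")
    return problems
```

An empty list means the row identifiers and all non-text columns survived the translation untouched, which is what the client needs for a clean import.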

However, I strongly advise against downloading the site unless you are specifically instructed to do so.


 
Ramon Somoza
Ramon Somoza  Identity Verified
Spain
Local time: 06:08
Dutch to Spanish
+ ...
TagEditor and PHP Dec 6, 2010

Heiko Schmieder wrote:
Besides the technology, there can also be the issue of translating PHP with Trados TagEditor, which can be a problem because TagEditor in its standard configuration adds or deletes some code, which makes the PHP useless (at least that was my experience).


Totally correct. And even if the code is not deleted, Trados TagEditor treats PHP code as a tag and prevents you from editing it. That is no problem if it is pure code, but if it has embedded text then you're in trouble.
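To see in advance whether a PHP file has text embedded in its code, something like the following Python sketch could be run over the files. It is a crude heuristic, not a PHP parser and not anything TagEditor does: it assumes code sits in `<?php ... ?>` or `<?= ... ?>` blocks, lists every quoted string containing at least one letter, and will also report file names, array keys and strings inside comments. The function name is made up for illustration.

```python
import re

# Assumption: PHP code sits in <?php ... ?> or <?= ... ?> blocks; an
# unclosed block at the end of the file is also accepted.
PHP_BLOCK = re.compile(r"<\?(?:php\b|=)(.*?)(?:\?>|\Z)", re.DOTALL)

# Single- or double-quoted string literals, allowing backslash escapes.
STRING_LITERAL = re.compile(
    r"'((?:\\.|[^'\\])*)'|\"((?:\\.|[^\"\\])*)\"", re.DOTALL
)


def embedded_text_candidates(php_source):
    """Return string literals inside PHP blocks that contain a letter."""
    found = []
    for block in PHP_BLOCK.finditer(php_source):
        for literal in STRING_LITERAL.finditer(block.group(1)):
            value = literal.group(1)
            if value is None:
                value = literal.group(2)
            if re.search(r"[^\W\d_]", value):
                found.append(value)
    return found
```

Every hit still has to be reviewed by hand. The point is only to find out early which files contain text that a tag-protecting editor would hide from the translator.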

Some time ago I wrote a post about whether TagEditor was adequate for website translation (See http://www.seo-translator.com/is-trados-tageditor-adequate-for-website-translation/ ), and the response was that it was "somewhat" adequate.


 
Laurent KRAULAND (X)
Laurent KRAULAND (X)  Identity Verified
France
Local time: 06:08
French to German
+ ...
"Somewhat adequate" Dec 6, 2010

Ramon Somoza wrote:

Some time ago I wrote a post about whether TagEditor was adequate for website translation (See http://www.seo-translator.com/is-trados-tageditor-adequate-for-website-translation/ ), and the response was that it was "somewhat" adequate.


Indeed... I can think of at least one CAT which handles PHP (somewhat) better.


 
Rajan Chopra
Rajan Chopra
India
Local time: 09:38
Member (2008)
English to Hindi
+ ...
TOPIC STARTER
Thanks to all of you for your valuable advice Dec 7, 2010

I am thankful to all of you for suggesting links to sites that help with downloading files and counting words.

Heiko Schmieder wrote:

So my first question would be - do you work with CMS? Next you might ask if they use CMS if there is the possibility to export content - e.g. via XML or any other format.

Heiko


I am required to provide them with Word files, which they will process further and upload to the website.

Regards,

Chopra


 
Ramon Somoza
Ramon Somoza  Identity Verified
Spain
Local time: 06:08
Dutch to Spanish
+ ...
Oh, no.... Dec 7, 2010

langclinic wrote:
I am required to provide them with Word files, which they will process further and upload to the website.


Urgggh... another website owner & language agency that don't know shit about website translation or multilingual SEO. Word, for heaven's sake!


 
Laurent KRAULAND (X)
Laurent KRAULAND (X)  Identity Verified
France
Local time: 06:08
French to German
+ ...
Final responsibility Dec 7, 2010

Ramon Somoza wrote:

langclinic wrote:
I am required to provide them with Word files, which they will process further and upload to the website.


Urgggh... another website owner & language agency that don't know shit about website translation or multilingual SEO. Word, for heaven's sake!


The final responsibility is with the client, so what?

I certainly hate this low-key approach as much as you do, Ramon, but sometimes we only have to shrug and look the other way. Like in "You can lead a horse to the river..."

[Edited at 2010-12-07 20:20 GMT]


 