Hey there RudiC!
Sorry for not answering earlier and, as you'll see, I deleted all the of the "https" from my reply as the forum doesn't let me post URL until I have at least 5 posts.
You're right, I don't expect people to crawl though the site. I'm sure to understand what you mean though. :(
To get the data, you need to generate a listing through this link:
$ ://www.dotmed.com/equipment/2/92/1209/all/offset/0/all?key=&limit=20&price_sort=descending&cond=all&continent_filter=0&zip=&distance=5&att_1=0&att_row_num=1&additionalkeywords=&country=ES
The URL is pretty easy to adapt and I think that I adapted it to my current needs. This will get a Densitometer equipment listing and afterwards I could easily adapt the URL myself to get to the other equipments (as the structure is the same across all equipments).
A few comments on the link itself though:
&limit=20
Obviously limits the output to 20 equipments. I am using 20 right now so that the requests are fast and easy but I change it to 200 to get much more informations and listings afterwards
&price_sort=descending
I'm mostly interest in listing where the price is mentioned, so I decided to sort by descending prices so that I get the listings with prices first (more relevant to me).
&country=ES
I chose Spain as a filter, but it's not much of a relevance. I'd rather have EU listing first which is why I chose Spain.
Now back to the command:
With "curl" I'm getting the listing (I could import that listing locally into an HTML file but since that's not the objective, I get right away with the grep command).
The grep then lists the links available for the listing I specified and that's it for now.
The expected part:
The remaining of the info I mentionned earlier is now located in each URL.
What I need to do now is, based on the previous "grep":
0 <a href="/listing/bone-densitometer/ge/lunar-dpx/2299124"> view more </a>
<a href="/listing/bone-densitometer/osteosys/dexxum-t/2299556"> view more </a>
<a href="/listing/bone-densitometer/hologic/discovery-c/1184884"> view more </a>
<a href="/listing/bone-densitometer/ge/prodigy/1184904"> view more </a>
<a href="/listing/bone-densitometer/ge/lunar-idxa/2246457"> view more </a>
<a href="/listing/bone-densitometer/ge/lunar-prodigy/1668884"> view more </a>
<a href="/listing/bone-densitometer/hologic/qdr-4500-elite/1738541"> view more </a>
<a href="/listing/bone-densitometer/hologic/discovery-c/1405820"> view more </a>
<a href="/listing/bone-densitometer/alara/metriscan/653936"> view more </a>
<a href="/listing/bone-densitometer/sunlight/omnisense-7000s/470081"> view more </a>
<a href="/listing/bone-densitometer/hologic/delphi-c/99115"> view more </a>
<a href="/listing/bone-densitometer/lunar/dpx-nt/2310470"> view more </a>
<a href="/listing/bone-densitometer/hologic/qdr-4500/2219929"> view more </a>
<a href="/listing/bone-densitometer/norland/excell/1184892"> view more </a>
<a href="/listing/bone-densitometer/ge/lunar-dpx-duo/875678"> view more </a>
<a href="/listing/bone-densitometer/ge/lunar-dpx-nt/2284643"> view more </a>
<a href="/listing/bone-densitometer/hologic/discovery-qdr-10041/2257994"> view more </a>
<a href="/listing/bone-densitometer/sunlight/mini-omni-por/2183339"> view more </a>
<a href="/listing/bone-densitometer/ge/lunar-dpx-bravo/2225055"> view more </a>
for each link (for instance, starting with the first on in my list:
<a href="/listing/bone-densitometer/ge/lunar-dpx/2299124"> view more </a>
, go and get:
The price:
$ curl ://www.dotmed.com//listing/bone-densitometer/osteosys/dexxum-t/2299556 | fgrep -e "id=\"price"
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
43 38179 43 16384 0 0 16698 0 0:00:02 --:--:-- 0:00:02 16684"<ul><li class="left">Price:</li><li class="right" id="price"><span itemprop='price' content='19990.00'>$19,990.00 <span itemprop='currency'>USD</span> <a style='font-size: 5pt' href='#' title='Convert the Currency' onClick='javascript:window.open("/listings/currency.html?amount=19990.00¤cy_from=USD", "listing", config="height=200,width=500,toolbar=no,menubar=no,scrollbars=yes,resizable=no,location=no,directories=no,status=yes"); return false;'>[convert]</a></span></li></ul>
The condition:
$ curl ://www.dotmed.com//listing/bone-densitometer/osteosys/dexxum-t/2299556 | fgrep -e "id=\"condition"
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
43 38179 43 16384 0 0 9241 0 0:00:04 0:00:01 0:00:03 9240 <ul><li class="left">Condition:</li><li class="right" id="condition"><span itemprop='condition' content='new'>New</span></li></ul>
The date_updaed:
$ curl ://www.dotmed.com//listing/bone-densitometer/osteosys/dexxum-t/2299556 | fgrep -e "id=\"date_updated"
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
0 38179 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0 <ul><li class="left">Date updated:</li><li class="right" id="date_updated">December 09, 2016</li></ul>
Obviously, my objective is to try to generate a loop that will get me this info for each link in the listing, see how I could clean up the info and send it to CSV or any other similar file to stock the information.o
I hope that some of this long post contains the info you were looking for? If not I apologize and please, if you could detail a little more, that'd be great!
Thanks again and as usual!
Ardzii