| Member |
Message |
zawam
Joined: Oct 13, 2005
# Posts: 90
|
Posted: 2007-May-07 21:37
Hey, this is a call for some help...
What I want to do is take the 15,000 static web pages I have built up over the years and make the content dynamic by moving it into a database. The pages are all identical in layout, but each one has unique content relating to a product.
To save time, I was thinking it should be possible to write a script, in PHP for example, that reads in each HTML file and outputs a CSV-type file I could load into MySQL, or that reads in all the files and outputs just the text, or something similar.
This is similar to my table
---------------------------
ID  NAME  TYPE  YEAR
1   DEMO  CAR   1992
---------------------------
HIGH PRICE     $12.12
LOW PRICE      $12.12
AVERAGE PRICE  $12.12
---------------------------
I am looking for a way to do this, so any ideas would be much appreciated. Thanks for any help.
PS - I have not been lazy; I have looked into this a little. I tried using PHP to read in a file and output it showing all the HTML code, and then select just the lines I want. My trouble is that I am not sure how to strip out the tags, and it doesn't really work at all. I am pretty much a novice.
Thanks once again
|
 |
Dinkar
Staff
Joined: Aug 12, 2001
# Posts: 4391
|
Posted: 2007-May-07 22:35
Open those HTML files in MS Excel and try saving them in .csv format.
|
 |
zawam
Joined: Oct 13, 2005
# Posts: 90
|
Posted: 2007-May-07 22:43
How would you open 15,000 files in Excel? Would you have to do them one by one, or create a macro?
Thanks for the advice
|
 |
Dinkar
Staff
Joined: Aug 12, 2001
# Posts: 4391
|
Posted: 2007-May-07 23:12
LOL, I forgot the number. That is too many files. I don't know whether a macro would help or not, but you could write a PHP/ASP script. Search the net for script help.
|
 |
Dinkar
Staff
Joined: Aug 12, 2001
# Posts: 4391
|
Posted: 2007-May-07 23:17
Or hire someone to do it manually. Try Elance.
|
 |
Prowler
Staff
Joined: Aug 14, 2000
# Posts: 1794
|
Posted: 2007-May-08 12:39
There are many ways to do this. If you just want to strip out the HTML tags, open the files in EditPlus in batches and use Edit -> HTML -> Strip html tags.
The easiest and ideal way is to use Perl, which is well suited to extracting and manipulating text. Look in script repositories for ready-made scripts that extract information based on some criteria.
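For anyone who wants to script the tag-stripping step rather than use an editor, here is a minimal sketch in Python (not the Perl suggested above, and not anyone's actual script from this thread). It uses only the standard library's html.parser module. The names TextExtractor and strip_tags are made up for illustration, and the choice to skip script and style content is an assumption about what counts as visible text.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects the text nodes of an HTML document, skipping script/style."""

    SKIP = {"script", "style"}

    def __init__(self):
        # convert_charrefs=True turns entities such as &amp; into characters.
        super().__init__(convert_charrefs=True)
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is not inside <script> or <style>.
        if self._skip_depth == 0:
            text = data.strip()
            if text:
                self.chunks.append(text)


def strip_tags(html):
    """Return the visible text of `html`, one text node per line."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.chunks)
```

Putting each text node on its own line keeps neighbouring table cells from running together, which makes it easier to pick out the lines you want afterwards.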
|
 |
zawam
Joined: Oct 13, 2005
# Posts: 90
|
Posted: 2007-May-09 14:18
Thanks for all the replies. I am still trying to do it myself with PHP...
But I am looking into the suggestions, and that text editor. I have a quote from a data entry person who is telling me it will cost around $5,000 USD. That's a little much; I was thinking of taking a zero off and maybe making the 5 a 2 =) but I will see if someone can write such a script within my budget.
However, I would rather put the cash into other things, SEO and marketing for one. So what would such a script be called?
The Perl option has come up. I thought of that too, but I have no experience with it. What are such scripts called?
I was thinking data mining, but that's not really what it is, as the data is not in a database already?
Thanks again for all the help. This is always the best place to come for help.
|
 |
david68
Joined: May 16, 2005
# Posts: 144
|
Posted: 2007-May-10 15:09
If you can find someone who will write a script that only you need and charge just $200, I'd be shocked.
Out of curiosity, what is the website? I'd like to see the format. PM me with it unless the mods would allow it to be posted, which they normally don't. I know enough PHP to possibly write that script, for less than $5k.
Also, stripping the HTML out is easy. See php.net/manual/en/function.strip-tags.php
|
 |
Curt
Joined: Eons Ago
# Posts: 3736
|
Posted: 2007-May-10 19:09
$5,000 is a bit high, but $200 is way too cheap. It also depends on what kind of format and information sorting you want the script to do. Sometimes custom scripts can take quite a while to create and test depending on complexity.
|
 |
zawam
Joined: Oct 13, 2005
# Posts: 90
|
Posted: 2007-May-10 22:34
Well, it's a static product database. I will post a page if that's OK...
I was looking at a fairly large freelance website, and there seem to be other fairly large projects up there for around $200, so I am confident that it would cost between 200 and 500 USD...
There are just 11 fields that need copying from 15,000 static product pages that all share the same layout...
I am going to post my project with the freelancers at $100 USD and above and see what comes back in terms of bids...
I am also looking at doing this myself. I am looking at the regular expressions I would need to create. I don't think it's going to be hard work so much as long and tedious work...
The products are aviation-related parts... I will send a PM if you want a look...
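As a rough illustration of the regular-expression approach mentioned above, here is a sketch in Python that pulls labelled fields out of pages sharing one layout and writes one CSV row per page. The markup is an assumption, since the real pages were never posted: it supposes each field sits in a two-cell table row, label first and value second. The field names come from the sample table earlier in the thread (only six of the 11 fields are known), and the names ROW_RE, extract_fields and pages_to_csv are hypothetical.

```python
import csv
import re

# ASSUMED markup: <tr><td>LABEL</td><td>VALUE</td></tr>.
# Adjust the pattern to whatever the real pages contain.
ROW_RE = re.compile(
    r"<tr[^>]*>\s*<td[^>]*>(?P<label>.*?)</td>\s*"
    r"<td[^>]*>(?P<value>.*?)</td>\s*</tr>",
    re.IGNORECASE | re.DOTALL,
)
TAG_RE = re.compile(r"<[^>]+>")

# Labels taken from the sample table in the first post; extend to all 11.
FIELDS = ["NAME", "TYPE", "YEAR", "HIGH PRICE", "LOW PRICE", "AVERAGE PRICE"]


def clean(fragment):
    """Drop nested tags and collapse whitespace inside one cell."""
    return " ".join(TAG_RE.sub("", fragment).split())


def extract_fields(html):
    """Return {label: value} for every known label found in the page."""
    found = {}
    for match in ROW_RE.finditer(html):
        label = clean(match.group("label")).upper()
        if label in FIELDS:
            found[label] = clean(match.group("value"))
    return found


def pages_to_csv(pages, out):
    """Write a header plus one CSV row per page; missing fields stay empty."""
    writer = csv.DictWriter(out, fieldnames=FIELDS, restval="")
    writer.writeheader()
    for html in pages:
        writer.writerow(extract_fields(html))
```

To run it over a whole folder, pass pages_to_csv the contents of each .html file in turn and an output file opened with newline="". The resulting CSV can then be imported into MySQL. Because every page shares one layout, a pattern that works on a handful of pages should carry over to the rest, but it is worth checking the output for rows with empty fields before importing.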
|
 |