PDA

View Full Version : Downloadable Horticulture database?


mfranklin@msfxdesigns.com
29-06-2007, 11:07 PM
Does anyone know where I can get a csv file of a horticulture database
for a project I'm working on? I'm trying to create an easy-to-use
online database to help me source plants for my design projects. The
project is www.landscapedesignexchange.com.

Any leads on data would be appreciated.

Thanks.

Michael

William Wagner
29-06-2007, 11:24 PM
In article om>,
wrote:

> Does anyone know where I can get a csv file of a horticulture database
> for a project I'm working on? I'm trying to create an easy-to-use
> online database to help me source plants for my design projects. The
> project is www.landscapedesignexchange.com.
>
> Any leads on data would be appreciated.
>
> Thanks.
>
> Michael

It has been awhile. But I believe the Microsoft product Excel can
import CSV.

Bill who loves JMP and wonders if this helps?

--

S Jersey USA Zone 5 Shade
http://www.ocutech.com/ High tech Vison aid
This article is posted under fair use rules in accordance with
Title 17 U.S.C. Section 107, and is strictly for the educational
and informative purposes. This material is distributed without profit.

JoeSpareBedroom
29-06-2007, 11:38 PM
"William Wagner" > wrote in message
...
> In article om>,
> wrote:
>
>> Does anyone know where I can get a csv file of a horticulture database
>> for a project I'm working on? I'm trying to create an easy-to-use
>> online database to help me source plants for my design projects. The
>> project is www.landscapedesignexchange.com.
>>
>> Any leads on data would be appreciated.
>>
>> Thanks.
>>
>> Michael
>
> It has been awhile. But I believe the Microsoft product Excel can
> import CSV.
>
> Bill who loves JMP and wonders if this helps?

He's looking for the data, not the program to import the data.

William Wagner
29-06-2007, 11:50 PM
In article >,
"JoeSpareBedroom" > wrote:

> "William Wagner" > wrote in message
>
> et...
> > In article om>,
> > wrote:
> >
> >> Does anyone know where I can get a csv file of a horticulture database
> >> for a project I'm working on? I'm trying to create an easy-to-use
> >> online database to help me source plants for my design projects. The
> >> project is www.landscapedesignexchange.com.
> >>
> >> Any leads on data would be appreciated.
> >>
> >> Thanks.
> >>
> >> Michael
> >
> > It has been awhile. But I believe the Microsoft product Excel can
> > import CSV.
> >
> > Bill who loves JMP and wonders if this helps?
>
> He's looking for the data, not the program to import the data.

Oh well here is 81400 hits.

http://www.google.com/search?q=csv%20horticulture%20database

Bill

--

S Jersey USA Zone 5 Shade
http://www.ocutech.com/ High tech Vison aid
This article is posted under fair use rules in accordance with
Title 17 U.S.C. Section 107, and is strictly for the educational
and informative purposes. This material is distributed without profit.

raycruzer
30-06-2007, 04:40 PM
On Jun 29, 2:07 pm, wrote:
> Does anyone know where I can get a csv file of a horticulture database
> for a project I'm working on? I'm trying to create an easy-to-use
> online database to help me source plants for my design projects. The
> project iswww.landscapedesignexchange.com.
>
> Any leads on data would be appreciated.
>
> Thanks.
>
> Michael

You can download your USDA Plants database selection in comma-
delimited text file format from here:
http://plants.nrcs.usda.gov/cgi_bin/topics.cgi?earl=advquery/adv_query_down.html
.. You can then convert to MS Excel by opening file with the program.

If other sources exist, you may find them on the EWIRM lists at
www.ergonica.com/Weed_Lists1.htm .

-----
Weeding wisdom: Mulch preventive, Twist defensive - Stow the Hoe
until you Sow!

mfranklin@msfxdesigns.com[_2_]
01-07-2007, 04:46 PM
Thanks for the posts. I have found a few. I had already looked at
the USDA csv, but it's over 200k plants, many of which are weeds or
food crop. It would be rather time consuming to go through and pick
out the plants significant to landscaping.


I'll check out the links you guys gave. Most of the stuff I have
found, people want to sell their database. I'm willing to trade the
information with anyone else who has info, so it's not like I'm only
looking to receive.

Thanks again for all the leads.

MIchael

On Jun 29, 4:07 pm, wrote:
> Does anyone know where I can get a csv file of a horticulturedatabase
> for a project I'm working on? I'm trying to create an easy-to-use
> onlinedatabaseto help me source plants for my design projects. The
> project iswww.landscapedesignexchange.com.
>
> Any leads on data would be appreciated.
>
> Thanks.
>
> Michael

JoeSpareBedroom
01-07-2007, 09:34 PM
Does the USDA file put genus, species and varietal name in 3 different
fields?

> wrote in message
oups.com...
> Thanks for the posts. I have found a few. I had already looked at
> the USDA csv, but it's over 200k plants, many of which are weeds or
> food crop. It would be rather time consuming to go through and pick
> out the plants significant to landscaping.
>
>
> I'll check out the links you guys gave. Most of the stuff I have
> found, people want to sell their database. I'm willing to trade the
> information with anyone else who has info, so it's not like I'm only
> looking to receive.
>
> Thanks again for all the leads.
>
> MIchael
>
> On Jun 29, 4:07 pm, wrote:
>> Does anyone know where I can get a csv file of a horticulturedatabase
>> for a project I'm working on? I'm trying to create an easy-to-use
>> onlinedatabaseto help me source plants for my design projects. The
>> project iswww.landscapedesignexchange.com.
>>
>> Any leads on data would be appreciated.
>>
>> Thanks.
>>
>> Michael
>
>

mfranklin@msfxdesigns.com[_2_]
02-07-2007, 12:34 AM
On Jul 1, 2:34 pm, "JoeSpareBedroom" > wrote:
> Does the USDA file put genus, species and varietal name in 3 different
> fields?
>
> > wrote in message
>
> oups.com...
>
> > Thanks for the posts. I have found a few. I had already looked at
> > the USDA csv, but it's over 200k plants, many of which are weeds or
> > food crop. It would be rather time consuming to go through and pick
> > out the plants significant to landscaping.
>
> > I'll check out the links you guys gave. Most of the stuff I have
> > found, people want to sell their database. I'm willing to trade the
> > information with anyone else who has info, so it's not like I'm only
> > looking to receive.
>
> > Thanks again for all the leads.
>
> > MIchael
>
> > On Jun 29, 4:07 pm, wrote:
> >> Does anyone know where I can get a csv file of a horticulturedatabase
> >> for a project I'm working on? I'm trying to create an easy-to-use
> >> onlinedatabaseto help me source plants for my design projects. The
> >> project iswww.landscapedesignexchange.com.
>
> >> Any leads on data would be appreciated.
>
> >> Thanks.
>
> >> Michael

No. It puts it all in one field. So after having to sort thru 200k
of plants to find out the ones significant to landscaping, I'd have to
transform it. After all that, it is missing all of the relevant
criteria to select a plant for a landscape project. It seems to me
that it is more trouble than it's worth just to get the scientific/
common name and nothing else.

JoeSpareBedroom
02-07-2007, 12:39 AM
> wrote in message
oups.com...
> On Jul 1, 2:34 pm, "JoeSpareBedroom" > wrote:
>> Does the USDA file put genus, species and varietal name in 3 different
>> fields?
>>
>> > wrote in message
>>
>> oups.com...
>>
>> > Thanks for the posts. I have found a few. I had already looked at
>> > the USDA csv, but it's over 200k plants, many of which are weeds or
>> > food crop. It would be rather time consuming to go through and pick
>> > out the plants significant to landscaping.
>>
>> > I'll check out the links you guys gave. Most of the stuff I have
>> > found, people want to sell their database. I'm willing to trade the
>> > information with anyone else who has info, so it's not like I'm only
>> > looking to receive.
>>
>> > Thanks again for all the leads.
>>
>> > MIchael
>>
>> > On Jun 29, 4:07 pm, wrote:
>> >> Does anyone know where I can get a csv file of a horticulturedatabase
>> >> for a project I'm working on? I'm trying to create an easy-to-use
>> >> onlinedatabaseto help me source plants for my design projects. The
>> >> project iswww.landscapedesignexchange.com.
>>
>> >> Any leads on data would be appreciated.
>>
>> >> Thanks.
>>
>> >> Michael
>
> No. It puts it all in one field. So after having to sort thru 200k
> of plants to find out the ones significant to landscaping, I'd have to
> transform it. After all that, it is missing all of the relevant
> criteria to select a plant for a landscape project. It seems to me
> that it is more trouble than it's worth just to get the scientific/
> common name and nothing else.
>

You should be able to parse the words into separate columns easily.
Assuming, for example, that you have things like this in one field:

Pieris japonica

(all words separated by spaces, in other words), do a search-replace,
converting all spaces into some useless character like the tilde ~. Then,
use the text to columns thing with the tilde as the delimiter. Voila.

mfranklin@msfxdesigns.com[_2_]
02-07-2007, 04:43 PM
On Jul 1, 5:39 pm, "JoeSpareBedroom" > wrote:
> > wrote in message
>
> oups.com...
>
>
>
> > On Jul 1, 2:34 pm, "JoeSpareBedroom" > wrote:
> >> Does the USDA file put genus, species and varietal name in 3 different
> >> fields?
>
> >> > wrote in message
>
> oups.com...
>
> >> > Thanks for the posts. I have found a few. I had already looked at
> >> > the USDA csv, but it's over 200k plants, many of which are weeds or
> >> > food crop. It would be rather time consuming to go through and pick
> >> > out the plants significant to landscaping.
>
> >> > I'll check out the links you guys gave. Most of the stuff I have
> >> > found, people want to sell their database. I'm willing to trade the
> >> > information with anyone else who has info, so it's not like I'm only
> >> > looking to receive.
>
> >> > Thanks again for all the leads.
>
> >> > MIchael
>
> >> > On Jun 29, 4:07 pm, wrote:
> >> >> Does anyone know where I can get a csv file of a horticulturedatabase
> >> >> for a project I'm working on? I'm trying to create an easy-to-use
> >> >> onlinedatabaseto help me source plants for my design projects. The
> >> >> project iswww.landscapedesignexchange.com.
>
> >> >> Any leads on data would be appreciated.
>
> >> >> Thanks.
>
> >> >> Michael
>
> > No. It puts it all in one field. So after having to sort thru 200k
> > of plants to find out the ones significant to landscaping, I'd have to
> > transform it. After all that, it is missing all of the relevant
> > criteria to select a plant for a landscape project. It seems to me
> > that it is more trouble than it's worth just to get the scientific/
> > common name and nothing else.
>
> You should be able to parse the words into separate columns easily.
> Assuming, for example, that you have things like this in one field:
>
> Pieris japonica
>
> (all words separated by spaces, in other words), do a search-replace,
> converting all spaces into some useless character like the tilde ~. Then,
> use the text to columns thing with the tilde as the delimiter. Voila.

I understand parsing text files to csv and then flipping it to a
database. I've worked on quite a few databases as my day job is as a
computer geek (I'm only a plant geek by night/weekends). The major
problem with the USDA csv is that it contains a massive amount of
irrelevant data. Beyond that, not all plants fall easily into the
example Genus Species Cultivar/Var. For example, Abelia ' Edward
Goucher' omits species and has 2 words for cultivar. I've been doing
quite a bit of clean-up on the csv files I've been able to get. I'm
just dreading tackling the USDA csv with 200k+ entries then
eliminating tens of thousands of irrelevant entries and then cleaning
up 40-60% that don't easily convert.

Then I end up with tens of thousands of relevant plants with zero
information about them. Over time as I use them in design projects I
can add data, but I'm still forced to go outside my database to pull
relevant information.

I know this is going to be a massive undertaking and an on-going
project. I appreciate all the suggestions and if anyone is interested
in trading content, get with me at http://www.landscapedesignexchange.com.
Thanks again.

JoeSpareBedroom
02-07-2007, 04:46 PM
> wrote in message
ups.com...
> On Jul 1, 5:39 pm, "JoeSpareBedroom" > wrote:
>> > wrote in message
>>
>> oups.com...
>>
>>
>>
>> > On Jul 1, 2:34 pm, "JoeSpareBedroom" > wrote:
>> >> Does the USDA file put genus, species and varietal name in 3 different
>> >> fields?
>>
>> >> > wrote in message
>>
>> oups.com...
>>
>> >> > Thanks for the posts. I have found a few. I had already looked at
>> >> > the USDA csv, but it's over 200k plants, many of which are weeds or
>> >> > food crop. It would be rather time consuming to go through and pick
>> >> > out the plants significant to landscaping.
>>
>> >> > I'll check out the links you guys gave. Most of the stuff I have
>> >> > found, people want to sell their database. I'm willing to trade the
>> >> > information with anyone else who has info, so it's not like I'm only
>> >> > looking to receive.
>>
>> >> > Thanks again for all the leads.
>>
>> >> > MIchael
>>
>> >> > On Jun 29, 4:07 pm, wrote:
>> >> >> Does anyone know where I can get a csv file of a
>> >> >> horticulturedatabase
>> >> >> for a project I'm working on? I'm trying to create an easy-to-use
>> >> >> onlinedatabaseto help me source plants for my design projects. The
>> >> >> project iswww.landscapedesignexchange.com.
>>
>> >> >> Any leads on data would be appreciated.
>>
>> >> >> Thanks.
>>
>> >> >> Michael
>>
>> > No. It puts it all in one field. So after having to sort thru 200k
>> > of plants to find out the ones significant to landscaping, I'd have to
>> > transform it. After all that, it is missing all of the relevant
>> > criteria to select a plant for a landscape project. It seems to me
>> > that it is more trouble than it's worth just to get the scientific/
>> > common name and nothing else.
>>
>> You should be able to parse the words into separate columns easily.
>> Assuming, for example, that you have things like this in one field:
>>
>> Pieris japonica
>>
>> (all words separated by spaces, in other words), do a search-replace,
>> converting all spaces into some useless character like the tilde ~. Then,
>> use the text to columns thing with the tilde as the delimiter. Voila.
>
> I understand parsing text files to csv and then flipping it to a
> database. I've worked on quite a few databases as my day job is as a
> computer geek (I'm only a plant geek by night/weekends). The major
> problem with the USDA csv is that it contains a massive amount of
> irrelevant data. Beyond that, not all plants fall easily into the
> example Genus Species Cultivar/Var. For example, Abelia ' Edward
> Goucher' omits species and has 2 words for cultivar. I've been doing
> quite a bit of clean-up on the csv files I've been able to get. I'm
> just dreading tackling the USDA csv with 200k+ entries then
> eliminating tens of thousands of irrelevant entries and then cleaning
> up 40-60% that don't easily convert.
>
> Then I end up with tens of thousands of relevant plants with zero
> information about them. Over time as I use them in design projects I
> can add data, but I'm still forced to go outside my database to pull
> relevant information.
>
> I know this is going to be a massive undertaking and an on-going
> project. I appreciate all the suggestions and if anyone is interested
> in trading content, get with me at http://www.landscapedesignexchange.com.
> Thanks again.
>
>

I do this all day long with grocery data that was assembled by slobs. If I
could reach through the phone and grab some of these people by the
throat......

Interested in splitting the task? You take half the file and I take the
other? Reassemble it later?

mfranklin@msfxdesigns.com[_2_]
02-07-2007, 04:50 PM
On Jul 2, 9:46 am, "JoeSpareBedroom" > wrote:
> > wrote in message
>
> ups.com...
>
>
>
> > On Jul 1, 5:39 pm, "JoeSpareBedroom" > wrote:
> >> > wrote in message
>
> oups.com...
>
> >> > On Jul 1, 2:34 pm, "JoeSpareBedroom" > wrote:
> >> >> Does the USDA file put genus, species and varietal name in 3 different
> >> >> fields?
>
> >> >> > wrote in message
>
> >> oups.com...
>
> >> >> > Thanks for the posts. I have found a few. I had already looked at
> >> >> > the USDA csv, but it's over 200k plants, many of which are weeds or
> >> >> > food crop. It would be rather time consuming to go through and pick
> >> >> > out the plants significant to landscaping.
>
> >> >> > I'll check out the links you guys gave. Most of the stuff I have
> >> >> > found, people want to sell their database. I'm willing to trade the
> >> >> > information with anyone else who has info, so it's not like I'm only
> >> >> > looking to receive.
>
> >> >> > Thanks again for all the leads.
>
> >> >> > MIchael
>
> >> >> > On Jun 29, 4:07 pm, wrote:
> >> >> >> Does anyone know where I can get a csv file of a
> >> >> >> horticulturedatabase
> >> >> >> for a project I'm working on? I'm trying to create an easy-to-use
> >> >> >> onlinedatabaseto help me source plants for my design projects. The
> >> >> >> project iswww.landscapedesignexchange.com.
>
> >> >> >> Any leads on data would be appreciated.
>
> >> >> >> Thanks.
>
> >> >> >> Michael
>
> >> > No. It puts it all in one field. So after having to sort thru 200k
> >> > of plants to find out the ones significant to landscaping, I'd have to
> >> > transform it. After all that, it is missing all of the relevant
> >> > criteria to select a plant for a landscape project. It seems to me
> >> > that it is more trouble than it's worth just to get the scientific/
> >> > common name and nothing else.
>
> >> You should be able to parse the words into separate columns easily.
> >> Assuming, for example, that you have things like this in one field:
>
> >> Pieris japonica
>
> >> (all words separated by spaces, in other words), do a search-replace,
> >> converting all spaces into some useless character like the tilde ~. Then,
> >> use the text to columns thing with the tilde as the delimiter. Voila.
>
> > I understand parsing text files to csv and then flipping it to a
> > database. I've worked on quite a few databases as my day job is as a
> > computer geek (I'm only a plant geek by night/weekends). The major
> > problem with the USDA csv is that it contains a massive amount of
> > irrelevant data. Beyond that, not all plants fall easily into the
> > example Genus Species Cultivar/Var. For example, Abelia ' Edward
> > Goucher' omits species and has 2 words for cultivar. I've been doing
> > quite a bit of clean-up on the csv files I've been able to get. I'm
> > just dreading tackling the USDA csv with 200k+ entries then
> > eliminating tens of thousands of irrelevant entries and then cleaning
> > up 40-60% that don't easily convert.
>
> > Then I end up with tens of thousands of relevant plants with zero
> > information about them. Over time as I use them in design projects I
> > can add data, but I'm still forced to go outside my database to pull
> > relevant information.
>
> > I know this is going to be a massive undertaking and an on-going
> > project. I appreciate all the suggestions and if anyone is interested
> > in trading content, get with me athttp://www.landscapedesignexchange.com.
> > Thanks again.
>
> I do this all day long with grocery data that was assembled by slobs. If I
> could reach through the phone and grab some of these people by the
> throat......
>
> Interested in splitting the task? You take half the file and I take the
> other? Reassemble it later?

lol . . . that's the way I feel when I'm working on data people send
me at work. If I could just get my hands on them for 5 minutes . . .

I'd be grateful to split the task. I'd also be more than happy to
give you a copy of the entire database at the end if you have any use
for it.

JoeSpareBedroom
02-07-2007, 06:31 PM
> wrote in message
oups.com...
> On Jul 2, 9:46 am, "JoeSpareBedroom" > wrote:
>> > wrote in message
>>
>> ups.com...
>>
>>
>>
>> > On Jul 1, 5:39 pm, "JoeSpareBedroom" > wrote:
>> >> > wrote in message
>>
>> oups.com...
>>
>> >> > On Jul 1, 2:34 pm, "JoeSpareBedroom" > wrote:
>> >> >> Does the USDA file put genus, species and varietal name in 3
>> >> >> different
>> >> >> fields?
>>
>> >> >> > wrote in message
>>
>> >> oups.com...
>>
>> >> >> > Thanks for the posts. I have found a few. I had already looked
>> >> >> > at
>> >> >> > the USDA csv, but it's over 200k plants, many of which are weeds
>> >> >> > or
>> >> >> > food crop. It would be rather time consuming to go through and
>> >> >> > pick
>> >> >> > out the plants significant to landscaping.
>>
>> >> >> > I'll check out the links you guys gave. Most of the stuff I have
>> >> >> > found, people want to sell their database. I'm willing to trade
>> >> >> > the
>> >> >> > information with anyone else who has info, so it's not like I'm
>> >> >> > only
>> >> >> > looking to receive.
>>
>> >> >> > Thanks again for all the leads.
>>
>> >> >> > MIchael
>>
>> >> >> > On Jun 29, 4:07 pm, wrote:
>> >> >> >> Does anyone know where I can get a csv file of a
>> >> >> >> horticulturedatabase
>> >> >> >> for a project I'm working on? I'm trying to create an
>> >> >> >> easy-to-use
>> >> >> >> onlinedatabaseto help me source plants for my design projects.
>> >> >> >> The
>> >> >> >> project iswww.landscapedesignexchange.com.
>>
>> >> >> >> Any leads on data would be appreciated.
>>
>> >> >> >> Thanks.
>>
>> >> >> >> Michael
>>
>> >> > No. It puts it all in one field. So after having to sort thru 200k
>> >> > of plants to find out the ones significant to landscaping, I'd have
>> >> > to
>> >> > transform it. After all that, it is missing all of the relevant
>> >> > criteria to select a plant for a landscape project. It seems to me
>> >> > that it is more trouble than it's worth just to get the scientific/
>> >> > common name and nothing else.
>>
>> >> You should be able to parse the words into separate columns easily.
>> >> Assuming, for example, that you have things like this in one field:
>>
>> >> Pieris japonica
>>
>> >> (all words separated by spaces, in other words), do a search-replace,
>> >> converting all spaces into some useless character like the tilde ~.
>> >> Then,
>> >> use the text to columns thing with the tilde as the delimiter. Voila.
>>
>> > I understand parsing text files to csv and then flipping it to a
>> > database. I've worked on quite a few databases as my day job is as a
>> > computer geek (I'm only a plant geek by night/weekends). The major
>> > problem with the USDA csv is that it contains a massive amount of
>> > irrelevant data. Beyond that, not all plants fall easily into the
>> > example Genus Species Cultivar/Var. For example, Abelia ' Edward
>> > Goucher' omits species and has 2 words for cultivar. I've been doing
>> > quite a bit of clean-up on the csv files I've been able to get. I'm
>> > just dreading tackling the USDA csv with 200k+ entries then
>> > eliminating tens of thousands of irrelevant entries and then cleaning
>> > up 40-60% that don't easily convert.
>>
>> > Then I end up with tens of thousands of relevant plants with zero
>> > information about them. Over time as I use them in design projects I
>> > can add data, but I'm still forced to go outside my database to pull
>> > relevant information.
>>
>> > I know this is going to be a massive undertaking and an on-going
>> > project. I appreciate all the suggestions and if anyone is interested
>> > in trading content, get with me
>> > athttp://www.landscapedesignexchange.com.
>> > Thanks again.
>>
>> I do this all day long with grocery data that was assembled by slobs. If
>> I
>> could reach through the phone and grab some of these people by the
>> throat......
>>
>> Interested in splitting the task? You take half the file and I take the
>> other? Reassemble it later?
>
> lol . . . that's the way I feel when I'm working on data people send
> me at work. If I could just get my hands on them for 5 minutes . . .
>
> I'd be grateful to split the task. I'd also be more than happy to
> give you a copy of the entire database at the end if you have any use
> for it.
>

I see the link to the query, but just so we're both working on the exact
same file, do you want to compress it and email it to me, and let me know
what record numbers you're going to work on so I can pick up at a later
point? The "public" email address you see here is a working one. Just let me
know when you send the file. I don't check this email often because it's
mostly there to catch garbage generated by newsgroup exposure.

JoeSpareBedroom
02-07-2007, 07:37 PM
> wrote in message
oups.com...

>> Interested in splitting the task? You take half the file and I take the
>> other? Reassemble it later?
>
> lol . . . that's the way I feel when I'm working on data people send
> me at work. If I could just get my hands on them for 5 minutes . . .
>
> I'd be grateful to split the task. I'd also be more than happy to
> give you a copy of the entire database at the end if you have any use
> for it.
>

Hang on a sec. If I leave the query as general as possible, I come up with
39,000 to 47,000 plants. How did you end up with 200K?

mfranklin@msfxdesigns.com[_2_]
02-07-2007, 07:46 PM
On Jul 2, 12:37 pm, "JoeSpareBedroom" > wrote:
> > wrote in message
>
> oups.com...
>
> >> Interested in splitting the task? You take half the file and I take the
> >> other? Reassemble it later?
>
> > lol . . . that's the way I feel when I'm working on data people send
> > me at work. If I could just get my hands on them for 5 minutes . . .
>
> > I'd be grateful to split the task. I'd also be more than happy to
> > give you a copy of the entire database at the end if you have any use
> > for it.
>
> Hang on a sec. If I leave the query as general as possible, I come up with
> 39,000 to 47,000 plants. How did you end up with 200K?

I downloaded the csv from the USDA. Importing it into Excel, I had to
break it in half. first half around 120k lines. I emailed it to
you. take a look and make sure I counted right. it may be slightly
less than 200k.

JoeSpareBedroom
02-07-2007, 07:52 PM
> wrote in message
ups.com...
> On Jul 2, 12:37 pm, "JoeSpareBedroom" > wrote:
>> > wrote in message
>>
>> oups.com...
>>
>> >> Interested in splitting the task? You take half the file and I take
>> >> the
>> >> other? Reassemble it later?
>>
>> > lol . . . that's the way I feel when I'm working on data people send
>> > me at work. If I could just get my hands on them for 5 minutes . . .
>>
>> > I'd be grateful to split the task. I'd also be more than happy to
>> > give you a copy of the entire database at the end if you have any use
>> > for it.
>>
>> Hang on a sec. If I leave the query as general as possible, I come up
>> with
>> 39,000 to 47,000 plants. How did you end up with 200K?
>
> I downloaded the csv from the USDA. Importing it into Excel, I had to
> break it in half. first half around 120k lines. I emailed it to
> you. take a look and make sure I counted right. it may be slightly
> less than 200k.
>

Just got it. I think I misunderstood what your version of "clean up the
data" meant. I was referring to cleaning up the raw data, which I see
necessary for many of the "exceptional" records. But, you're talking about
editing out categories of plants. I honestly don't have that much time to
work on it that way.

mfranklin@msfxdesigns.com[_2_]
02-07-2007, 09:04 PM
On Jul 2, 12:52 pm, "JoeSpareBedroom" > wrote:
> > wrote in message
>
> ups.com...
>
>
>
> > On Jul 2, 12:37 pm, "JoeSpareBedroom" > wrote:
> >> > wrote in message
>
> oups.com...
>
> >> >> Interested in splitting the task? You take half the file and I take
> >> >> the
> >> >> other? Reassemble it later?
>
> >> > lol . . . that's the way I feel when I'm working on data people send
> >> > me at work. If I could just get my hands on them for 5 minutes . . .
>
> >> > I'd be grateful to split the task. I'd also be more than happy to
> >> > give you a copy of the entire database at the end if you have any use
> >> > for it.
>
> >> Hang on a sec. If I leave the query as general as possible, I come up
> >> with
> >> 39,000 to 47,000 plants. How did you end up with 200K?
>
> > I downloaded the csv from the USDA. Importing it into Excel, I had to
> > break it in half. first half around 120k lines. I emailed it to
> > you. take a look and make sure I counted right. it may be slightly
> > less than 200k.
>
> Just got it. I think I misunderstood what your version of "clean up the
> data" meant. I was referring to cleaning up the raw data, which I see
> necessary for many of the "exceptional" records. But, you're talking about
> editing out categories of plants. I honestly don't have that much time to
> work on it that way.

no worries. i completely understand. that's my problem as well.
that's why i'm trying to find other sources. thanks tho.

JoeSpareBedroom
02-07-2007, 09:14 PM
> wrote in message
ps.com...
> On Jul 2, 12:52 pm, "JoeSpareBedroom" > wrote:
>> > wrote in message
>>
>> ups.com...
>>
>>
>>
>> > On Jul 2, 12:37 pm, "JoeSpareBedroom" > wrote:
>> >> > wrote in message
>>
>> oups.com...
>>
>> >> >> Interested in splitting the task? You take half the file and I take
>> >> >> the
>> >> >> other? Reassemble it later?
>>
>> >> > lol . . . that's the way I feel when I'm working on data people send
>> >> > me at work. If I could just get my hands on them for 5 minutes . .
>> >> > .
>>
>> >> > I'd be grateful to split the task. I'd also be more than happy to
>> >> > give you a copy of the entire database at the end if you have any
>> >> > use
>> >> > for it.
>>
>> >> Hang on a sec. If I leave the query as general as possible, I come up
>> >> with
>> >> 39,000 to 47,000 plants. How did you end up with 200K?
>>
>> > I downloaded the csv from the USDA. Importing it into Excel, I had to
>> > break it in half. first half around 120k lines. I emailed it to
>> > you. take a look and make sure I counted right. it may be slightly
>> > less than 200k.
>>
>> Just got it. I think I misunderstood what your version of "clean up the
>> data" meant. I was referring to cleaning up the raw data, which I see
>> necessary for many of the "exceptional" records. But, you're talking
>> about
>> editing out categories of plants. I honestly don't have that much time to
>> work on it that way.
>
> no worries. i completely understand. that's my problem as well.
> that's why i'm trying to find other sources. thanks tho.
>

What about using the query interface to do some initial filtering?

mfranklin@msfxdesigns.com[_2_]
02-07-2007, 09:17 PM
On Jul 2, 2:14 pm, "JoeSpareBedroom" > wrote:
> > wrote in message
>
> ps.com...
>
>
>
> > On Jul 2, 12:52 pm, "JoeSpareBedroom" > wrote:
> >> > wrote in message
>
> ups.com...
>
> >> > On Jul 2, 12:37 pm, "JoeSpareBedroom" > wrote:
> >> >> > wrote in message
>
> >> oups.com...
>
> >> >> >> Interested in splitting the task? You take half the file and I take
> >> >> >> the
> >> >> >> other? Reassemble it later?
>
> >> >> > lol . . . that's the way I feel when I'm working on data people send
> >> >> > me at work. If I could just get my hands on them for 5 minutes . .
> >> >> > .
>
> >> >> > I'd be grateful to split the task. I'd also be more than happy to
> >> >> > give you a copy of the entire database at the end if you have any
> >> >> > use
> >> >> > for it.
>
> >> >> Hang on a sec. If I leave the query as general as possible, I come up
> >> >> with
> >> >> 39,000 to 47,000 plants. How did you end up with 200K?
>
> >> > I downloaded the csv from the USDA. Importing it into Excel, I had to
> >> > break it in half. first half around 120k lines. I emailed it to
> >> > you. take a look and make sure I counted right. it may be slightly
> >> > less than 200k.
>
> >> Just got it. I think I misunderstood what your version of "clean up the
> >> data" meant. I was referring to cleaning up the raw data, which I see
> >> necessary for many of the "exceptional" records. But, you're talking
> >> about
> >> editing out categories of plants. I honestly don't have that much time to
> >> work on it that way.
>
> > no worries. i completely understand. that's my problem as well.
> > that's why i'm trying to find other sources. thanks tho.
>
> What about using the query interface to do some initial filtering?

Actually, some additional poking around the USDA site gave me this
link: http://plants.nrcs.usda.gov/cgi_bin/topics.cgi?earl=advquery/adv_query_down.html

I think if this is what you are talking about as far as filtering. I
went straight for the download, but I think this will filter out a lot
of what I don't want. This gave me 39k entries, plus it's broken out
so I have far less database cleanup to do. Now I just have to
reorganize it and filter out the weed and food items. Thanks for
putting me on the right track. This is much more doable.

Google