Archive for the ‘Search tools’ Category

WorldVitalRecords.com and free databases vs. Ancestry.com

Thursday, September 13th, 2007

Look out Ancestry! The competition is gaining on you fast!

There’s a new genealogy site that I’m quite impressed with. It has grown quickly to the point where it’s the third most visited genealogy site online, per Alexa.
Right now they’re offering a sale until Sept 17 with 2 years of access for $49. Even at the normal $49/year it’s a great deal, and it’s not cheap because it’s lacking in content. They’re adding dozens of databases weekly, and they even offer full free access to any new databases for 10 days after they are added - making it well worth checking out even if you aren’t ready to subscribe. The free content already sold me on a membership.

Browse recently added databases to see the new content and which ones are free. Or go ahead and sign up - they’ve made enough deals to ensure they’ll keep you busy with new data for a long time.

While there are a lot of subscription sites out there, this one seems like it has a chance to displace Ancestry.com from the top site slot, and could force Ancestry to stop charging $299 for their full world collection.

World Vital Records was founded by Ancestry founder Paul Allen, who seems openly critical of Ancestry as having priced itself too high since he sold off his interest in the company.

Full WVR access for $49 is a bargain compared to $299 for full access at Ancestry.com, and the recent moves by TGN/Ancestry with regard to the mess called FTM 2008 and the Internet database collection that alienated many online genealogists suggest Ancestry is arrogantly out of touch with the genealogy community.

The WorldVitalRecords.com two year plan also includes free RootsMagic genealogy software.

I’m looking for alternatives since Family Tree Maker 2008 won’t even import my old FTM file.

Ancestry already convinced me not to renew via their recent actions, so now WVR shines with a ray of hope that says inexpensive alternatives can offer great promise. I’m throwing my support to WVR as a vote of confidence, in the hopes that it forces a wake-up call at TGN/Ancestry, and because I’m sure I’ll find a lot by looking in fresh new sources.

WVR is also supposed to have a heavy emphasis on social networking (aka Web 2.0) by helping connect regional researchers to share data.

Geographical context is a great stepping stone to expanding on what you already know and for finding tangential info on your ancestors and how they lived.

Recent deals with Accessible Archives and NewspaperArchive.com have added hundreds of small town newspapers. One with Everton brings valuable books and info with it. I don’t think you’ll find any other genealogy site bringing such a broad range of materials together in one place as quickly as WVR has been growing.

Microsoft Live Books Search beta?

Saturday, September 8th, 2007

Much like Google Books Search, Microsoft’s Live Books Search offers a much-overlooked resource for free genealogy searches and free downloadable books in the public domain.

There are a few things Microsoft does better than Google, much as I hate to admit it. They have more accurate OCR conversions than Google Books, from what I’ve seen. That makes it easier to cut and paste text from the books, which is a real time saver when using info from them.

Microsoft offers downloadable PDF files of public domain books that include the text layer so that you can keyword search the files offline and even save the books as text, or just paste parts you want into your databases and web pages.

But Microsoft’s historical offerings are a bit slimmer than Google’s, and the search/web interface is much harder to use for reading books online.

And it’s weird: when was the last time you heard anything about Microsoft books? They seemed to get into it as a defensive reaction to Google Books, and as far as I could tell they’re still working as part of a consortium and donating public domain books to the Internet Archive project. I can’t tell if they’re still scanning books.

Try to find Live’s Book Search from the Microsoft Live site. Hello? It’s not even off the More menu like Google hides their books search. It’s not even on the Betas page linked from the More menu. It’s a bit hard to tell if they’re even still actively scanning books, but it’s there, if I search Google for it. Go figure.

It looks like it launched in December 2006 and has had no press since then. And, wow, http://books.live.com gets me there but redirects to a different URL.

But there is content there not found on Google Books and vice versa.

Microsoft’s attention to detail shows. But the interface sucks and it looks like they are hiding it and hoping we’ll forget it’s there less than a year after the beta launched.

At this point it’s still a useful tool and worth doing some genealogy searches on to see what you may find. If you find a useful book, download a copy and get Google Desktop to index it for you so you can search your Microsoft Books locally.

More on Google Books

Saturday, September 8th, 2007

I started to tack on some other thoughts and info on Google and Microsoft’s book search tools on my last post but thought it would be better to break out this part from the news about the new “My Library” tools that Google Books just launched.

If you hadn’t already noticed, Google Books Search has become an incredibly valuable research tool, and one that’s very useful for genealogists. They’re giving away access to the same public domain data other companies are trying to sell us.

They’re constantly adding new books as they continue scanning libraries around the country and many of these are in the public domain and are fully readable and downloadable for free.

Google Books offers a huge library of history and genealogy data that can be keyword searched for free. It also means that books that previously could only be seen by driving to distant libraries or by paying reprint or CD-ROM companies for copies of long out of print books can now be searched from home. Many of these books are not well indexed in printed form but now you can find any name in the book, in theory.
A few weeks ago Google Books added an “Accessibility” feature allowing you to see (and copy) text versions of the book images and this is wonderful.

In practice Google seems a little sloppy in the OCR conversions. How could a title like
PRUNSYLNANIA ARRHINES by MATTHEW S. QUAY - 1876 slip by spell checks? I’ve also found cases where a keyword search found one instance of “Graham” in a book and my reading the book found other instances, or missing pages.

When you click on Google’s Accessibility/text feature you may see that some of the text is garbled but you can easily cut and paste from the text and save a lot of time in transcribing excerpts.

For example, here’s a book of Chester County PA tax records that is just begging to be diced up into USGENWEB/PA Roots/Rootsweb pages for each township. Most of the work is already done; it just needs to be copied, reformatted and carefully cross-checked for accuracy.

I had some issues with columns and poor OCR conversions in Google Books last time I checked. The downloadable PDF files didn’t seem to include the OCR text layers and this means they aren’t searchable offline, and aren’t indexed by Google Desktop. But I’m sure we’ll get there eventually.

I’m hoping that as Google adds new books the PDF text layer will be there. I’d also like to see them re-compile the old PDFs, and to use a more consistent file naming system that clearly identifies a book as volume 1, 2 or Series 3, book 9.

While they’re at it they could add a custom hidden text data table for Google Desktop to compile a “My Desktop Library” index of local PDF Google books sortable by title, author, year, etc. That could easily be exported to be “mashed” into other useful tools.

Which reminds me, how about adding OCR interpretation to Google Desktop’s PDF scanning, Google? You clearly do OCR conversions on the fly for PDF to HTML conversions in Google web searches and it would be nice if my downloaded Google Books library could be indexed locally.

Google Books also offers no way to report problems with specific books. The many books with partial or missing pages, and no way to report them other than the feedback comments, suggest they just don’t care. I don’t get it. They’ve digitized some of the best libraries in the country but made it look like a rush job. Surely they want to fix the problems. Let us flag them for you.

Google still doesn’t do a great job of clearly identifying editions and books with similar titles and doesn’t include relevant volume info in the search summary snippets or many of the About this Book pages. It really should show volume info on the header with the truncated titles by the author info.

For example, see the many volumes of Pennsylvania Archives that all try to save as the same book title and that don’t clearly state volume and book number in the About this Book pages. There are at least 10 series of these books and over 100 volumes. Try finding the Sixth Series, Volume XII on the search results above. This volume at least includes the series/volume info on the About page so maybe they’ve started adding more info in more recent scans.

Anyway, despite any shortcomings I love Google Books. I suspect they’re going to change our world and the way many of us interact with books. There’s a lot of needles in those haystacks and Google Books Search acts like a magnet to pull them out for you.

Google Books adds “My Library”, tags, reviews and personal searches

Saturday, September 8th, 2007

It seems like every week or even every few days Google adds some new feature or service that wows me. And this time they added features that do what I asked for in feedback I sent a few weeks ago. I’m sure they were working on it anyway, but it’s like my Christmas came early this year and I got what I wanted.

These features may seem like an obvious advancement, but what they represent is revolutionary. This marks a radical shift that will have a broad cultural impact on how people relate to books and use them. I’m sure of it. It adds value to the content. It’s a hugely effective tool to streamline productivity for researchers, and it can make sharing books fun.

It’s the YouTube of Books added as a front end to what has to be the world’s largest virtual library. Electronic books have started growing up to transcend what we can do with printed books. It smells a little like Amazon’s Listmania but it throws in full text searches and free books.

Here’s my Google Books “Library” and new custom search engine. I’m not putting the tags or reviews to good use yet. I just got click happy paging through search results building a basic catalog for my interests in Quakers, early colonial Pennsylvania, Virginia, New York, Delaware and the Carolinas, and England, Scotland, Ireland and Presbyterian history.

In less than a few hours I created a custom genealogy index of over 500 books that I know contain useful info on my ancestors.

In less time than I could do a look-up in a printed book index I can search the full text of hundreds of books I selected because I know or suspect they are relevant to my research. Sure, I could search thousands more books just going to Google Books, but what I need are more ways to filter the world’s largest library to quickly find relevant results.

Now we have the tools to catalog, reference, review and share what we find in this virtual library.
“My Library” in this case also functions as a custom genealogy search engine for me and other people, much in the way my Google Custom Search Engine genealogy search does.

Here’s the Google Books blog post announcing this new personalized library feature, including a critical link to the FAQ that seems to be missing from the “My Library” pages themselves so far.

The FAQ sums it up:

“You can now create personalized libraries on Google Book Search where you can label, review, rate, and of course, full-text search, a customized selection of books. These collections will live online and be accessible anywhere you can log in to your Google account. Once you’ve built a collection, you can share it with friends by sending them a link to your library in Google Book Search. You can even set up RSS feeds with friends so that they’re alerted when you add new books to your collection.”

This is really great news. And really useful.

Anyway, one thing I really wanted was a better way to track if I had already looked at specific books before, and I thought that adding tags, reviews and social networking along the lines of Amazon lists or YouTube videos would be a great way to open up more functional ways to create and share custom libraries. And that’s exactly what they rolled out with the new feature. You can now add Google’s books to your own library and add tags to help you manage and filter them. And you can add reviews, share links to your book collection, export lists, and best of all, you can search within your library and offer that search feature to others.

Think about this for a minute. This isn’t just a way to share cool videos or recommend books. It’s a way to mine data in public domain books for research, school projects and much more. It’s essentially a way to build custom book search engines along the lines of the Google Custom Search Engine (CSE) web tools they offer.

It would be nice if they’d add the ability to use tags in advanced searches so you can filter better, and to create sub-libraries so I could keep multiple book lists specific to Pennsylvania history, Virginia history, etc. I’m sure we’ll see that eventually. I’d also like to see a way to save “Favorite Searches” and have them run automatically, flagging new books for you as they are added.

As it is now it’s a very useful tool and I expect it’s going to catch on quick for bloggers, researchers and academics who want to offer custom book searches.

The export feature opens up use of other tools to do mash-ups, mapping, and custom affiliate linked book lists to Amazon, library searches and much more.

And this isn’t just limited to public domain books. The searches also cover indexed content where you may not be able to see more of the results than snippets, and this makes it easier to create lists of books to hunt down in libraries.

Google Book Search effectively trumps traditional library card catalogs by offering rich, full indexes of books that can be searched down to specific words, names and locations even if that info isn’t in the traditional printed index.

It’s starting to feel like we’re really in the 21st century. This can totally change the way we search for info in libraries and bookstores. If publishers can’t see the value in people being able to find their content even if they can’t read it free online then they’re living in the past. Surely this will sell more books even if it’s indexing books that aren’t in the public domain and despite Google giving away more books than you could fit in any single family home.

And now Google adds a way to make sharing books fun and useful and as cool as sharing a link to a video but infinitely more useful. We will see experts on various topics filtering for us to create useful subsets of data we can search.

It doesn’t replace printed books. It makes them easier to find, buy, read and reference.

It took me about 10 minutes to add 137 books specific to Colonial Pennsylvania history and/or specific surnames I’m researching. This becomes a custom search tool for me and anyone else who wants to use it.

The tagging and reviews can take it all a step further. Reviews and tags can be used to annotate books and track and share what we find in them.

Family Tree Maker 2008 beta drops CD-ROM database reader!

Saturday, July 14th, 2007

If you own or use genealogy database CDs and Family Tree Maker software I want to alert you to some potential concerns about the next version of FTM software leaving out CD-ROM support.

I was initially excited about the new Family Tree Maker beta. I rely heavily on FTM for research on Ancestry.com and to maintain my research. On initial look the new software update is much more than the minor revisions of the last few versions. It’s a total re-write of the software with a more web based (and Ancestry.com oriented) approach that could really make this an even more useful tool.

But now I’m pretty angry to find that support for reading data CD-ROMs is being dropped from the new FTM.

If you own Family Archive CDs and use Family Tree Maker it’s important that you check out the FTM 2008 beta and send feedback to the beta feedback email address. And do it quickly, or you’ll find yourself left out and unable to use your CDs in future versions of FTM.

There is a discussion group unrelated to the authors of the program where people are discussing the beta that also may interest you. The list is run by users and intended to cover more general technical issues with the software than just the current beta release. But this also seems to be the only place where there is a public discussion of the changes we see in the software. See the FTM-Tech mailing list for more info and discussions of the software, but be aware that the best place to send feedback about the beta is to the beta email address.

There is a free piece of software, the Family Archive Viewer, which allows you to read the CDs, but it doesn’t allow you to update info from them into a GEDCOM database. For years the software vendors selling the CDs have encouraged use of Family Tree Maker because it expands the usefulness of the CDs.

So now, after buying FTM because of these recommendations, being a customer for years who has purchased upgrades 3 times in the last 4 years or so, and investing hundreds of dollars in reference CD-ROMs, I discovered the next upgrade will obsolete any ability to work directly with these CDs to insert data into my research.

Stranger still, the parent company that develops the software still sells the database CDs at their Ancestry web store.

It’s not clear if the motivation is just that they are too lazy to provide backwards compatibility in this re-write or why they decided it was ok to leave this out.
The inability to save files readable by previous versions of the software is another complaint on the beta discussion board, FTM-Tech, on rootsweb.com. Users such as myself are unable to import some old databases, and there seems to be much debate about the handling of sources text fields.

It seems likely that this discontinued CD support is motivated by the desire to get customers to subscribe to Ancestry.com instead of buying CDs.

The online databases are great, and Ancestry has become a fairly essential research tool. But I’ve been advised that I will just need to keep an old version of the software if I want to use CDs with my database (and then I can’t write data back and forth between the two easily, apparently). And that just bothers the heck out of me.

Try our custom genealogy Google Search

Monday, May 7th, 2007

We’ve created a custom Google search tool to search a selected group of genealogy and history web sites. Try it, and add it to your Google Home page if you like it. Let us know what you think or if there are additional sites you think we should add.

Tips: try names in quotes as “Firstname Lastname” and also try again with “Lastname, Firstname”

Try pairing surnames with place names, such as: Graham Carlisle
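
If you want to run those variations for a lot of names, the tips above can be sketched in a few lines of Python. This is only a rough illustration: the function name name_queries and the first name "John" are made up for the example, and it simply builds the search strings for you to paste into the search box. It uses plain straight quotes, since those are what search engines expect for exact phrases.

```python
def name_queries(first, last, place=None):
    """Build the search-string variants suggested in the tips (hypothetical helper)."""
    queries = [
        f'"{first} {last}"',   # exact phrase, given name first
        f'"{last}, {first}"',  # exact phrase, surname first, as in many indexes
    ]
    if place:
        # surname paired with a place name, no quotes
        queries.append(f"{last} {place}")
    return queries


# Example with a made-up first name; paste each line into the search box.
for query in name_queries("John", "Graham", place="Carlisle"):
    print(query)
```

Leaving out the place name just gives you the two quoted name forms.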
