<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress/2.0" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/">
<channel>
	<title>Comments on: Genealogy anywhere</title>
	<link>http://www.beyondproject.org/2006/04/06/genealogy-anywhere/</link>
	<description>Genealogy anywhere.</description>
	<pubDate>Sat, 04 Jul 2009 07:20:10 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.0</generator>

	<item>
		<title>by: Ben</title>
		<link>http://www.beyondproject.org/2006/04/06/genealogy-anywhere/#comment-54</link>
		<pubDate>Tue, 18 Apr 2006 23:33:28 +0000</pubDate>
		<guid>http://www.beyondproject.org/2006/04/06/genealogy-anywhere/#comment-54</guid>
					<description>Cool.  OCRing old handwriting (or even some people's handwriting nowadays :)) is a tough problem, and it'll be interesting to see how things progress...</description>
		<content:encoded><![CDATA[<p>Cool.  OCRing old handwriting (or even some people&#8217;s handwriting nowadays :)) is a tough problem, and it&#8217;ll be interesting to see how things progress&#8230;
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Dallan Quass</title>
		<link>http://www.beyondproject.org/2006/04/06/genealogy-anywhere/#comment-47</link>
		<pubDate>Wed, 12 Apr 2006 05:53:51 +0000</pubDate>
		<guid>http://www.beyondproject.org/2006/04/06/genealogy-anywhere/#comment-47</guid>
					<description>Check out http://ciir.cs.umass.edu/irdemo/hw-demo/  This is the most advanced demo for old handwriting recognition I've seen.  There's also a research group at INRIA, but I don't know if they have an on-line demo yet.</description>
		<content:encoded><![CDATA[<p>Check out <a href='http://ciir.cs.umass.edu/irdemo/hw-demo/' rel='nofollow'>http://ciir.cs.umass.edu/irdemo/hw-demo/</a>  This is the most advanced demo for old handwriting recognition I&#8217;ve seen.  There&#8217;s also a research group at INRIA, but I don&#8217;t know if they have an on-line demo yet.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Ben</title>
		<link>http://www.beyondproject.org/2006/04/06/genealogy-anywhere/#comment-46</link>
		<pubDate>Tue, 11 Apr 2006 23:03:15 +0000</pubDate>
		<guid>http://www.beyondproject.org/2006/04/06/genealogy-anywhere/#comment-46</guid>
					<description>Ah, good point.  I haven't thought too much about the media part yet, but it warrants a fair amount of attention.  And it feels like there's room for some good innovation here.  Haven't a clue what exactly that would be, but it's there.

Does anyone know how OCR for old handwriting is coming?  Last I heard, it was still pretty far behind, but maybe some advances have been made.  But I agree that having a scan of the original document right there is very nice.</description>
		<content:encoded><![CDATA[<p>Ah, good point.  I haven&#8217;t thought too much about the media part yet, but it warrants a fair amount of attention.  And it feels like there&#8217;s room for some good innovation here.  Haven&#8217;t a clue what exactly that would be, but it&#8217;s there.</p>
<p>Does anyone know how OCR for old handwriting is coming?  Last I heard, it was still pretty far behind, but maybe some advances have been made.  But I agree that having a scan of the original document right there is very nice.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Hilton</title>
		<link>http://www.beyondproject.org/2006/04/06/genealogy-anywhere/#comment-45</link>
		<pubDate>Tue, 11 Apr 2006 18:33:20 +0000</pubDate>
		<guid>http://www.beyondproject.org/2006/04/06/genealogy-anywhere/#comment-45</guid>
					<description>The main reason I don't think it will be scalable is because genealogy is (or will be soon) a lot more than just textual records.  Initially FSFT won't be accepting large notes or attachments.  That may change, but how much will you be able to upload in the end?  Photos?  High-res scans of historical documents?  Audio?  Video?  I don't think it's the Church's intention to service a massive media locker.  Nor do I think they could, effectively.  This requires a distributed solution.

I'm particularly interested in the high-res scans.  It is ludicrous for people to type information from old documents into their computer when they can scan them with inexpensive hardware and perform human assisted-OCR. You end up with fewer errors, and the actual source can be automatically included in the database with no extra effort.  If I disagree with what someone typed in, I can just look at the scan instead of hunting down the actual document.</description>
		<content:encoded><![CDATA[<p>The main reason I don&#8217;t think it will be scalable is because genealogy is (or will be soon) a lot more than just textual records.  Initially FSFT won&#8217;t be accepting large notes or attachments.  That may change, but how much will you be able to upload in the end?  Photos?  High-res scans of historical documents?  Audio?  Video?  I don&#8217;t think it&#8217;s the Church&#8217;s intention to service a massive media locker.  Nor do I think they could, effectively.  This requires a distributed solution.</p>
<p>I&#8217;m particularly interested in the high-res scans.  It is ludicrous for people to type information from old documents into their computer when they can scan them with inexpensive hardware and perform human assisted-OCR. You end up with fewer errors, and the actual source can be automatically included in the database with no extra effort.  If I disagree with what someone typed in, I can just look at the scan instead of hunting down the actual document.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Ben</title>
		<link>http://www.beyondproject.org/2006/04/06/genealogy-anywhere/#comment-43</link>
		<pubDate>Tue, 11 Apr 2006 17:09:40 +0000</pubDate>
		<guid>http://www.beyondproject.org/2006/04/06/genealogy-anywhere/#comment-43</guid>
					<description>Just out of curiosity, would you care to expound on why a centralized solution won't be scalable?  (Not that I disagree, but I'm interested to see why.)

And yes, I agree that not everyone will want to host their data with the Church.  That's just something that happens with religious organizations.  And I also agree that innovation is a lot easier in a smaller group (which is why startups are often successful as opposed to large companies like Microsoft -- &lt;a href=&quot;http://www.paulgraham.com/&quot; rel=&quot;nofollow&quot;&gt;Paul Graham&lt;/a&gt; talks a lot about this).

There will indeed be separation between data and presentation.  It's often a pain when the presentation doesn't do what you need it to do.  (For example, today in the lab where I work, someone wanted to copy the individual list from PAF into Excel.  But you can't do that, or at least not in any easy way.  You can't select the list or do anything except export it into GEDCOM, really.  Not good.  Locking anyone into using only your program is a bad thing.</description>
		<content:encoded><![CDATA[<p>Just out of curiosity, would you care to expound on why a centralized solution won&#8217;t be scalable?  (Not that I disagree, but I&#8217;m interested to see why.)</p>
<p>And yes, I agree that not everyone will want to host their data with the Church.  That&#8217;s just something that happens with religious organizations.  And I also agree that innovation is a lot easier in a smaller group (which is why startups are often successful as opposed to large companies like Microsoft &#8212; <a href="http://www.paulgraham.com/" rel="nofollow">Paul Graham</a> talks a lot about this).</p>
<p>There will indeed be separation between data and presentation.  It&#8217;s often a pain when the presentation doesn&#8217;t do what you need it to do.  (For example, today in the lab where I work, someone wanted to copy the individual list from PAF into Excel.  But you can&#8217;t do that, or at least not in any easy way.  You can&#8217;t select the list or do anything except export it into GEDCOM, really.  Not good.  Locking anyone into using only your program is a bad thing.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Hilton</title>
		<link>http://www.beyondproject.org/2006/04/06/genealogy-anywhere/#comment-42</link>
		<pubDate>Tue, 11 Apr 2006 15:48:50 +0000</pubDate>
		<guid>http://www.beyondproject.org/2006/04/06/genealogy-anywhere/#comment-42</guid>
					<description>I attended a UVPAFUG meeting where Brad Christensen said that Family Tree would be open source.  What that means specifically, I don't know.

Regardless, Beyond as a web application is a great idea.  I've been mulling over some future scenarios and it seems to me that a centralized solution will not ultimately be scalable.  Furthermore, not everyone will trust the Church as a data repository.  And of course, as you mention Ben, some simply want to host their data themselves.  This offers the additional advantage of flexibility and innovation, which would come much more slowly to a bureaucracy.

I think these reasons are compelling enough to press forward.  I would make the obvious recommendation though of a careful separation between data and presentation.  Create a solid data source component, along with a useful web-based UI.  Don't worry about the API just yet.  When/if anyone wants to write a fat client for it, it won't be difficult to design a good API.  If noone does, at least you'll have a good architecture.</description>
		<content:encoded><![CDATA[<p>I attended a UVPAFUG meeting where Brad Christensen said that Family Tree would be open source.  What that means specifically, I don&#8217;t know.</p>
<p>Regardless, Beyond as a web application is a great idea.  I&#8217;ve been mulling over some future scenarios and it seems to me that a centralized solution will not ultimately be scalable.  Furthermore, not everyone will trust the Church as a data repository.  And of course, as you mention Ben, some simply want to host their data themselves.  This offers the additional advantage of flexibility and innovation, which would come much more slowly to a bureaucracy.</p>
<p>I think these reasons are compelling enough to press forward.  I would make the obvious recommendation though of a careful separation between data and presentation.  Create a solid data source component, along with a useful web-based UI.  Don&#8217;t worry about the API just yet.  When/if anyone wants to write a fat client for it, it won&#8217;t be difficult to design a good API.  If noone does, at least you&#8217;ll have a good architecture.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Dallan Quass</title>
		<link>http://www.beyondproject.org/2006/04/06/genealogy-anywhere/#comment-40</link>
		<pubDate>Tue, 11 Apr 2006 03:30:40 +0000</pubDate>
		<guid>http://www.beyondproject.org/2006/04/06/genealogy-anywhere/#comment-40</guid>
					<description>I'm not the best person to ask regarding whether family tree is going to be open source.  The Church does seem to be moving (albeit slowly) in that direction</description>
		<content:encoded><![CDATA[<p>I&#8217;m not the best person to ask regarding whether family tree is going to be open source.  The Church does seem to be moving (albeit slowly) in that direction
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Ben</title>
		<link>http://www.beyondproject.org/2006/04/06/genealogy-anywhere/#comment-36</link>
		<pubDate>Mon, 10 Apr 2006 04:18:47 +0000</pubDate>
		<guid>http://www.beyondproject.org/2006/04/06/genealogy-anywhere/#comment-36</guid>
					<description>I've become pretty much convinced that the way to go is develop the web client first and foremost, because it's going to be used by the most people.  I personally plan on spending my time working on the web, but the API will make desktop clients possible (and hopefully people &lt;i&gt;will&lt;/i&gt; want to write them), so that there's offline synchronization in those cases where people don't have Internet access.  (And there'll certainly be those cases.)  So web it is.  And if it appears that a desktop client is desperately needed, so be it, but I have a feeling that it's not going to be as important as I thought.  (And if I'm wrong, well, the API will make it an easy problem to fix. :))

The CVS repository idea is still important, of course, even using web clients alone, because there's always the possibility that you and someone else could be working on the data at the same time.

As for Family Tree, even if Beyond ends up being similar (excluding the desktop clients for the time being), I think it's still worth it to proceed, because innovative ideas are important (like tags, for example).  And variety is good.  And people may want to host it themselves.  Okay, I think I need to come up with better reasons, and it's starting to sound like I'm repeating myself a lot. :)

Is Family Tree going to be open source, by the way?</description>
		<content:encoded><![CDATA[<p>I&#8217;ve become pretty much convinced that the way to go is develop the web client first and foremost, because it&#8217;s going to be used by the most people.  I personally plan on spending my time working on the web, but the API will make desktop clients possible (and hopefully people <i>will</i> want to write them), so that there&#8217;s offline synchronization in those cases where people don&#8217;t have Internet access.  (And there&#8217;ll certainly be those cases.)  So web it is.  And if it appears that a desktop client is desperately needed, so be it, but I have a feeling that it&#8217;s not going to be as important as I thought.  (And if I&#8217;m wrong, well, the API will make it an easy problem to fix. :))</p>
<p>The CVS repository idea is still important, of course, even using web clients alone, because there&#8217;s always the possibility that you and someone else could be working on the data at the same time.</p>
<p>As for Family Tree, even if Beyond ends up being similar (excluding the desktop clients for the time being), I think it&#8217;s still worth it to proceed, because innovative ideas are important (like tags, for example).  And variety is good.  And people may want to host it themselves.  Okay, I think I need to come up with better reasons, and it&#8217;s starting to sound like I&#8217;m repeating myself a lot. :)</p>
<p>Is Family Tree going to be open source, by the way?
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Dallan Quass</title>
		<link>http://www.beyondproject.org/2006/04/06/genealogy-anywhere/#comment-35</link>
		<pubDate>Sun, 09 Apr 2006 06:59:41 +0000</pubDate>
		<guid>http://www.beyondproject.org/2006/04/06/genealogy-anywhere/#comment-35</guid>
					<description>That makes sense regarding living data - require people to grant specific access to other users.  Also, if you require that everyone use Beyond, then you don't need to worry as much about GEDCOM compatibility.  Your allowing non-beyond users to edit the pedigree online using an ajax client is what I was trying to suggest in my previous note.  My question is the same as yours: which client will be used by the most people, and which should be developed first?

I don't think you could say that the Church expects Family Tree to replace PAF.  First of all, their initial incarnation doesn't have a desktop interface.  Nor does it have the ability to export GEDCOM.  My understanding is that these things wil be added later.  And they'll probably want other desktop genealogy programs to interface with Family Tree through some kind of sync protocol.  But a shared family respository like a CVS repository, where the updates that you make are automatically reflected in my copy the next time I sync, and where the two of us can choose to share data on living as well as dead people, would require more work I think.  So it seems you have a good niche with your idea.

BTW, I've checked into the various XML GEDCOM specs.  Most of them don't appear to have had any activity in years.  GenXML is a the only exception I could find.  Developing a parser that can accept the various flavors of GEDCOM out there and output data suitable for XML or database representation is proving to be somewhat of a challenge.</description>
		<content:encoded><![CDATA[<p>That makes sense regarding living data - require people to grant specific access to other users.  Also, if you require that everyone use Beyond, then you don&#8217;t need to worry as much about GEDCOM compatibility.  Your allowing non-beyond users to edit the pedigree online using an ajax client is what I was trying to suggest in my previous note.  My question is the same as yours: which client will be used by the most people, and which should be developed first?</p>
<p>I don&#8217;t think you could say that the Church expects Family Tree to replace PAF.  First of all, their initial incarnation doesn&#8217;t have a desktop interface.  Nor does it have the ability to export GEDCOM.  My understanding is that these things wil be added later.  And they&#8217;ll probably want other desktop genealogy programs to interface with Family Tree through some kind of sync protocol.  But a shared family respository like a CVS repository, where the updates that you make are automatically reflected in my copy the next time I sync, and where the two of us can choose to share data on living as well as dead people, would require more work I think.  So it seems you have a good niche with your idea.</p>
<p>BTW, I&#8217;ve checked into the various XML GEDCOM specs.  Most of them don&#8217;t appear to have had any activity in years.  GenXML is a the only exception I could find.  Developing a parser that can accept the various flavors of GEDCOM out there and output data suitable for XML or database representation is proving to be somewhat of a challenge.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Ben</title>
		<link>http://www.beyondproject.org/2006/04/06/genealogy-anywhere/#comment-33</link>
		<pubDate>Sat, 08 Apr 2006 20:41:35 +0000</pubDate>
		<guid>http://www.beyondproject.org/2006/04/06/genealogy-anywhere/#comment-33</guid>
					<description>Hmm, I hadn't thought about data on living people.  Right now I don't necessarily see Beyond as being a &lt;i&gt;published&lt;/i&gt; set of data (meaning, even though you can access it from a web client, it's not a public set of pages).  Originally I'd thought of having Beyond output a set of HTML files, which seems to be the norm, but instead I think it would be better to say, &quot;I want these lines to show up on this website,&quot; and that way the link is live and you don't have to keep uploading those HTML files.  The less the user has to do to keep things going, the better.  This way, the user would choose which lines and which people to expose to the public view, basically, and I think that solves the living people problem.

As for compatibility, Beyond will be PAF compatible, and if I can get file format specs for other record managers, it'll read/write those formats as well.  I agree that GEDCOM is rather lossy in some cases, and that's not good.  There are other standards (most based on XML, it seems), and I'll be evaluating those to see if any would work.  Not for storage, that is -- it'll be SQL -- but for transfering data.  In all reality, though, I doubt that those companies are necessarily going to want to hand out their file format specs.  Some may, but certainly not all.

I think that to pull off the synchronization, other collaborators will have to use Beyond, because other record managers won't have the sync functionality.  (If they added it, however, that'd be another question.)  My hope is that Beyond will be good enough that everyone will want to switch to it anyway. ;)

I'm not quite sure I understand what you mean by removing offline updates to the pedigree.  I &lt;i&gt;do&lt;/i&gt; plan on using Ajax to build the web client, and it'll be quite functional for editing one's pedigree.  (The main difference I see between the offline clients and the web client is that some of the fancier stuff won't be in the web client, but with advances in technology it may become a moot point.)  I want the web client to be as rich as possible.  In fact, if this takes off, I see people mainly using the web client instead of the desktop client (like Gmail and the other web-based e-mail clients replacing desktop e-mail clients).  And yet there still needs to be offline functionality, because sometimes you go places where there isn't any Internet access.

Does the Church expect Family Tree to replace PAF?  It didn't seem like that was the case.  Hopefully Beyond will be able to interface with Family Tree (and WeRelate, for that matter).  We'll see.</description>
		<content:encoded><![CDATA[<p>Hmm, I hadn&#8217;t thought about data on living people.  Right now I don&#8217;t necessarily see Beyond as being a <i>published</i> set of data (meaning, even though you can access it from a web client, it&#8217;s not a public set of pages).  Originally I&#8217;d thought of having Beyond output a set of HTML files, which seems to be the norm, but instead I think it would be better to say, &#8220;I want these lines to show up on this website,&#8221; and that way the link is live and you don&#8217;t have to keep uploading those HTML files.  The less the user has to do to keep things going, the better.  This way, the user would choose which lines and which people to expose to the public view, basically, and I think that solves the living people problem.</p>
<p>As for compatibility, Beyond will be PAF compatible, and if I can get file format specs for other record managers, it&#8217;ll read/write those formats as well.  I agree that GEDCOM is rather lossy in some cases, and that&#8217;s not good.  There are other standards (most based on XML, it seems), and I&#8217;ll be evaluating those to see if any would work.  Not for storage, that is &#8212; it&#8217;ll be SQL &#8212; but for transfering data.  In all reality, though, I doubt that those companies are necessarily going to want to hand out their file format specs.  Some may, but certainly not all.</p>
<p>I think that to pull off the synchronization, other collaborators will have to use Beyond, because other record managers won&#8217;t have the sync functionality.  (If they added it, however, that&#8217;d be another question.)  My hope is that Beyond will be good enough that everyone will want to switch to it anyway. ;)</p>
<p>I&#8217;m not quite sure I understand what you mean by removing offline updates to the pedigree.  I <i>do</i> plan on using Ajax to build the web client, and it&#8217;ll be quite functional for editing one&#8217;s pedigree.  (The main difference I see between the offline clients and the web client is that some of the fancier stuff won&#8217;t be in the web client, but with advances in technology it may become a moot point.)  I want the web client to be as rich as possible.  In fact, if this takes off, I see people mainly using the web client instead of the desktop client (like Gmail and the other web-based e-mail clients replacing desktop e-mail clients).  And yet there still needs to be offline functionality, because sometimes you go places where there isn&#8217;t any Internet access.</p>
<p>Does the Church expect Family Tree to replace PAF?  It didn&#8217;t seem like that was the case.  Hopefully Beyond will be able to interface with Family Tree (and WeRelate, for that matter).  We&#8217;ll see.
</p>
]]></content:encoded>
				</item>
</channel>
</rss>
