<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>LitSoft &#187; E-Discovery: The Good, The Bad, and The Ugly</title>
	<atom:link href="http://www.litsoftllc.com/category/e-discovery-good-bad-ugly/feed" rel="self" type="application/rss+xml" />
	<link>http://www.litsoftllc.com</link>
	<description>Just another WordPress weblog</description>
	<lastBuildDate>Wed, 27 Jan 2010 16:41:25 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>E-Discovery Search 101</title>
		<link>http://www.litsoftllc.com/e-discovery-good-bad-ugly/e-discovery-search-101-2</link>
		<comments>http://www.litsoftllc.com/e-discovery-good-bad-ugly/e-discovery-search-101-2#comments</comments>
		<pubDate>Mon, 01 Jun 2009 17:19:32 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[E-Discovery: The Good, The Bad, and The Ugly]]></category>

		<guid isPermaLink="false">http://www.litsoftllc.com/?p=229</guid>
		<description><![CDATA[These days, it’s becoming harder and harder to stay on top of the various searching practices that may be applied during the e-discovery lifecycle. While it is universally agreed that search is the best and most efficient way to reduce the mind bogging volumes of data, which also makes it the best and most efficient [...]]]></description>
			<content:encoded><![CDATA[<p class="MsoNormal" style="margin-bottom: 0.0001pt;">These days, it’s becoming harder and harder to stay on top of the various searching practices that may be applied during the e-discovery lifecycle.<span> </span>While it is universally agreed that search is the best and most efficient way to reduce the mind bogging volumes of data, which also makes it the best and most efficient way to reduce the mind boggling costs of e-discovery, agreement on the best search technology and methodology is vigorously contested.<span> </span>Not surprisingly, where one stands on the topic of search technology and methodology typically falls in lock step with the company one represents.<span> </span>In this article, I will leave the bias aside and simply provide a basic overview and definition of the most common search technologies and methodologies used in e-discovery. <span id="more-229"></span></p>
<p class="MsoNormal" style="margin-bottom: 0.0001pt;">
<p><br/></p>
<p class="MsoNormal" style="margin-bottom: 0.0001pt;"><strong>Over-inclusive vs. Under-inclusive</strong></p>
<p class="MsoNormal" style="margin-bottom: 0.0001pt;">These self-explanatory terms are applicable in several contexts within the legal domain.<span> </span>With respect to e-discovery search, they refer to search practices that either; A) do not return all the documents that may be relevant to the particular search (under-inclusive), or B) return more than just the documents that are relevant to the particular search (over-inclusive). <span> </span>While it is generally accepted that all search methodologies will have a certain level of each, too much of either is a recipe for disaster. Judge Paul Grimm’s opinion last year in the Victor Stanley case provides an interesting perspective on the over-inclusive – under-inclusive issue, while also providing useful insight on how to enhance the defensibility of search in general.</p>
<p class="MsoNormal" style="margin-bottom: 0.0001pt;">
<p><br/></p>
<p class="MsoNormal" style="margin-bottom: 0.0001pt;"><strong>Keyword Search</strong></p>
<p class="MsoNormal" style="margin-bottom: 0.0001pt;">Thanks to our friends at Google and Yahoo!, most of us are more or less familiar with how keyword search works.<span> </span>Simply type in a word or phrase, and the search engine returns the documents that contain that word or phrase. – no rocket science there.<span> </span>Of course, even keyword search has its complexities.<span> </span>While Boolean operators (AND, OR, NOT) are relatively straightforward, “wildcard” or “star” searching (e.g., “infra*” returns any document containing a word that begins with those characters), stemming (e.g., “jog” with stemming returns any documents containing “jog”, “jogs”, “jogger”, “jogged”, “jogging”) and “fuzzy” searching (returns any document containing various spellings and/or misspellings of a search term), have made keyword search a lot more susceptible to some of the issues that make search such a controversial and scrutinized subject within the e-discovery space.</p>
<p class="MsoNormal" style="margin-bottom: 0.0001pt;">
<p><br/></p>
<p class="MsoNormal" style="margin-bottom: 0.0001pt;"><strong>Concept Search</strong></p>
<p class="MsoNormal" style="margin-bottom: 0.0001pt;">Concept search, which has been getting a lot of buzz in the industry lately, tries to overcome the under-inclusive issue that is perceived to be inherent in traditional keyword search.<span> </span>The theory is that by simply looking for specific words, keyword search will miss documents regarding the same subject matter but containing synonyms of the keywords rather than the keywords themselves.<span> </span>For example, one document may refer to an employee as having been “fired”, while another may refer to the employee as having been “terminated.”<span> </span>While these documents clearly address the same topic, it’s easy to see how keyword search could miss one or the other.<span> </span>That said, the flip side of the under-inclusive coin is over-inclusiveness, which can lead to many other problems, including skyrocketing costs.</p>
<p class="MsoNormal" style="margin-bottom: 0.0001pt;">
<p><br/></p>
<p class="MsoNormal" style="margin-bottom: 0.0001pt;"><strong>Concept Categorization</strong></p>
<p class="MsoNormal" style="margin-bottom: 0.0001pt;">Concept categorization, also known as concept or document clustering, gained mainstream notoriety in the e-discovery space with the success of Attenex and Stratify.<span> </span>What these tools and others like them do is automatically bundle documents of similar subject matter together, which subsequently increases reviewer productivity.<span> </span>While this notion certainly has merit, at some point the industry began to think of concept categorization as a search process.<span> </span>Concept categorization looks at each document, applies some logic (e.g, Bayesian), and based on that logic makes a determination on which documents have similar subject matter.<span> </span>Concept categorization does not take in a search term and retrieve documents based on that parameter.</p>
<p class="MsoNormal" style="margin-bottom: 0.0001pt;">
<p><br/></p>
<p class="MsoNormal" style="margin-bottom: 0.0001pt;"><strong>“Black-Box”</strong></p>
<p class="MsoNormal" style="margin-bottom: 0.0001pt;">Traditionally search companies like the aforementioned Google and Yahoo! guard their search algorithms as extremely confidential trade secrets.<span> </span>This approach also applies to enterprise search companies, many of who have had their technology adopted by the e-discovery industry.<span> </span>The problem is that secrecy is not a good thing when is comes to the question of how evidence was found.<span> </span>Imagine a homicide proceeding where the prosecution answers the question of how they found the murder weapon with, “Sorry Judge, it’s a secret.”<span> </span>Needless to say, that probably wouldn’t sit well with most judges.<span> </span>Unfortunately, that’s essentially what attorneys that use “black-box” search tools have to say when the same question is posed regarding documents in an e-discovery case.<span> </span>Of course, there are ways to mitigate this issue, and consulting with opposing counsel and agreeing on search methodology before hand is the key component for doing so.</p>
<p class="MsoNormal" style="margin-bottom: 0.0001pt;">
<p><br/></p>
<p class="MsoNormal" style="margin-bottom: 0.0001pt;"><strong>“Transparent”</strong></p>
<p class="MsoNormal" style="margin-bottom: 0.0001pt;">The idea behind transparent search is to overcome the exact issue presented by black-box search algorithms.<span> </span>By bringing the operations that search engines typically do behind the scenes to the forefront, and, in some cases, allowing users to tweak those operations, transparent search is supposedly better suited for e-discovery and the defensibility requirement inherent in any litigation.<span> </span>Take for example the stemming capabilities offered by many keyword search tools.<span> </span>In a typical black-box search tool, a stemming enabled search for the term “jog” might automatically return any documents containing the words “jog”, “jogs”, “jogger”, “jogged”, and “jogging.”<span> </span>In a black-box solution, these stem variants would be largely unknown and the user would not have any ability to tweak them.<span> </span>Transparent search on the other hand, would allow users to adjust the stem variants as necessary, therefore limiting the search to only the desired terms and providing full “transparency” into the process.<span> </span>While the benefits of transparent search are clear, the technology is quite new and has yet to be proven in the mainstream e-discovery space.<span> </span>Furthermore, transparent search creates more complexity and additional areas of debate among opposing counsels.<span> </span>Time will tell if the promise of transparent search truly comes to fruition.</p>
<p class="MsoNormal" style="margin-bottom: 0.0001pt;">
<p><br/></p>
<p class="MsoNormal" style="margin-bottom: 0.0001pt;">E-Discovery search is a complex and ever evolving topic.<span> </span>Like so much of the technology in the industry, e-discovery has had to make due with search tools that were not designed for use in litigation or a legal setting, which has led to lessons being learned the hard way.<span> </span>However, the development of e-discovery specific search technology, together with more refined search methodologies, the future is bright, and search promises to play an increasingly important role in e-discovery.</p>
<p><!--EndFragment--></p>
]]></content:encoded>
			<wfw:commentRss>http://www.litsoftllc.com/e-discovery-good-bad-ugly/e-discovery-search-101-2/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Web-based vs. Web-enabled</title>
		<link>http://www.litsoftllc.com/e-discovery-good-bad-ugly/web-based-vs-web-enabled</link>
		<comments>http://www.litsoftllc.com/e-discovery-good-bad-ugly/web-based-vs-web-enabled#comments</comments>
		<pubDate>Sat, 11 Apr 2009 19:16:04 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[E-Discovery: The Good, The Bad, and The Ugly]]></category>

		<guid isPermaLink="false">http://www.litsoftllc.com/?p=197</guid>
		<description><![CDATA[About four years ago, when I first began researching and writing the LitSoft business plan, I was amazed to find out that several of the &#8220;main stream&#8221; hosted ESI review platforms we&#8217;re not actually web-applications.  Instead, they were traditional desktop applications that were hosted and served over the web via terminal service technology, most [...]]]></description>
			<content:encoded><![CDATA[<p>About four years ago, when I first began researching and writing the LitSoft business plan, I was amazed to find out that several of the &#8220;main stream&#8221; hosted ESI review platforms we&#8217;re not actually web-applications.  Instead, they were traditional desktop applications that were hosted and served over the web via terminal service technology, most commonly Citrix.  Popular ESI review platforms that are web-enabled desktop applications include Attenex, CaseLogistix, and Stratify, among many others.<span id="more-197"></span></p>
<p>As one could logically surmise, the primary advantage of web-enabling an existing desktop application using Citrix is that doing so allows geographically dispersed access to the application without having the redevelop the software as a web application.  The cost-benefit here is an assessment of whether the redevelopment costs outweigh the added efficiencies a true web application would bring to bear.  Sometimes &#8212; depending on many other variables that I&#8217;d rather not get in to here &#8212; the answer is yes.  Conversely, very often the answer is no, which is why so many enterprises have undertaken development projects to modernize their applications and convert them to web-based.</p>
<p>In a hosted ESI review context, the primary advantage of web-enabling desktop applications using Citrix becomes a moot point.  The users of these applications (GCs and Law Firms) are not the owners of the technology, and therefore the cost-benefit analysis I previously mentioned is not applicable.  In a hosted/on-demand/software as a service model, users&#8217; only concern is having a tool that facilitates maximum efficiency at a minimum cost, and a desktop application that has been web-enabled via Citrix certainly doesn&#8217;t fit this bill.  I&#8217;m not trying to bash Citrix here &#8212; it certainly has its &#8220;sweet spot&#8221; &#8212; I just want to be clear that in a hosted ESI review context, this practice is down right laughable.</p>
<p>So what makes desktop applications that are web-enabled via Citrix less efficient than true web applications?</p>
<ol>
<li>Higher Cost: Citrix (and similar terminal service technology) require costly licenses for the software to be deployed on both the server and all client machines.  In addition, more hardware is required to support the same number of users/data.</li>
<li>Steeper Learning Curve: A desktop application can employ a multitude of user interface/navigation constructs.  True web-applications on the other hand, all have similar navigation constructs that have become second nature for most users.</li>
<li>Worse Performance: Because web-applications have been designed and developed to work over the web, they perform better than desktop applications in that environment (faster response times, page loads/screen refreshes, data transfers, etc).</li>
<li>Decreased Integrations: Web applications are built on protocols and standards, making them relatively easy to integrate with other web applications.  Desktop applications were never meant to integrate in this way, therefore they can not.</li>
</ol>
<p>With the evolution of Web 2.0, web applications are becoming much more functional and responsive, and have enhanced the overall user experience.  My guess is that for hosted ESI review, web-enabled desktop applications will eventually be phased out completely, which will increase the overall efficiency of e-discovery.  Let&#8217;s hope our client community demands that this occur sooner rather than later.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.litsoftllc.com/e-discovery-good-bad-ugly/web-based-vs-web-enabled/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Truth About &#8220;Native&#8221; Review</title>
		<link>http://www.litsoftllc.com/e-discovery-good-bad-ugly/the-truth-about-native-review</link>
		<comments>http://www.litsoftllc.com/e-discovery-good-bad-ugly/the-truth-about-native-review#comments</comments>
		<pubDate>Tue, 20 Jan 2009 21:44:52 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[E-Discovery: The Good, The Bad, and The Ugly]]></category>

		<guid isPermaLink="false">http://www.litsoftllc.com/?p=190</guid>
		<description><![CDATA[This first post for 2009 centers around the notion of &#8220;native&#8221; document/ESI review.  While I&#8217;m not sure who/what company first coined the term &#8220;native review&#8221;, I do know that the concept really picked up momentum during LegalTech 2005 when Ringtail burst on to the scene with its flagship product &#8212; Casebook &#8212; a web-based, [...]]]></description>
			<content:encoded><![CDATA[<p>This first post for 2009 centers around the notion of &#8220;native&#8221; document/ESI review.  While I&#8217;m not sure who/what company first coined the term &#8220;native review&#8221;, I do know that the concept really picked up momentum during LegalTech 2005 when Ringtail burst on to the scene with its flagship product &#8212; Casebook &#8212; a web-based, &#8220;native&#8221; review tool.  The primary benefit of &#8220;native&#8221; review was that it was no longer necessary to convert all electronic documents to a uniform format (typically either TIFF of PDF) prior to review.  Not having to convert every single page of every single document meant huge savings in overall eDiscovery costs, and &#8220;native&#8221; review almost instantly became widely accepted as a best practice.  In fact, so much hype was generated by Casebook that FTI Consulting acquired the Ringtail shortly after LegalTech 2005 for $35 MM (see<a href="http://www.blogger.com/%20http://www.fticonsulting.com/web/about/pressreleases/139/FTI_Consulting_Completes_Acquisition_of_Ringtail_Solutions.html"> here</a> for the official press release on that acquisition).<span id="more-190"></span></p>
<p>My issue with the term &#8220;native review&#8221; is that it implies these tools are displaying documents in their native application, for example, xyz.doc displayed in MS Word.  Of course, that is not the case, and to build a review platform that actually launched the native application of every document that might exist in a collection is completely outside the realm of feasibility.  &#8220;Native&#8221; review tools still display documents through a viewer, just as TIFFs are displayed.  However, &#8220;native&#8221; review viewers are more sophisticated in that they are able to render hundreds of document formats, and make them appear very similar to the way they would be displayed in their native application.  In some instances (for MS Office documents for example), these native viewers even have hooks into the native applications that help refine the rendering, furthering the &#8220;native&#8221; review myth.</p>
<p>To take things a step further, virtually all &#8220;native&#8221; review tools use some variant of Oracle&#8217;s (formerly Stellent&#8217;s) Outside In viewer technology.  Whether it be Avantstar&#8217;s Quick View Plus (used by Ringtail), or Daeja System&#8217;s ViewONE Pro (used by iCONECTnXT and LitScope), or some other flavor, it&#8217;s all Outside In behind the scenes.  This is why all native review tools can claim virtually identical capabilities in terms of the number of file formats their application can render (which is up to over 400 at this point).  Of course, Outside In is much more than just a viewer, but I&#8217;ll save further expansion on that topic for a later post.</p>
<p>Finally, let me reiterate that I am in no way questioning the benefits of &#8220;native&#8221; review &#8212; I am a huge proponent of this practice and believe any vendor recommending TIFF (or any other uniform image format) reviews is doing their client a disservice.  I just think it&#8217;s important that both vendors and client&#8217;s alike understand what &#8220;native review&#8221; really means, which will (hopefully) help us all get on the same page.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.litsoftllc.com/e-discovery-good-bad-ugly/the-truth-about-native-review/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Eeny, Meeny, Miny, Moe&#8230;</title>
		<link>http://www.litsoftllc.com/e-discovery-good-bad-ugly/eeny-meeny-miny-moe</link>
		<comments>http://www.litsoftllc.com/e-discovery-good-bad-ugly/eeny-meeny-miny-moe#comments</comments>
		<pubDate>Wed, 10 Sep 2008 18:52:56 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[E-Discovery: The Good, The Bad, and The Ugly]]></category>

		<guid isPermaLink="false">http://www.litsoftllc.com/?p=193</guid>
		<description><![CDATA[As a member of the e-discovery vendor community, I often self sympathize on the challenge I, my company, and my other vendor-side colleagues face when trying to sell e-discovery services.  Between the fierce competition and the (justifiable) &#8220;what have you done for me lately&#8221; mind set among clients, keeping the project pipeline at capacity [...]]]></description>
			<content:encoded><![CDATA[<p>As a member of the e-discovery vendor community, I often self sympathize on the challenge I, my company, and my other vendor-side colleagues face when trying to sell e-discovery services.  Between the fierce competition and the (justifiable) &#8220;what have you done for me lately&#8221; mind set among clients, keeping the project pipeline at capacity is a constant struggle.  However, from time to time I think its important that we take a step back and try to put ourselves in our clients&#8217; shoes (clients &#8212; feel free to do the same <img src='http://www.litsoftinc.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> ).<span id="more-193"></span></p>
<p>If I were on the client side of the fence, I think it&#8217;d be a HUGE struggle to sift through the various vendors who pitch their services to me on a daily basis.  The sheer number of e-discovery vendors &#8212; a number that increases daily &#8212; is one thing, but perhaps a bigger challenge is the overwhelming variance among companies offering e-discovery services.  To illustrate this point, let&#8217;s look at the top 20 e-discovery service providers based on the 2008 Socha-Gelbmann Survey.  Companies within this group include:</p>
<ul>
<li>Two of the &#8220;Big Four&#8221; accounting firms</li>
<li>A business advisory consulting firm</li>
<li>A computer forensics software company</li>
<li>Several pure e-discovery ASPs</li>
<li>A company that has evolved from paper-based litigation support services</li>
</ul>
<p>Keep in mind this variance is only among the top 20 providers, and there is an estimated 500 providers in the space.  So how exactly does one select from this list?  On the law firm side, many have turned to a formal RFP/Response process.  This is probably long overdue, but unfortunately many of the RFPs that I&#8217;ve seen are poorly structured/written and therefore have limited value.  On the corporate side, many turn to independent consultants &#8212; of which there seems to be an endless number these days &#8212; to help vet/qualify/disqualify providers.</p>
<p>Any way you slice it, picking the the right provider is a daunting task, and I for one do not envy those charged with doing so.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.litsoftllc.com/e-discovery-good-bad-ugly/eeny-meeny-miny-moe/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>$2500 per GB or $4.30 per GB?</title>
		<link>http://www.litsoftllc.com/e-discovery-good-bad-ugly/2500-per-gb-or-430-per-gb</link>
		<comments>http://www.litsoftllc.com/e-discovery-good-bad-ugly/2500-per-gb-or-430-per-gb#comments</comments>
		<pubDate>Fri, 16 May 2008 04:11:25 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[E-Discovery: The Good, The Bad, and The Ugly]]></category>

		<guid isPermaLink="false">http://www.litsoftllc.com/?p=186</guid>
		<description><![CDATA[I&#8217;m a little late to the party, but I managed to get a few minutes to catch up on the latest comings and going in the eDiscovery world and found this  press release announcing the strategic alliance between Attenex and Kazeon.  The press release reads more or less like typical marketing fluff until [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m a little late to the party, but I managed to get a few minutes to catch up on the latest comings and going in the eDiscovery world and found this <a href="http://www.kazeon.com/newsroom2/Press-Releases-2008/kazeon_pr_50608.pdf"> press release </a>announcing the strategic alliance between Attenex and Kazeon.  The press release reads more or less like typical marketing fluff until you get to about the middle of the 2nd page where it makes the claim that, &#8220;Kazeon delivers industry-leading price/performance for in-house processing of ESI in preparation for reactive and proactive eDiscovery matters as low as <span style="font-weight: bold;">$4.30 per Gigabyte</span>.&#8221;  Immediately I thought that it had to be a type-o (I first thought it was a &#8220;read-o&#8221; until I reread it a few times).  But alas, it was not a type-o &#8212; it was an actual claim made by two companies that should definitely know better.<tt><span id="more-186"></span></tt></p>
<p><tt>For me, the timing of this read was somewhat ironic as just a couple days prior I had learned that a large project I had been tracking for a few weeks had been awarded to one of eDiscovery "big boys" (as per the norm on this blog I will not name names).  My source on this deal told me that the vendor did not budge off their standard rate, which is known to be somewhere between $2000 - $2500 per GB (which includes cull down, extraction, clustering/concept search, and perhaps 3 months hosting).  Learning that this particular law firm (and most definitely many more) would still allow one of their client to be charged $2000 - $2500 per GB for these services was equally, if not more baffling to me.</tt></p>
<p><tt>So what's the bigger joke, $2500 per GB or $4.30 per GB???</tt></p>
]]></content:encoded>
			<wfw:commentRss>http://www.litsoftllc.com/e-discovery-good-bad-ugly/2500-per-gb-or-430-per-gb/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
