Google Docs Ready for (Legal) Primetime? : E-Discovery Bytes
oday’s predominant word processors are Microsoft Word and Corel WordPerfect. MS Word is also offered as a web-based application or Saas (Software-as-a-Service). However, there is a newer type of document collaboration, where numerous people have access to the same document so that they can all contribute and monitor changes made by others. These types of applications are becoming more common. For example, Google has begun to offer its own Google Word Processor called “Google Docs” — which allows users to share and collaborate on documents.
What does it matter which type you use in your business? Here’s one comparison between the Google and Microsoft web products. But there’s much more when it comes to the battle between WORD v. GOOGLE DOCS.
via Google Docs Ready for (Legal) Primetime? : E-Discovery Bytes.
A recent improvement for Arabic searches | Official Google Blog
We've learned that when performing a search on Google, people sometimes forget to separate words with spaces. Moreover, people often mistakenly repeat a letter within a single word. For instance, when writing the query [amazingly beautiful poem], you might write it as [amazingly beautiifullpoem].
These types of errors are much more common in languages like Arabic, where most of the letters are cursive. That means that the shapes of the letters change, based on the position of the letter in the word (initial, middle, final or isolated). Moreover, some Arabic letters are considered word breaks, meaning that the following letter must be in an “initial” shape. In other words, if the last letter of one word is a word break, the following word may not be separated with a space.
For example, the queries [وزارةالتعليم] and [وزارة التعليم] have an identical meaning (Ministry of Education) and they’re both written in a common form for Arabic documents. But they have different, albeit correct, formats — the first query is written as a single word, while the second is written as two. Google needs to understand that while they’re written differently, they mean the same thing and should yield the exact same search results. In this example, both queries were written correctly, just in different formats. But sometimes people just make errors — like repeating the same letter twice. For example, you might write [راائعة الجماال], repeating the letter “ا” twice in both query words. In this case the correct spelling should be [رائعة الجمال]. It's important that Google search recognizes your query — despite spelling errors.
To address issues like this, we recently developed a search ranking improvement that targets certain Arabic queries. Our algorithm employs rules of Arabic spelling and grammar along with signals from historical search data to decide when to leave out spaces between words or when to remove unnecessarily repeated letters. Now, when you type a query leaving out spaces or repeating a letter, we’ll return better results based not only on what you typed, but also on what our algorithm understands is the “correct” query. For example, here's what happens when you type [قصيدة راائعةالجماال] ([amazingly beautiful poem] in Arabic) with repeated letters and dropped spaces between words.
via Official Google Blog: A recent improvement for Arabic searches.
Court Tells Microsoft to Edit Word | BusinessWeek
Microsoft Corp. (MSFT), the world's biggest software maker, must alter its popular Word software or stop selling the product after it lost its appeal of a $200 million patent-infringement verdict won by a Canadian company.
The company, based in Redmond, Washington, was given until Jan. 11 — five months from the original order issued in August — to make the change by the U.S. Court of Appeals for the Federal Circuit in Washington. Word is part of Microsoft’s Office software, used by more than 500 million people.
The court today upheld a verdict which has since grown to $290 million won by closely held I4i LP of Toronto. The dispute is over a patented invention related to customizing extensible markup language, or XML, a way of encoding data to exchange information among programs. Microsoft has called it an “obscure functionality.”
The Million-Dollar Words of E-Discovery
Throughout history, humans have had miscommunications and misunderstandings. Some are comical like Abbott and Costello’s “Who’s On First?”; others can be tragic like “Romeo and Juliet.” Most of the time, a misunderstood word here or there doesn’t have far-reaching implications, but in the legal profession a misunderstood word or instruction can cost millions.
In today’s litigious environment, attorneys and IT professionals frequently find themselves in the midst of an e-discovery project that requires the team to quickly find information, which can reside on computers and smartphones, within applications such as e-mail, or on backup tapes stored in the closet of someone’s home. Put these challenges together and the stage is set for expensive, complex, and sometimes frustrating, e-discovery projects.
THE BILLION-DOLLAR MARKET
E-discovery is among the fastest growing segments in the IT industry. Spending for EDD software and services will reach an estimated $1 billion by year-end according to Gartner. A survey of medium-sized U.S. companies conducted by Kroll Ontrack found that, on average, companies will spend $1.29 million to manage electronic data in 2009 compared with $437,000 last year.
[continued] Law.com – The Million-Dollar Words of E-Discovery.