r5 - 12 Jun 2007 - 14:19:46 - ErikBorraYou are here: TWiki >  Opensearch Web > ProjectDescription
This page describes our project. Currently the description of our project consists of translated excerpts from our application for digital pioneers.

Theme of project

This project is an exemplary peer to peer, collaborative event, whereby people mutually form a search engine without the intervention of central servers or a central actor.

Motiviation

The direct motivation for this project is the increasingly worrying censorship and manipulation by major multinational companies of search engines (google, yahoo, microsoft and the censorship of the Chinese versions of their sites.) See also WhyOpenSearch.

Resulting product

The final product will be an easy to install multi-platform (MS Windows, Linux, Macintosh) client, which will not overload the computer. The client has a twofold goal: "crawling" the web for indexation and storage of the indexed data. Via the client the user may introduce search queries to open search, with will then be answered by the network.

The key-points are:

  • Decentralisation, to prevent manipulation (censorship, biasing certain sites)
  • Redundancy, to exclude single-points-of-failure and to make manipulation difficult
  • Easily accessible: to maximize participation the number of possible technical obstacles will be kept to a strict minimum, so no complicated installation procedures and no excessive memory usage.

An important side product is a community of interested people, developers but also enthusiastic users (the early adopters). This community is necessary to keep on developing and distributing the product.

Target group

The target group is the internet user in the largest possible sense. The more internet users participate, the more indexed data will be gathered.

In order to reach the target group we will keep the software to be installed as easy as possible. Obstacles which might deter people must be avoided. Moreover, the client will be published in forums, conventions and meetings, via articles in magazines and of course by referring to it through our network.

A special category of internet user is the hacker/idealist/programmer who wants to help implement the project. This subcategory will be approached separately through our networks within these groups and by visiting meetings in the Netherlands and probably Belgium and Germany. Besides, we plan to organise three workshops (with virtual presence possible) to get this group involved.

Similar initiatives

Comparable projects do exist (see ExternalLinks). The difference with our project however is that the others do distributed crawling, but then use central storage for their results. It is explicitly our goal to exclude central storage.

Planning and results

Our initial schedule, submitted to digital pioneers (see also section Other parties, funding below) proposed the schedule below:

  Phase Activities Results/ Final product Start End
1. Kick-off Appointment developer, set up of foundation; kick-off workshop Architectural requirements 14 Nov. 2006 12 Dec.2006
2. Design Development architecture, research of existing, related products Architectural design whitepaper 13 Dec.2006 18 Jan.2007
3. Development Development Rel-0.9.0-beta of the client 19 Jan.2007 19 April 2007
4. Beta Tests, roll-outs, fixing bugs. Rel-1.0.0 of the client 20 April 2007 18 June 2007
pr1 Community building Visit of hackers' congress, writing articles for programmers' and hackers' magazines Start of a developers' community 14 Nov. 2006 19 April 2007
pr2 Early adopter recruitment User groups, visits to universities and organisations (eff, ffii...) Group beta testers 20 April 2007 18 June 2007
pr3 Roll-out User groups, popular computer magazines (eff, ffi,…) An increasing user-base 19 June 2007

As of may 2007, it has become clear that this planning was unrealistic. A revised planning has been made, where we split tasks between the foundation and the developer. We also add two new actors: community and students. Building the community should result in work being split off to the community at some point. Also, the idea has been raised to work with students of different disciplines and integrate part of the work in their course material. With the increasing tendency of universities organising their curriculum around real-world projects, this might provide a good opportunity.

Month Foundation Developer Community Students
April 07 Release alpha version of client idem    
May 07 Investigate further funding Wrap up current activities, mostly p2p    
  Publish articles, create buzz      
  Developing front-end      
June 07 Apply for further funding Write documentation    
  Publish articles, create buzz      
  Wrap up digital pioneers funding (report)      
July 07 inactive inactive    
August 07 Invest in community (re)new developer, re-evaluate p2p layer    
  Prepare student projects      
September 07        
October 07        
November 07        
December 07        
January 08        
February 08        

Software used

The open source/ free C Compiler gcc is used on unix/macos, on windows the gnu environment cygwin is used. For windows we will make use of an open source unix-socket-compatibility library. The user interface is entirely html/xml based, and is for access through a web browser exclusively. This eliminates the need to implement cross-platform compatible GUI code.

Content

Content is not the core of the project, but has an important auxiliary role. Good communication is vital to develop a community of developers (and probably later of users). To this end we will work via wiki (www.open-search.net), mailing lists and an IRC chat channel (most likely freenode.net).

Deliverables

The application will be available for free for the most frequently used versions of the supported operating systems (mac, windows, unix). During the project and thereafter the source codes will be available for free via a publicly accessible source code repository and be covered by the GPL (version 2) licence. All promotion material will be distributed with a creative commons licence. As is usual with free software projects, in the end it is the community which will continue the project through further development and support.

Organisation

The application is introduced by the govcom.org foundation ; for the implementation of the project itself a separate OpenSearchFoundation will be set up. The govcom.org foundation is an Amsterdam-based non-profit organisation; its aim is the development and hosting of political tools on the web. Examples are the issue crawler, which is software used to map debates on the web, a visualisation of the digital divide, and projects on internet censorship. govcom.org is financed by universities, research centers and social organisations. Projects are carried out by volunteers, paid professionals or by both.

The board of governors of the OpenSearchFoundation will be composed of:

  • President: Dr. Richard Rogers, lecturer at Amsterdam University, founder of Govcom.org foundation (www.govcom.org), winner of the 2005 Best Information Science Book of the Year Award of the American society for Information Science and Technology (ASIST);
  • Treasurer: Koen Martens, owner of Sonologic, programmer, hacker, organiser, treasurer of GroenLinks The Hague section and idealist;
  • Secretary: initiator MSc Erik Borra, expert in the field of information technique and public administration, learning systems and text analysis.

The aim of the foundation is defined as follows:

The aim of the open search foundation is the development, distribution and maintenance of distributed, peer-to-peer search engines without central actors or storage. On the one hand this should prevent major multinationals to become preponderant in the field of information on the internet and on the other had to prevent public authorities to use the information concerning people's search behaviour to persecute them on the basis of their political convictions, sexual preferences or other characteristics.

As a developer we propose to hire Robin Gareus, MSc physicist who e.a. already developed a high performance network file system for the CERN cyclotron in Switzerland and is known here as an all round hacker and skilled programmer. He has already shown his interest in the assignment. Moreover, Erik Borra and Koen Martens will be responsible for the organisation, coordination, networks and public relations.

Budget

Development
Labour costs development 20000
Expenses various activities 3000
23000
Facilities
Workshops 1000
1000
Publicity
Design logo 500
Publicity 720
800
2020
Administration
Setting up foundation 300
Fees accountants 700
1000
Hardware
Purchase pc for developer 800
Serverpark (second hand, about 10 machines) 1000
KVM switch 70
1870
Total (in Euro) 28890

Labour costs for the developer include social insurance premiums, taxes and all legally required contributions .

Planning is as follows:

  • December 2006: 0.8 fte for the development of the information architecture and design.
  • January through March 2007: 0.8 fte for the development of the beta-version.
  • April through the middle of June 2007: 0.4 fte for beta-testing and bug-fixing

Workshops

Three workshops will be organised to increase the involvement of potential developers. These workshops are organised around three special milestones: kick off (12 December 2006), the completion of the architecture and the start of the development (18 January 2007) and the start of the beta fase (19 April 2007). Renowned speakers will be invited to give a short presentation about that particular phase of the project to attract participants. There will be time available to brainstorm about the subject with an expert. Each workshop will be virtually accessible via a chat channel, projected on a screen, and a live audio and video stream. The workshops have two goals: to assemble interested people and to gather useful input. We count on approximately 20 participants.

Publicity

To attract a large public for the project, a number of publicity channels must be used. A friend designer will develop our foundation style and logo, which explains the low cost. As initiator Koen Martens owns a hosting company, websites and mailing lists can be developed for free and managed at a low cost.

To make the project known, reach potential developers and build a community, it is essential to participate in leading conferences in the field. Participation to the Chaos Communication Congress is intended: this is a yearly event organised by the chaos computer club. This meeting is attended by hackers from all over Europe and beyond, who are not only nerds, but also idealists, activists and people involved in the combination politics, society and ICT. The organisation reacted positively to our request to organise a workshop. This is an excellent opportunity to expand a community.

In the Netherlands and Belgium as well internet censorship and entanglement of interests in information flows get ever more attention; several organisations hold public meetings (such as HCC, nluug, de Balie, Creative Commons, Kennisland, political parties, education institutions). We will try to attend these events, if possible with an active contribution.

Other parties, funding

Initial funding came from the digital pioneers project, an initiative of the Netherlands Knowledgeland foundation to sparkle socialy committed innovative IT projects.

We are currently looking for further funding, candidates can be on ProjectFunding

Follow-up

The main feature of the project is that it is not meant to generate profits, so a commercial sequel is not intended. At the end of the first version the foundation will continue to exist, and if necessary look for subsequent financial sponsoring. It is important to make the community enthusiastic, so that volunteers from all over the world will contribute to its development according to the open source/free software model. Experience has shown that a living and active product is vital to reach this goal. Although our aim is not commercial, we are aware of the fact that this project is rather innovative. Enterprises and public authorities become interested as well in the possibilities of open source and peer-to-peer software. Of course this project will generate knowledge which could be interesting and useful for those parties.

Edit | WYSIWYG | Attach | Printable | Raw View | Backlinks: Web, All Webs | History: r5 < r4 < r3 < r2 < r1 | More topic actions
 
This site is powered by the TWiki collaboration platformCopyright © by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback