Welcome to Community Server Sign in | Join

Automatic Classification of Documents

Grid Computing Applications

<October 2003>
SuMoTuWeThFrSa
2829301234
567891011
12131415161718
19202122232425
2627282930311
2345678

Navigation

Subscriptions

Discussion on document classification

This is what myself and Mohan discussed:

We build a web service that is capable of classifying all the text based documents, given the directory containing documents to be classified. A remote machine name, documents directory and permisison to acess it will be critical input to web service. As a service, we will go through all the documents, classify them, assign some sort of index, will restructure directory based on the classification and store the indexes on server side. More importantly, we will put that machine in our grid and install a web service on itself so that any changes made later in the documents can be taken care and updated indexes can b enotified to server. In a way, this will be an application of grid computing and web services. Classification techniques and algorithm needs to be studied and explored in detail. Whenever, some of the machine in the grid goes out of the network, thoses indexes will not be available to the user using web services at server or at any of the machine connected. A user can also use these web services in oreder to search and locate some document based on some keyword or classification tecchnique implemented in service. Classification technique will be implemented as plug-ins so that new ones acn be added or current ones can be modified later and made more effecient and effective, if needed.

We are planning to meet tomorrow at 10am in M.Engg. lab to exlplore further by initiating a group discussion among ourselves. So in case if we are able to make it, let us meet there ot otherwise let us allocate some other time and date. It will not be feasible for all of us to meet all the time, and that should be fine.

~Vishal

posted on Monday, October 27, 2003 3:29 PM by sapna