Implementation of Rule Based Algorithm for Sandhi-Vicheda Of Compound Hindi Words



IJCSI International Journal of Computer Science Issues, Vol. 3, 2009

48


ISSN (Online): 1694-0784

ISSN (Printed): 1694-0814

eg’o;

egτ $ ,?o;

no'o;

По $ ,‘o;

ije?o;

ije $ ,oo4

;Fkfrgifld

;ftt$ ,frgifld

Table 8: Rule VII Implemented Word List

Step 4.8: (Rule for eliminating the half letter in
Sandhi- Vicheda) If find the (Half CH) (
P) Letter then
eliminates the Letter and decompose the word.

ifU/kPNn

ιfU∕τ $ Nn

fopNn

fθ $ Nn

ifjpNn

ifj $ Nn

y{4hPNTOT

ye $ nt;t

Table 9: Rule VIII Implemented Word List

Step 4.9: (Rule of Visarga in Sandhi Vicheda) If find
the (Half Letter) then replace with Sign (
: )visarga.

fU?py

fU% $ py

fU'rt

fu% $ r t

∏Liτgι

n $ Ikgl

fULrτj

fU% $ rkj

Table 10: Rule IX Implemented Word List

Step 5: Repeat Steps 4.1 to 4.9 to check the next word
for checking the Vyanjan that combined with Matra.
Then replace the Matra with Swar.

Step 6: Find the Unicode value for each of the Hindi
characters and additional characters and use those
values to implement above rules.

Step 7: Display the results.

Our module was developed in Visual Basic.NET
(2005) and the encoding used for text was in Unicode,
most suitable for other applications as well. Unicode
uses a 16 bit encoding that provision for 65536
characters. Unicode standard [18] assigns each
character a unique numeric value and name. Presently
it provides codes for 49194 characters:

In Hindi Language:      Total Swar=13

Total Vyanjan=33

Total Matra=13

V RESULTS AND DISCUSSION

We have tested our software on more than 200 words.
Using the Rule based algorithm we have reported an
accuracy of 60-80% depending upon the number of
rules to be implemented. SANDHI-VICHEDA is an
easy and interesting way that can give entirely new
dimension that add new way to traditional approach to
Hindi Teaching.

VI CONCLUSION AND FUTURE WORK

In this paper, we presented the technique for the
Sandhi-Vicheda of compound hindi words. Using the
Rule based algorithm we have reported an accuracy of
60-80% depending upon the number of rules to be
implemented. As future work, database can be
extended to include more entries to improve the
accuracy. This software can be used as a teaching aid
to all the students from Class-V to the highest level of
education. With this software one can learn about the
very important aspect of Hindi Grammar i.e.
‘SANDHI-VICHEDA’. By adding new more features,
we can upgrade it to learn all the aspects of Hindi
Grammar. It can also be used to solve and test the
problems related to Hindi Grammar.

ACKNOWLEDGEMENT

We would like to thank Dr. G.S. Lehal, Professor and
Head, Department of Computer Science, Punjabi
University, Patiala for many helpful suggestions and
comments.

REFERENCES

[1] Bharati, Akshar, Vineet Chaitanya & Rajeev Sangal,
1991,
A Computational Grammar for Indian languages
processing
, Indian Linguistics Journal, pp.52, 91-103.

[2] Bharati A., Chaitanya V and Sangal R, "Natural
Language processing: A Paninian Perspective", Prentice
Hall of India, 1995.

[3] Cheng, Chin-Chuan “English Stresses and Chinese Tones
in Chinese Sentences” California University, Berkeley,
Phonology Laboratory.

[4] Dan W. Patterson “Introduction to Artificial Intelligence
and Expert Systems” Prentice Hall P-227.

IJCSI



More intriguing information

1. The Works of the Right Honourable Edmund Burke
2. The name is absent
3. Constructing the Phylomemetic Tree Case of Study: Indonesian Tradition-Inspired Buildings
4. The name is absent
5. Political Rents, Promotion Incentives, and Support for a Non-Democratic Regime
6. The economic value of food labels: A lab experiment on safer infant milk formula
7. Regional differentiation in the Russian federation: A cluster-based typification
8. Estimated Open Economy New Keynesian Phillips Curves for the G7
9. The name is absent
10. BARRIERS TO EFFICIENCY AND THE PRIVATIZATION OF TOWNSHIP-VILLAGE ENTERPRISES
11. Female Empowerment: Impact of a Commitment Savings Product in the Philippines
12. The name is absent
13. The name is absent
14. The Macroeconomic Determinants of Volatility in Precious Metals Markets
15. The name is absent
16. Declining Discount Rates: Evidence from the UK
17. The name is absent
18. The name is absent
19. Who is missing from higher education?
20. The name is absent