Automatically categorizing Sinhala news items from selected news sources


Overview

Team

Table of Contents

  1. Introduction
  2. Problem
  3. Aim
  4. Objectives
  5. Proposed solution
  6. Solution Architecture
  7. Tools and Technologies
  8. Plan of Work
  9. Links

Introduction

Problem

The primary problem addressed by this project is the lack of tools available for automatically categorizing Sinhala news articles. This creates a challenge for readers to find relevant articles quickly and for news organizations to effectively manage their content.

Aim

The aim of this project is to develop an automated system that categorizes Sinhala news items based on their content to make it easier for readers to find relevant articles quickly and for news organizations to effectively manage their content.

Objectives

Proposed solution

Impact/Business Value:

Success Measurements:

User Stories/Use Case Scenarios:

Solution Architecture

Solution Architecture

Tools and Technologies

For natural language processing and machine learning

  1. LTK (Natural Language Toolkit)
  2. Scikit-learn:
  3. Pandas:
  4. Numpy:
  5. PyTorch:

Web application development

  1. MERN stack

Plan of Work

Outline

Outline

Considerations for extendability

Team, Strengths, and Expertise: