1
|
Rashid MRA, Hasan KF, Hasan R, Das A, Sultana M, Hasan M. A comprehensive dataset for sentiment and emotion classification from Bangladesh e-commerce reviews. Data Brief 2024; 53:110052. [PMID: 38317738 PMCID: PMC10838682 DOI: 10.1016/j.dib.2024.110052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Revised: 01/04/2024] [Accepted: 01/05/2024] [Indexed: 02/07/2024] Open
Abstract
In the rapidly evolving domain of e-commerce, analyzing customer feedback through reviews is crucial, particularly for understanding and enhancing consumer experience in the Bangladeshi market. Our comprehensive dataset, derived from two Bangladeshi e-commerce platforms, Daraz and Pickaboo, features a diverse collection of reviews in both Bengali and English, covering a broad range of products. These reviews are not only rich in linguistic variety but also encapsulate a spectrum of emotions, some even conveyed through emojis, offering a deep dive into consumer sentiment. Expert annotators have meticulously examined and categorized each review, classifying emotions into five distinct types - Happiness, Sadness, Fear, Anger, and Love - and sentiments into Positive (Happiness, Love) and Negative (Sadness, Anger, Fear) categories. This level of detailed annotation enables precise assessments of customer emotions and preferences, which are essential for evaluating and improving existing product offerings. Moreover, the insights gleaned from this dataset are invaluable for guiding future product development and uncovering new opportunities in the dynamic Bangladeshi market. Ultimately, this dataset not only serves as a significant resource for sentiment analysis using natural language processing (NLP) techniques but also contributes valuable insights into the unique consumer behavior patterns in Bangladesh, enriching the NLP community's understanding of diverse market dynamics.
Collapse
Affiliation(s)
- Mohammad Rifat Ahmmad Rashid
- Department of Computer Science & Engineering, East West University Bangladesh, Jahurul Islam Ave, Dhaka 1212, Bangladesh
| | - Kazi Ferdous Hasan
- Department of Computer Science & Engineering, East West University Bangladesh, Jahurul Islam Ave, Dhaka 1212, Bangladesh
| | - Rakibul Hasan
- Department of Computer Science & Engineering, East West University Bangladesh, Jahurul Islam Ave, Dhaka 1212, Bangladesh
| | - Aritra Das
- Department of Computer Science & Engineering, East West University Bangladesh, Jahurul Islam Ave, Dhaka 1212, Bangladesh
| | - Mithila Sultana
- Department of Computer Science & Engineering, East West University Bangladesh, Jahurul Islam Ave, Dhaka 1212, Bangladesh
| | - Mahamudul Hasan
- Department of Computer Science & Engineering, East West University Bangladesh, Jahurul Islam Ave, Dhaka 1212, Bangladesh
| |
Collapse
|
2
|
Emu IA, Niloy NT, Karim BMA, Chowdhury A, Johora FT, Hasan M, Mittra T, Rashid MRA, Jabid T, Islam M, Ali MS. ArsenicSkinImageBD: A comprehensive image dataset to classify affected and healthy skin of arsenic-affected people. Data Brief 2024; 52:110016. [PMID: 38293578 PMCID: PMC10827410 DOI: 10.1016/j.dib.2023.110016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 12/05/2023] [Accepted: 12/22/2023] [Indexed: 02/01/2024] Open
Abstract
Compared to other popular research domains, dermatology got less attention among machine learning researchers. One of the main concerns for this problem is an inadequate dataset since collecting samples from the human body is very sensitive. In recent years, arsenic has emerged as a significant issue for dermatologists. Arsenic is a highly toxic substance found in the earth's crust whose small amounts can be very injurious to the human body. People who are exposed to arsenic for a long time through water and food can get cancer and skin lesions. With a view to contributing to this aspect, this dataset has been organized with the help of which the researchers can understand the impact of this contamination and design a solution using artificial intelligence. To the best of our knowledge, this is the first standard, easy-to-use, and open dataset of arsenic diseases. The images were collected from four places in Bangladesh, under the Department of Public Health Engineering, Chapainawabganj, where they are working on arsenic contamination. The dataset has 8892 skin images, with half of them showing people with arsenic effects and the other half showing mixed skin images that are not affected by arsenic. This makes the dataset useful for treating people with arsenic-related conditions. Eventually, this dataset can attract the attention of not only the machine learning researchers, but also scientists, doctors, and other professionals in the associated research field.
Collapse
Affiliation(s)
- Ismot Ara Emu
- Department of Computer Science and Engineering, East West University, Aftabnagar, Dhaka, Bangladesh
| | - Nishat Tasnim Niloy
- Department of Computer Science and Engineering, East West University, Aftabnagar, Dhaka, Bangladesh
| | - Bhuyan Md Anowarul Karim
- Department of Computer Science and Engineering, East West University, Aftabnagar, Dhaka, Bangladesh
| | - Anindya Chowdhury
- Department of Computer Science and Engineering, East West University, Aftabnagar, Dhaka, Bangladesh
| | - Fatema Tuj Johora
- Department of Computer Science and Engineering, East West University, Aftabnagar, Dhaka, Bangladesh
| | - Mahamudul Hasan
- Department of Computer Science and Engineering, East West University, Aftabnagar, Dhaka, Bangladesh
| | - Tanni Mittra
- Department of Computer Science and Engineering, East West University, Aftabnagar, Dhaka, Bangladesh
| | | | - Taskeed Jabid
- Department of Computer Science and Engineering, East West University, Aftabnagar, Dhaka, Bangladesh
| | - Maheen Islam
- Department of Computer Science and Engineering, East West University, Aftabnagar, Dhaka, Bangladesh
| | - Md. Sawkat Ali
- Department of Computer Science and Engineering, East West University, Aftabnagar, Dhaka, Bangladesh
| |
Collapse
|
3
|
Rashid MRA, Hossain MS, Fahim MD, Islam MS, Tahzib-E-Alindo, Prito RH, Sheikh MSA, Ali MS, Hasan M, Islam M. Comprehensive dataset of annotated rice panicle image from Bangladesh. Data Brief 2023; 51:109772. [PMID: 38020434 PMCID: PMC10661701 DOI: 10.1016/j.dib.2023.109772] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Revised: 10/25/2023] [Accepted: 10/31/2023] [Indexed: 12/01/2023] Open
Abstract
Bangladesh's economy is primarily driven by the agriculture sector. Rice is one of the staple food of Bangladesh. The count of panicles per unit area serves as a widely used indicator for estimating rice yield, facilitating breeding efforts, and conducting phenotypic analysis. By calculating the number of panicles within a given area, researchers and farmers can assess crop density, plant health, and prospective production. The conventional method of estimating rice yields in Bangladesh is time-consuming, inaccurate, and inefficient. To address the challenge of detecting rice panicles, this article provides a comprehensive dataset of annotated rice panicle images from Bangladesh. Data collection was done by a drone equipped with a 4 K resolution camera, and it took place on April 25, 2023, in Bonkhoria Gazipur, Bangladesh. During the day, the drone captured the rice field from various heights and perspectives. After employing various image processing techniques for curation and annotation, the dataset was generated using images extracted from drone video clips, which were then annotated with information regarding rice panicles. The dataset is the largest publicly accessible collection of rice panicle images from Bangladesh, consisting of 2193 original images and 5701 augmented images.
Collapse
Affiliation(s)
| | - Md. Shafayat Hossain
- Department of Computer Science and Engineering, East West University, Dhaka, Bangladesh
| | - MD Fahim
- Department of Computer Science and Engineering, East West University, Dhaka, Bangladesh
| | - Md. Shajibul Islam
- Department of Computer Science and Engineering, East West University, Dhaka, Bangladesh
| | - Tahzib-E-Alindo
- Department of Computer Science and Engineering, East West University, Dhaka, Bangladesh
| | - Rizvee Hassan Prito
- Department of Computer Science and Engineering, East West University, Dhaka, Bangladesh
| | | | - Md Sawkat Ali
- Department of Computer Science and Engineering, East West University, Dhaka, Bangladesh
| | - Mahamudul Hasan
- Department of Computer Science and Engineering, East West University, Dhaka, Bangladesh
| | - Maheen Islam
- Department of Computer Science and Engineering, East West University, Dhaka, Bangladesh
| |
Collapse
|