1
|
Using Dynamic Pruned N-Gram Model for Identifying the Gender of the User. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12136378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/10/2022]
Abstract
Organizations analyze customers’ personal data to understand and model their behavior. Identifying customers’ gender is a significant factor in analyzing markets that help plan the promotional campaigns, determine target customers and provide relevant offers. Several techniques were developed to analyze different types of data, including text, image, speech, and biometrics, to identify the gender of the user. The method of synthesis of the profile name differs from one customer to another. Using numerical substitutions of specific letters, known as Leet language, impedes the gender identification task. Moreover, using acronyms, misspellings, and adjacent names impose additional challenges. Towards this goal, this work uses the customers’ profile names associated with submitted reviews to recognize the customers’ gender. First, we create datasets of profile names extracted from the customers’ reviews. Secondly, we introduce a dynamic pruned n-gram model for identifying the gender of the user. It starts with data segmentation to handle adjacent parts, followed by data conversion and cleaning to fix the use of Leet language. Feature selection through a dynamic pruned n-gram model is the next step with the recurrent misspelling correction using fuzzy matching. We evaluate the proposed approach on the real data collected from active web resources. The obtained results demonstrate its validity and reliability.
Collapse
|
2
|
Zhong Y, Huang C, Li Q. A collaborative filtering recommendation algorithm based on fuzzy C-means clustering. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2022. [DOI: 10.3233/jifs-212216] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
With the rapid growth of data scale, the problems of collaborative filtering recommendation algorithm are more and more obvious, such as data sparsity, cold start, scalability, and the change of user interest over time. About the existing problems, we introduce the fuzzy clustering and propose a collaborative filtering algorithm based on fuzzy C-means clustering. The algorithm performs fuzzy clustering on the item attribute information to make items belonging to different categories in different membership degree, increases the data density, effectively reduces the data sparsity, and solves the issue that the inaccuracy of similarity leads to the low recommendation accuracy. Meanwhile, the algorithm introduces the time weight function. Different evaluation times give different time weight values, and recently evaluated items are more representative of the user current interest, so we give a higher weight value, and early evaluated items have less effect on the user current interest, thus the weight value are relatively lower. The experimental results show that our algorithm can effectively alleviate the data sparsity problem and time migration of users preferences, thus achieve better performance.
Collapse
Affiliation(s)
- Ying Zhong
- Research & Development Institute of Northwestern Polytechnical University, Shenzhen, Guangdong, PR China
| | - Chenze Huang
- School of Computer Science and Engineering, Northwestern Polytechnical University, Xian, PR China
| | - Qi Li
- School of Computer Science and Engineering, Northwestern Polytechnical University, Xian, PR China
| |
Collapse
|