Please use this identifier to cite or link to this item:
https://dspace.ctu.edu.vn/jspui/handle/123456789/12140
Title: | Hupsmt: An efficient algorithm for mining high utility-probability sequences in uncertain databases with multiple minimum utility thresholds |
Authors: | Truong, Chi Tin Tran, Ngoc Anh Duong, Van Hai Le, Hoai Bac |
Keywords: | High utility-probability sequence Uncertain quantitative sequence database Upper and lower-bounds Width and depth pruning strategies |
Issue Date: | 2019 |
Series/Report no.: | Journal of Computer Science and Cybernetics;Vol.35 (01) .- P.01–20 |
Abstract: | The problem of high utility sequence mining (HUSM) in quantitative sequence databases (QSDBs) is more general than that of mining frequent sequences in sequence databases. An important limitation of HUSM is that a user-predefined minimum utility threshold is used to decide if a sequence is high utility. However, this is not suitable for many real life applications as sequences may differ in importance. Another limitation of HUSM is that data in QSDBs are assumed to be precise. But in the real world, data collected by sensors, or other means, may be uncertain. Thus, this paper proposes a framework for mining high utility-probability sequences (HUPSs) in uncertain QSDBs (UQSDBs) with multiples minimum utility thresholds using a minimum utility. Two new width and depth pruning strategies are also introduced to eliminate low utility or low probability sequences as well as their extensions early, and to reduce the sets of candidate items for extensions during the mining process. Based on these strategies, a novel efficient algorithm named HUPSMT is designed for discovering HUPSs. Finally, an experimental study conducted with both real-life and synthetic UQSDBs shows the performance of HUPSMT in terms of time and memory consumption. |
URI: | http://dspace.ctu.edu.vn/jspui/handle/123456789/12140 |
ISSN: | 1813-9663 |
Appears in Collections: | Tin học và Điều khiển học (Journal of Computer Science and Cybernetics) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
_file_ | 7.49 MB | Adobe PDF | View/Open | |
Your IP: 13.59.106.174 |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.