速度和效果是聚类算法面临的两大问题.DBSCAN(density based spatial clustering of applications with noise)是典型的基于密度的一种聚类方法,对于大型数据库的聚类实验显示了它在速度上的优越性.提出了一种基于密度的递归聚类算法(re...速度和效果是聚类算法面临的两大问题.DBSCAN(density based spatial clustering of applications with noise)是典型的基于密度的一种聚类方法,对于大型数据库的聚类实验显示了它在速度上的优越性.提出了一种基于密度的递归聚类算法(recursive density based clustering algorithm,简称RDBC),此算法可以智能地、动态地修改其密度参数.RDBC是基于DBSCAN的一种改进算法,其运算复杂度和DBSCAN相同.通过在Web文档上的聚类实验,结果表明,RDBC不但保留了DBSCAN高速度的优点,而且聚类效果大大优于DBSCAN.展开更多
BACKGROUND Traditionally, the mortality rate at 1-year post hip fracture was quoted as approximately 30% of all hip fractures. There have been recent improvements in hip fracture care in the main driven by national hi...BACKGROUND Traditionally, the mortality rate at 1-year post hip fracture was quoted as approximately 30% of all hip fractures. There have been recent improvements in hip fracture care in the main driven by national hip fracture registries with reductions in 30-d mortality rates reported.AIM To address recent 1-year post hip fracture mortality rates in the literature.METHODS Systematic literature review, national hip fracture registries/databases, local studies on hip fracture mortality, 5 years limitation(2013-2017), cohorts > 100,studies in English. Outcome measure: Mortality rate at 1-year post hip fracture.RESULTS Recent 1-year mortality rates were reviewed using the literature from 8 National Registries and 36 different countries. Recently published 1-year mortality rates appear lower than traditional figures and may represent a downward trend.CONCLUSION There appears to be a consistent worldwide reduction in mortality at 1-year post hip fracture compared to previously published research. Globally, those which suffer hip fractures may currently be benefiting from the results of approximately 30 years of national registries, rigorous audit processes and international collaboration. The previously quoted mortality rates of 10% at 1-mo and 30% at 1-year may be outdated.展开更多
There is a trend that, virtually everyone, rang- ing from big Web companies to traditional enterprisers to physical science researchers to social scientists, is either al- ready experiencing or anticipating unpreceden...There is a trend that, virtually everyone, rang- ing from big Web companies to traditional enterprisers to physical science researchers to social scientists, is either al- ready experiencing or anticipating unprecedented growth in the amount of data available in their world, as well as new op- portunities and great untapped value. This paper reviews big data challenges from a data management respective. In partic- ular, we discuss big data diversity, big data reduction, big data integration and cleaning, big data indexing and query, and fi- nally big data analysis and mining. Our survey gives a brief overview about big-data-oriented research and problems.展开更多
Traditionally, SQL query language is used to search the data in databases. However, it is inappropriate for end-users, since it is complex and hard to learn. It is the need of end-user, searching in databases with key...Traditionally, SQL query language is used to search the data in databases. However, it is inappropriate for end-users, since it is complex and hard to learn. It is the need of end-user, searching in databases with keywords, like in web search engines. This paper presents a survey of work on keyword search in databases. It also includes a brief introduction to the SEEKER system which has been developed.展开更多
Background: Traditional Chinese medicine (TCM) treats diseases in a holistic manner, while TCM formulae are multi-component, multi-target agents at the molecular level. Thus there are many parallels between the key id...Background: Traditional Chinese medicine (TCM) treats diseases in a holistic manner, while TCM formulae are multi-component, multi-target agents at the molecular level. Thus there are many parallels between the key ideas of TCM pharmacology and network pharmacology. These years, TCM network pharmacology has developed as an interdisciplinary of TCM science and network pharmacology, which studies the mechanism of TCM at the molecular level and in the context of biological networks. It provides a new research paradigm that can use modern biomedical science to interpret the mechanism of TCM, which is promising to accelerate the modernization and internationalization of TCM? Results: In this paper we introduce state-of-the-art free data sources, web servers and softwares that can be used in the TCM network pharmacology, including databases of TCM, drug targets and diseases, web servers for the prediction of drug targets, and tools for network and functional analysis. Conclusions: This review could help experimental pharmacologists make better use of the existing data and methods in their study of TCM.展开更多
There is a great thrust in industry toward the development of more feasible and viable tools for storing fast-growing volume, velocity, and diversity of data, termed 'big data'. The structural shift of the storage m...There is a great thrust in industry toward the development of more feasible and viable tools for storing fast-growing volume, velocity, and diversity of data, termed 'big data'. The structural shift of the storage mechanism from traditional data management systems to NoSQL technology is due to the intention of fulfilling big data storage requirements. However, the available big data storage technologies are inefficient to provide consistent, scalable, and available solutions for continuously growing heterogeneous data. Storage is the preliminary process of big data analytics for real-world applications such as scientific experiments, healthcare, social networks, and e-business. So far, Amazon, Google, and Apache are some of the industry standards in providing big data storage solutions, yet the literature does not report an in-depth survey of storage technologies available for big data, investigating the performance and magnitude gains of these technologies. The primary objective of this paper is to conduct a comprehensive investigation of state-of-the-art storage technologies available for big data. A well-defined taxonomy of big data storage technologies is presented to assist data analysts and researchers in understanding and selecting a storage mecha- nism that better fits their needs. To evaluate the performance of different storage architectures, we compare and analyze the ex- isling approaches using Brewer's CAP theorem. The significance and applications of storage technologies and support to other categories are discussed. Several future research challenges are highlighted with the intention to expedite the deployment of a reliable and scalable storage system.展开更多
文摘速度和效果是聚类算法面临的两大问题.DBSCAN(density based spatial clustering of applications with noise)是典型的基于密度的一种聚类方法,对于大型数据库的聚类实验显示了它在速度上的优越性.提出了一种基于密度的递归聚类算法(recursive density based clustering algorithm,简称RDBC),此算法可以智能地、动态地修改其密度参数.RDBC是基于DBSCAN的一种改进算法,其运算复杂度和DBSCAN相同.通过在Web文档上的聚类实验,结果表明,RDBC不但保留了DBSCAN高速度的优点,而且聚类效果大大优于DBSCAN.
文摘BACKGROUND Traditionally, the mortality rate at 1-year post hip fracture was quoted as approximately 30% of all hip fractures. There have been recent improvements in hip fracture care in the main driven by national hip fracture registries with reductions in 30-d mortality rates reported.AIM To address recent 1-year post hip fracture mortality rates in the literature.METHODS Systematic literature review, national hip fracture registries/databases, local studies on hip fracture mortality, 5 years limitation(2013-2017), cohorts > 100,studies in English. Outcome measure: Mortality rate at 1-year post hip fracture.RESULTS Recent 1-year mortality rates were reviewed using the literature from 8 National Registries and 36 different countries. Recently published 1-year mortality rates appear lower than traditional figures and may represent a downward trend.CONCLUSION There appears to be a consistent worldwide reduction in mortality at 1-year post hip fracture compared to previously published research. Globally, those which suffer hip fractures may currently be benefiting from the results of approximately 30 years of national registries, rigorous audit processes and international collaboration. The previously quoted mortality rates of 10% at 1-mo and 30% at 1-year may be outdated.
基金This work was partially done when the authors worked in SA Center for Big Data Research in Renmin University of China. This Center is funded by a Chinese National 111 Project Attracting Interna- tional Talents in Data Engineering Research. This paper was also partially supported by Beijing Natural Science Foundation (Grant No. 4112030) and National Natural Science Foundation (Grant No. 61170011) and China Na- tional Social Security Foundation (Grant No: 12&ZD220).
文摘There is a trend that, virtually everyone, rang- ing from big Web companies to traditional enterprisers to physical science researchers to social scientists, is either al- ready experiencing or anticipating unprecedented growth in the amount of data available in their world, as well as new op- portunities and great untapped value. This paper reviews big data challenges from a data management respective. In partic- ular, we discuss big data diversity, big data reduction, big data integration and cleaning, big data indexing and query, and fi- nally big data analysis and mining. Our survey gives a brief overview about big-data-oriented research and problems.
文摘Traditionally, SQL query language is used to search the data in databases. However, it is inappropriate for end-users, since it is complex and hard to learn. It is the need of end-user, searching in databases with keywords, like in web search engines. This paper presents a survey of work on keyword search in databases. It also includes a brief introduction to the SEEKER system which has been developed.
基金the National Natural Science Foundation of China (Nos. 81520108030, 21472238,61372194 and 81260672)Professor of Chang Jiang Scholars Program, Shanghai Engineering Research Center for the Preparation of Bioactive Natural Products (No. 16DZ2280200)+3 种基金the Scientific Foundation of Shanghai China (Nos. 13401900103 and 13401900101)the National Key Research and Development Program of China (No. 2017YFC1700200)the Natural Science Foundation of Chongqing (No. cstc2018jcyjAX0090)Chongqing Education Reform Project of Graduate (No. yjgl52017).
文摘Background: Traditional Chinese medicine (TCM) treats diseases in a holistic manner, while TCM formulae are multi-component, multi-target agents at the molecular level. Thus there are many parallels between the key ideas of TCM pharmacology and network pharmacology. These years, TCM network pharmacology has developed as an interdisciplinary of TCM science and network pharmacology, which studies the mechanism of TCM at the molecular level and in the context of biological networks. It provides a new research paradigm that can use modern biomedical science to interpret the mechanism of TCM, which is promising to accelerate the modernization and internationalization of TCM? Results: In this paper we introduce state-of-the-art free data sources, web servers and softwares that can be used in the TCM network pharmacology, including databases of TCM, drug targets and diseases, web servers for the prediction of drug targets, and tools for network and functional analysis. Conclusions: This review could help experimental pharmacologists make better use of the existing data and methods in their study of TCM.
文摘There is a great thrust in industry toward the development of more feasible and viable tools for storing fast-growing volume, velocity, and diversity of data, termed 'big data'. The structural shift of the storage mechanism from traditional data management systems to NoSQL technology is due to the intention of fulfilling big data storage requirements. However, the available big data storage technologies are inefficient to provide consistent, scalable, and available solutions for continuously growing heterogeneous data. Storage is the preliminary process of big data analytics for real-world applications such as scientific experiments, healthcare, social networks, and e-business. So far, Amazon, Google, and Apache are some of the industry standards in providing big data storage solutions, yet the literature does not report an in-depth survey of storage technologies available for big data, investigating the performance and magnitude gains of these technologies. The primary objective of this paper is to conduct a comprehensive investigation of state-of-the-art storage technologies available for big data. A well-defined taxonomy of big data storage technologies is presented to assist data analysts and researchers in understanding and selecting a storage mecha- nism that better fits their needs. To evaluate the performance of different storage architectures, we compare and analyze the ex- isling approaches using Brewer's CAP theorem. The significance and applications of storage technologies and support to other categories are discussed. Several future research challenges are highlighted with the intention to expedite the deployment of a reliable and scalable storage system.