Success or failure of an E-commerce platform is often reduced to its ability to maximize the conversion rate of its visitors. This is commonly regarded as the capacity to induce a purchase from a visitor. Visitors pos...Success or failure of an E-commerce platform is often reduced to its ability to maximize the conversion rate of its visitors. This is commonly regarded as the capacity to induce a purchase from a visitor. Visitors possess individual characteristics, histories, and objectives which complicate the choice of what platform features that maximize the conversion rate. Modern web technology has made clickstream data accessible allowing a complete record of a visitor’s actions on a website to be analyzed. What remains poorly constrained is what parts of the clickstream data are meaningful information and what parts are accidental for the problem of platform design. In this research, clickstream data from an online retailer was examined to demonstrate how statistical modeling can improve clickstream information usage. A conceptual model was developed that conjectured relationships between visitor and platform variables, visitors’ platform exit rate, boune rate, and decision to purchase. Several hypotheses on the nature of the clickstream relationships were posited and tested with the models. A discrete choice logit model showed that the content of a website, the history of website use, and the exit rate of pages visited had marginal effects on derived utility for the visitor. Exit rate and bounce rate were modeled as beta distributed random variables. It was found that exit rate and its variability for pages visited were associated with site content, site quality, prior visitor history on the site, and technological preferences of the visitor. Bounce rate was also found to be influenced by the same factors but was in a direction opposite to the registered hypotheses. Most findings supported that clickstream data is amenable to statistical modeling with interpretable and comprehensible models.展开更多
Suffering from the inefficient traditional trial-and-error methods and the huge searching space filled by millions of candidates, discovering new perovskite visible photocatalysts with higher hydrogen production rate(...Suffering from the inefficient traditional trial-and-error methods and the huge searching space filled by millions of candidates, discovering new perovskite visible photocatalysts with higher hydrogen production rate(RH_(2)) still remains a challenge in the field of photocatalytic water splitting(PWS). Herein, we established structural-property models targeted to RH_(2) and the proper bandgap(Eg) via machine learning(ML) technology to accelerate the discovery of efficient perovskite photocatalysts for PWS. The Pearson correlation coefficients(R) of leave-one-out cross validation(LOOCV) were adopted to compare the performances of different algorithms including gradient boosting regression(GBR), support vector regression(SVR), backpropagation artificial neural network(BPANN), and random forest(RF). It was found that the BPANN model showed the highest R values from LOOCV and testing data of 0.9897 and 0.9740 for RH_(2),while the GBR model had the best values of 0.9290 and 0.9207 for Eg. Furtherly, 14 potential PWS perovskite candidates were screened out from 30,000 ABO3-type perovskite structures under the criteria of structural stability, Eg, conduction band energy, valence band energy and RH_(2). The average RH_(2) of these14 perovskites is 6.4% higher than the highest value in the training data set. Moreover, the online web servers were developed to share our prediction models, which could be accessible in http://materialsdata-mining.com/ocpmdm/material_api/ahfga3d9puqlknig(E_g prediction) and http://materials-datamining.com/ocpmdm/material_api/i0 ucuyn3 wsd14940(RH_(2) prediction).展开更多
Research on reservoir rock stress sensitivity has traditionally focused on unary granular structures,neglecting the binary nature of real reservoirs,especially tight reservoirs.Understanding the stresssensitive behavi...Research on reservoir rock stress sensitivity has traditionally focused on unary granular structures,neglecting the binary nature of real reservoirs,especially tight reservoirs.Understanding the stresssensitive behavior and mathematical characterization of binary granular media remains a challenging task.In this study,we conducted online-NMR experiments to investigate the permeability and porosity evolution as well as stress-sensitive control mechanisms in tight sandy conglomerate samples.The results revealed stress sensitivity coefficients between 0.042 and 0.098 and permeability damage rates ranging from 65.6%to 90.9%,with an average pore compression coefficient of 0.0168—0.0208 MPa 1.Pore-scale compression occurred in three stages:filling,compression,and compaction,with matrix pores playing a dominant role in pore compression.The stress sensitivity of binary granular media was found to be influenced by the support structure and particle properties.High stress sensitivity was associated with small fine particle size,high fines content,high uniformity coefficient of particle size,high plastic deformation,and low Young's modulus.Matrix-supported samples exhibited a high irreversible permeability damage rate(average=74.2%)and stress sensitivity coefficients(average=0.089),with pore spaces more slit-like.In contrast,grain-supported samples showed low stress sensitivity coefficients(average=0.021)at high stress stages.Based on the experiments,we developed a mathematical model for stress sensitivity in binary granular media,considering binary granular properties and nested interactions using Hertz contact deformation and Poiseuille theory.By describing the change in activity content of fines under stress,we characterized the non-stationary state of compressive deformation in the binary granular structure and classified the reservoir into three categories.The model was applied for production prediction using actual data from the Mahu reservoir in China,showing that the energy retention rates of support-dominated,fil展开更多
文摘Success or failure of an E-commerce platform is often reduced to its ability to maximize the conversion rate of its visitors. This is commonly regarded as the capacity to induce a purchase from a visitor. Visitors possess individual characteristics, histories, and objectives which complicate the choice of what platform features that maximize the conversion rate. Modern web technology has made clickstream data accessible allowing a complete record of a visitor’s actions on a website to be analyzed. What remains poorly constrained is what parts of the clickstream data are meaningful information and what parts are accidental for the problem of platform design. In this research, clickstream data from an online retailer was examined to demonstrate how statistical modeling can improve clickstream information usage. A conceptual model was developed that conjectured relationships between visitor and platform variables, visitors’ platform exit rate, boune rate, and decision to purchase. Several hypotheses on the nature of the clickstream relationships were posited and tested with the models. A discrete choice logit model showed that the content of a website, the history of website use, and the exit rate of pages visited had marginal effects on derived utility for the visitor. Exit rate and bounce rate were modeled as beta distributed random variables. It was found that exit rate and its variability for pages visited were associated with site content, site quality, prior visitor history on the site, and technological preferences of the visitor. Bounce rate was also found to be influenced by the same factors but was in a direction opposite to the registered hypotheses. Most findings supported that clickstream data is amenable to statistical modeling with interpretable and comprehensible models.
基金Financial support to this work from the National Key Research and Development Program of China (No. 2016YFB0700504)the Science and Technology Commission of Shanghai Municipality (18520723500) is gratefully acknowledged。
文摘Suffering from the inefficient traditional trial-and-error methods and the huge searching space filled by millions of candidates, discovering new perovskite visible photocatalysts with higher hydrogen production rate(RH_(2)) still remains a challenge in the field of photocatalytic water splitting(PWS). Herein, we established structural-property models targeted to RH_(2) and the proper bandgap(Eg) via machine learning(ML) technology to accelerate the discovery of efficient perovskite photocatalysts for PWS. The Pearson correlation coefficients(R) of leave-one-out cross validation(LOOCV) were adopted to compare the performances of different algorithms including gradient boosting regression(GBR), support vector regression(SVR), backpropagation artificial neural network(BPANN), and random forest(RF). It was found that the BPANN model showed the highest R values from LOOCV and testing data of 0.9897 and 0.9740 for RH_(2),while the GBR model had the best values of 0.9290 and 0.9207 for Eg. Furtherly, 14 potential PWS perovskite candidates were screened out from 30,000 ABO3-type perovskite structures under the criteria of structural stability, Eg, conduction band energy, valence band energy and RH_(2). The average RH_(2) of these14 perovskites is 6.4% higher than the highest value in the training data set. Moreover, the online web servers were developed to share our prediction models, which could be accessible in http://materialsdata-mining.com/ocpmdm/material_api/ahfga3d9puqlknig(E_g prediction) and http://materials-datamining.com/ocpmdm/material_api/i0 ucuyn3 wsd14940(RH_(2) prediction).
基金funded in part by the National Natural Science Foundation of China,grant number 51574257in part by the National Key Research and Development Program of China,grant number 2015CB250904。
文摘Research on reservoir rock stress sensitivity has traditionally focused on unary granular structures,neglecting the binary nature of real reservoirs,especially tight reservoirs.Understanding the stresssensitive behavior and mathematical characterization of binary granular media remains a challenging task.In this study,we conducted online-NMR experiments to investigate the permeability and porosity evolution as well as stress-sensitive control mechanisms in tight sandy conglomerate samples.The results revealed stress sensitivity coefficients between 0.042 and 0.098 and permeability damage rates ranging from 65.6%to 90.9%,with an average pore compression coefficient of 0.0168—0.0208 MPa 1.Pore-scale compression occurred in three stages:filling,compression,and compaction,with matrix pores playing a dominant role in pore compression.The stress sensitivity of binary granular media was found to be influenced by the support structure and particle properties.High stress sensitivity was associated with small fine particle size,high fines content,high uniformity coefficient of particle size,high plastic deformation,and low Young's modulus.Matrix-supported samples exhibited a high irreversible permeability damage rate(average=74.2%)and stress sensitivity coefficients(average=0.089),with pore spaces more slit-like.In contrast,grain-supported samples showed low stress sensitivity coefficients(average=0.021)at high stress stages.Based on the experiments,we developed a mathematical model for stress sensitivity in binary granular media,considering binary granular properties and nested interactions using Hertz contact deformation and Poiseuille theory.By describing the change in activity content of fines under stress,we characterized the non-stationary state of compressive deformation in the binary granular structure and classified the reservoir into three categories.The model was applied for production prediction using actual data from the Mahu reservoir in China,showing that the energy retention rates of support-dominated,fil