NEQNET: The world of theoretical physics

For physicists by physicists

329. Human Activity in the Web

APPLIED, COND-MAT, HEP-TH/PH — By Filippo Radicchi on March 30, 2009 at 4:05 pm

Filippo Radicchi is a research scientist in the Complex Systems Lagrange Lab, ISI Foundation, Turin. He is interested in non-equilibrium diagrammatic methods, renormalization group analysis of complex networks and community detection. Dmitry.

We spend a considerable part of our time surfing the Web: we read news, post on blogs, share photos and music, buy books or other goods, etc. The Web offers great possibilities to communicate and retrieve information, and no earlier technology can be compared to it in terms of global reach and speed of communication.

The Web is also an important source of information for scientific purposes. Actions performed on the Web are generally stored in electronic databases. Think for example about NEQNET: when you make a post or leave a comment, meaningful information about your action is stored in the database on the computer which hosts the service. In addition to the content of the message, your nickname and the time stamp of your message are saved. Electronic databases collecting information about human activity on the Web can therefore be used to understand how people behave and interact.

Earlier studies have already focused on computer-related human activities, with particular attention to the activity patterns of humans. Interesting information can be extracted from the statistical analysis of the so-called inter-event times. Imagine we know the instants of time $t_1^{(i)}, t_2^{(i)}, \ldots, t_{n_i}^{(i)}$ at which a user i has performed $n_i$ actions. From this information we can calculate $n_i - 1$ inter-event time gaps: $\tau_k^{(i)} = t_{k+1}^{(i)} - t_k^{(i)}$, with $k = 1, \ldots, n_i - 1$. Then we can compute the inter-event time probability distribution function (pdf) of the i-th user as

$P_i(\tau) = \frac{N_i(\tau)}{n_i - 1}$,

where $N_i(\tau)$ is the total number of consecutive actions performed by the user i which differ by $\tau$ units of time. The global (calculated over the whole population) pdf can then be calculated as

$P(\tau) = \frac{\sum_i N_i(\tau)}{\sum_i (n_i - 1)} = \frac{\sum_i (n_i - 1) \, P_i(\tau)}{\sum_i (n_i - 1)}$

which is basically the weighted average of the pdfs of single users: each user contributes to the global pdf in linear proportion to her/his total activity. Global inter-event pdfs have been studied in the case of e-mail communication [Nature 435, 207-211 (2005)], Web surfing [Phys. Rev. E 78, 026123 (2008)], etc. In all these cases, it has been shown that the global inter-event time pdf can be well fitted by a power law

$P(\tau) \sim \tau^{-\alpha}$,

where the exponent $\alpha$ ranges from 1 to 2, depending on the case of study. This finding is particularly relevant because human activity seems to be characterized by bursty behavior: long periods of inactivity followed by short periods of intense activity. Some models have been introduced in order to explain this emergent behavior [Nature 435, 207-211 (2005)]. More recently, in [Proc. Natl. Acad. Sci. USA 105, 18153-18158 (2008)] it has been shown that the power-law decay could be explained as the superposition of non-homogeneous Poissonian processes.
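The definitions above (per-user inter-event pdf and global pdf as an activity-weighted average) can be illustrated with a minimal Python sketch. The function names and the toy time stamps are hypothetical and chosen only for illustration; integer time units are assumed, and this is not the code used in the paper.

```python
from collections import Counter


def inter_event_times(timestamps):
    """Return the gaps between consecutive actions of one user."""
    ordered = sorted(timestamps)
    return [later - earlier for earlier, later in zip(ordered, ordered[1:])]


def user_pdf(timestamps):
    """Empirical inter-event pdf of one user: fraction of gaps equal to tau."""
    gaps = inter_event_times(timestamps)
    counts = Counter(gaps)
    return {tau: c / len(gaps) for tau, c in counts.items()}


def global_pdf(actions_by_user):
    """Pool the gaps of all users, so each user weighs as her/his number of gaps."""
    counts = Counter()
    for timestamps in actions_by_user.values():
        counts.update(inter_event_times(timestamps))
    total = sum(counts.values())
    return {tau: c / total for tau, c in counts.items()}


# Hypothetical toy data: user name -> time stamps in arbitrary integer units.
toy = {"alice": [0, 1, 2, 10], "bob": [5, 6]}
print(user_pdf(toy["alice"]))
print(global_pdf(toy))  # {1: 0.75, 8: 0.25}
```

In the toy data the more active user contributes three of the four gaps, which is exactly the weighting described in the text: the global pdf is dominated by the most active users.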

In our paper, we study three very large databases: a large set of queries submitted to the search engine of America Online, all logged actions performed by users on the English-language website of Wikipedia, and a large set of feedback messages sent by users on the eBay (EB) website. The global inter-event time pdf calculated for the EB dataset is shown in Figure 1.

[Figure 1: global inter-event time pdf calculated for the EB dataset.]

As one can clearly see, the global pdf is characterized by a power-law decay modulated by periodic (daily) oscillations. It should be noted that the definition of the global pdf is meaningful only under the hypothesis that all users behave in the same way, which means that each inter-event time $\tau$ is a random variable extracted from the same pdf (the global one) independently of the considered user. This assumption is, however, wrong. If we test how well the global $P(\tau)$ describes the activity patterns of single users, we see that the null hypothesis is significantly violated. A simple Kolmogorov-Smirnov (KS) test, which systematically compares the global $P(\tau)$ with the pdf of each single user (see Figure 2), shows that the fraction of users whose activity pattern is described by $P(\tau)$ at a significance level at least equal to Q is much smaller than expected.

[Figure 2: fraction of users whose activity pattern is described by the global pdf at a significance level at least equal to Q.]
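The test described above can be sketched as follows. This is only an illustration under stated assumptions: it uses the two-sample KS statistic between a user's gaps and the pooled gaps, together with the standard asymptotic Kolmogorov formula for the significance level. The post does not specify the exact variant used in the paper, and the asymptotic formula is only approximate for discrete data with ties. All names and numbers are hypothetical.

```python
import math
from bisect import bisect_right


def ks_statistic(sample, reference):
    """Largest distance between the empirical CDFs of two samples."""
    a, b = sorted(sample), sorted(reference)
    points = sorted(set(a) | set(b))
    return max(
        abs(bisect_right(a, x) / len(a) - bisect_right(b, x) / len(b))
        for x in points
    )


def ks_p_value(d, n_eff):
    """Asymptotic Kolmogorov significance level for a KS distance d."""
    lam = (math.sqrt(n_eff) + 0.12 + 0.11 / math.sqrt(n_eff)) * d
    if lam < 0.3:
        # The alternating series converges slowly here; the value is ~1.
        return 1.0
    total = sum(
        (-1) ** (j - 1) * math.exp(-2.0 * j * j * lam * lam)
        for j in range(1, 101)
    )
    return min(1.0, max(0.0, 2.0 * total))


def user_significance(user_gaps, pooled_gaps):
    """Significance level at which the pooled gaps describe one user."""
    n1, n2 = len(user_gaps), len(pooled_gaps)
    d = ks_statistic(user_gaps, pooled_gaps)
    return ks_p_value(d, n1 * n2 / (n1 + n2))


def fraction_described(p_values, q):
    """R(Q): fraction of users with significance level at least q."""
    return sum(p >= q for p in p_values) / len(p_values)


# Hypothetical example: one user's gaps against a pooled sample.
pooled = [1, 1, 2, 3, 5, 8, 13, 21, 34, 55]
print(user_significance([1, 2, 3, 5], pooled))
print(fraction_described([0.01, 0.2, 0.6, 0.9], 0.5))
```

If all users really drew their inter-event times from the same pdf, the significance levels would be roughly uniformly distributed, so R(Q) would be close to 1 - Q. A curve lying well below that line is what signals the violation of the null hypothesis.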

The main reason for this discrepancy is the heterogeneity of the population in terms of the number of operations performed. Users do not all perform the same number of actions; instead, the number of users who have performed n operations follows a broad distribution. Interestingly, users performing the same number of operations have similar activity patterns. We define $P^{(n)}(\tau)$ as the inter-event pdf averaged only over users who have performed n total actions. We first see that the statistical significance of $P^{(n)}(\tau)$ is much better than that of the global $P(\tau)$ (Figure 3).

[Figure 3]

Each panel reports the fraction of users R(Q) whose activity patterns are described by the pdf $P^{(n)}(\tau)$ with a probability at least equal to Q, for different values of n. The qualitative comparison with Figure 2 tells us that $P^{(n)}(\tau)$ can describe the activity patterns of single users much better than the global $P(\tau)$. The reliability of $P^{(n)}(\tau)$ decreases, however, as n increases.
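Restricting the average to users with the same number of actions can be sketched in a few lines. The grouping by exactly equal n follows the definition in the text; the function name and the toy data are hypothetical, and integer time units are assumed.

```python
from collections import Counter, defaultdict


def pdf_by_activity(actions_by_user):
    """Map n -> inter-event pdf pooled over users with exactly n actions."""
    counts = defaultdict(Counter)
    for timestamps in actions_by_user.values():
        ordered = sorted(timestamps)
        gaps = [later - earlier for earlier, later in zip(ordered, ordered[1:])]
        counts[len(ordered)].update(gaps)
    pdfs = {}
    for n, counter in counts.items():
        total = sum(counter.values())
        if total:  # users with a single action have no gaps
            pdfs[n] = {tau: c / total for tau, c in counter.items()}
    return pdfs


# Hypothetical toy data: two users with n = 3 actions, one with n = 2.
toy_users = {"u1": [0, 1, 3], "u2": [10, 12, 13], "u3": [0, 5]}
print(pdf_by_activity(toy_users))
```

Within one group every user contributes the same number of gaps (n - 1), so pooling the gaps and taking the plain average of the single-user pdfs give the same result. The activity weighting that dominates the global pdf disappears inside each group.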

In addition, we can see that $P^{(n)}(\tau)$ depends on n, in the sense that the decay exponent of this pdf varies as a function of n (see Figure 4).

[Figure 4]

Each panel reports the inter-event time pdfs $P^{(n)}(\tau)$ for the same values of n considered in Figure 3. $P^{(n)}(\tau)$ can be well fitted by a power law (dashed lines), but the decay exponent varies with n. In the presented cases the decay exponent is $\alpha \approx$ 1.1, 1.2, 1.8 and 2.3.
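The post does not say how the decay exponents were fitted. One common choice, shown here purely as an assumption, is the continuous maximum-likelihood estimate above a lower cutoff. The sketch below checks it on synthetic data with a known exponent; the function names, the cutoff and the sample size are hypothetical. Real inter-event times are discrete and modulated by daily oscillations, so this is only a rough guide.

```python
import math
import random


def powerlaw_exponent_mle(values, x_min):
    """Continuous maximum-likelihood exponent for P(x) ~ x**(-alpha), x >= x_min."""
    tail = [x for x in values if x >= x_min]
    log_sum = sum(math.log(x / x_min) for x in tail)
    if log_sum == 0.0:
        raise ValueError("need at least one value strictly above x_min")
    return 1.0 + len(tail) / log_sum


def sample_powerlaw(alpha, x_min, size, rng):
    """Inverse-transform sampling from a continuous power law with alpha > 1."""
    return [
        x_min * (1.0 - rng.random()) ** (-1.0 / (alpha - 1.0))
        for _ in range(size)
    ]


# Synthetic check with a known exponent (hypothetical parameters).
rng = random.Random(0)
synthetic = sample_powerlaw(alpha=1.8, x_min=1.0, size=20000, rng=rng)
estimate = powerlaw_exponent_mle(synthetic, x_min=1.0)
print(f"estimated exponent: {estimate:.2f}")
```

Applying such an estimator separately to the gaps of each group of users with the same n would give one exponent per group, which is the quantity whose dependence on n is discussed in the text.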

The importance of the result is twofold. First, it is important to stress that the study of the global $P(\tau)$ is meaningless: the global pdf is defined on the basis of a wrong hypothesis, and therefore every result obtained from its analysis is biased. Instead of a global pdf valid for every user, it is better to study many different pdfs, each corresponding to users with similar activity. Second, the finding opens a new direction for modelling the process. New models are required in order to understand how and why the number of operations influences the decay exponent of the inter-event time pdfs.

Readers interested in this topic are invited to read our manuscript and visit the homepage, where all data can be freely downloaded.

    Copyright © NEQNET: The world of theoretical physics 2008-2011. All Rights Reserved.