Category Archives: Brain in a jar

Distributed Processing and Biological Approximation

The complexity of biological thinking is impossible to replicate today when constrained by the limitations of a single machine. With the ability to define and exchange standard objects through a standard interface, this barrier could be broken. The hurdle of coding models that approximate the function of the brain has also been insurmountable as long as development was dictated, managed, and organized within a single company. The upper limit on programs of this origin seems to be in the range of 10 million lines. The growth of open source development opens the door to integrating many code bases into the billions of lines necessary to approach the human capacity for thought.

Building Intelligent systems: biological and software

There are amazing analogies between computer processors and the brain, and it is just a matter of time before the algorithms that define electronic processors mirror the functionality of the chemical processors of our brains. Biological neural systems rely on inputs and pattern recognition to learn. Computers are excellent at storing (remembering) facts and are becoming proficient at recognizing relationships as defined by statistical patterns and neural networks. However, computers cannot yet create metaphors or learn how to independently process new information.

Metaphors are an important part of how humans think. We understand new information by drawing parallels and connections between it and prior information. To oversimplify, we learn by recognizing relationships between new information and old. Computers could do the same when their data sets include enough relevant fields to computationally identify systems that are described by similar dynamics. In other words, when a computer has data about how things work, it can find systems that work similarly to each other. The leap from there to drawing metaphors is a data-mining process: it can be solved by computational rote, where statistical relationships are identified, prioritized, and used for prediction.
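
To make that "computational rote" concrete, here is a minimal sketch of treating metaphor discovery as a similarity search over vectors that summarize how each system behaves. The system names, feature values, and cutoff are invented for illustration; nothing here describes an existing data set.

```python
# Minimal sketch: "metaphor discovery" as similarity search over feature
# vectors that summarize how each system behaves. All names and numbers
# below are hypothetical illustrations.
from itertools import combinations
import math

# Each system is summarized by a few dynamic properties (e.g. growth rate,
# oscillation period, damping) - whatever relevant fields the data set holds.
systems = {
    "predator_prey": [0.8, 0.60, 0.10],
    "market_cycles": [0.7, 0.65, 0.15],
    "pendulum":      [0.0, 0.90, 0.05],
}

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

# Rank candidate "metaphors": pairs of systems whose dynamics look alike.
candidates = sorted(
    ((cosine(u, v), a, b) for (a, u), (b, v) in combinations(systems.items(), 2)),
    reverse=True,
)
for score, a, b in candidates:
    if score > 0.95:  # arbitrary cutoff for "similar enough to compare"
        print(f"{a} appears to be driven in similar ways to {b} ({score:.2f})")
```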

This process also leads to “learning” how to process new information. By recognizing the metaphors, new data can be classified and described according to how it is understood. And just like in our brains, there will be errors. Misunderstandings will occur as metaphors are calculated from incomplete data sets. As more data is input, the system will have to make corrections and recalculate all of the other metaphors that included the corrected data. New corrections will be made, and a cascade of corrections will result in a modified historical data record. A large number of calculations and recalculations will occur with each new input, and keeping the historical data (and the calculated results) will require substantial processing and storage.
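
A minimal sketch of that correction cascade, assuming each derived “metaphor” records which inputs it was computed from; the names and dependency structure are invented for illustration.

```python
# Minimal sketch of the correction cascade: every derived relationship
# remembers which inputs it was computed from, so a corrected input
# invalidates everything downstream of it. Names are illustrative only.
from collections import defaultdict, deque

depends_on = {
    "metaphor_AB": {"record_1", "record_2"},
    "metaphor_BC": {"record_2", "metaphor_AB"},
    "metaphor_CD": {"record_3"},
}

# Invert to: input -> the derived relationships built from it.
dependents = defaultdict(set)
for derived, inputs in depends_on.items():
    for item in inputs:
        dependents[item].add(derived)

def invalidate(corrected_input):
    """Return every derived relationship that must be recalculated."""
    stale, queue = set(), deque([corrected_input])
    while queue:
        item = queue.popleft()
        for derived in dependents[item]:
            if derived not in stale:
                stale.add(derived)
                queue.append(derived)  # corrections cascade further downstream
    return stale

print(invalidate("record_2"))  # e.g. {'metaphor_AB', 'metaphor_BC'}
```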

Recognizing metaphors will allow machines to output statements like: “It appears that ABC is driven in many similar ways to XYZ. The result we are seeking might be accomplished by A because a similar result was achieved in XYZ when X was applied.” Put more simply, computers will be able to express creative suggestions.

Interestingly, storage could be massively reduced by deleting the large volumes of data that support relationships strong enough to pass some threshold level of certainty. For example, if everything falls, then we don’t have to keep all of that data, just the relationship that everything falls. This may be analogous to forming intuitions.
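
As a rough illustration of that idea (with made-up observations and an arbitrary certainty threshold), a system could compress a pile of consistent observations into a single stored rule and discard the raw data behind it:

```python
# Minimal sketch of "intuition" as compression: once a relationship is
# supported strongly enough, keep only the rule plus a confidence figure
# and drop the raw observations. Threshold and data are made up.
CERTAINTY_THRESHOLD = 0.99

observations = [("dropped_object", "fell")] * 10_000 + [("balloon", "rose")] * 3

def compress(observations, outcome="fell"):
    total = len(observations)
    consistent = sum(1 for _, result in observations if result == outcome)
    confidence = consistent / total
    if confidence >= CERTAINTY_THRESHOLD:
        # Keep the rule, forget the evidence.
        return {"rule": f"things tend to '{outcome}'",
                "confidence": round(confidence, 4),
                "supporting_observations": total}
    return None  # not certain enough yet; keep the raw data around

print(compress(observations))
```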

Centralized network computing will win

I know it’s a big debate right now, but centralized network computing will win in the end.

Centralized network computing is the term used to describe a system of networked web servers (or a single web server) that provides integrated applications and storage for multiple users who can access the system through distributed terminals. The system can be geographically distributed or not, but will share a common (integrated) network of applications, probably using a software interface standard to encourage and enable multiple independent application development teams.

Centralized networks are inevitable because of self-reinforcing competitive advantages. Economies of scale and market forces will lead to substantial change in the way we compute, and the systems we use now are simply the seeds that will grow into (and merge together to form) global centralized information service providers. There are already some very strong indicators that this trend is underway, and the potential suggests it will be a very long one.

  1. There are economies of scale in processing. Load balancing can optimize processor utilization and provide both faster interactivity and reduced hardware investment (a rough sketch follows this list).
  2. There are competitive advantages in information and application aggregation. Integrations can break down the walls between programs, improving functionality through better integration and data sharing. You can analyze data in more dimensions and with more flexibility. Development rates can improve as it becomes possible to separate more software components and the people who work on them.
  3. Load balancing improves transmissions. Transfer rates improve because fewer nodes are required and data traffic can be optimized. Information served to you from a server on your edge is more reliable and faster than information sent to you from another server through your edge.
  4. End-user transparency. The front-end possibilities under centralized computing are not limited beyond those of other systems. This implies that there will not be a selection bias away from centralized systems, because end-users will not prefer or recognize the difference between systems. That is not to say that they will all be the same – only that they all could be. The opportunity set in one system is available in the other.
  5. The outsourced storage industry exists. This implies that there is a willingness to adopt on the part of the owners of data.
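
As a rough, hypothetical illustration of point 1, the sketch below compares the capacity needed when every user runs a dedicated machine sized for their own peak load against the capacity needed when demand is pooled behind a load balancer. The demand figures are randomly generated stand-ins, not measurements.

```python
# Minimal sketch of economies of scale in processing: peak loads rarely
# coincide, so a load-balanced pool needs far less capacity than the sum
# of the individual peaks. All numbers are made-up illustrations.
import random

random.seed(0)
USERS, HOURS = 200, 24 * 7

# Hourly demand per user, in arbitrary "processor units".
demand = [[random.random() for _ in range(HOURS)] for _ in range(USERS)]

dedicated = sum(max(user_hours) for user_hours in demand)  # one box per user
pooled = max(sum(user_hours[h] for user_hours in demand) for h in range(HOURS))

print(f"capacity with dedicated machines: {dedicated:.0f} units")
print(f"capacity with a load-balanced pool: {pooled:.0f} units")
```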

You can see the markets already rewarding companies that are moving to take advantage of this trend. Many of these companies are providing application services along with ISP connectivity, and they are capturing traffic. That traffic represents users who are investing time and thought in signaling their own preferences. Some examples include personalizing page layout and content, and often even using system-wide wallets and e-mail. Giving users what they prefer is a huge competitive advantage. The time it takes to personalize a competing system is a high transaction cost – especially relative to the low cost of inertia.

Eventually, you will be using only a browser. All your computing will occur on a centralized system and be sent to your browser for translation into your interface. All of your applications will be centrally hosted, so your profile and applications – essentially everything you do on your computer – will be available to you from any machine, at any time.

Multiple systems will compete for scale, reducing marginal costs and creating meaningful and irreproducible competitive advantages. This race will likely be won permanently by 2050. Before that time, ASP services will consolidate rapidly along with economic cycles. The early players will rely on loss-leader models to attract user bases, and will transition to profitability as their scale reaches the tipping point. The companies that make it to profitability first and reinvest in their technology platforms will improve their integration, breadth, and quality to further support their competitive advantages.

In the first decade or two of this trend, there will probably be dozens of smaller companies that are able to enter and gain market share against their larger competitors. These companies will have competitive advantages most likely based on data storage, traditional media integration, wireless adoption, software platform architecture, application suite integrations, and possibly international comparative advantage. After 20 years, the marginal advantages possible from these characteristics will not pose a meaningful threat to the aggregation and scale advantages of the top few market participants.

Consolidate or die will be the mantra of information companies.

Forecast Changes in Asset Values Based on Measurable Cultural Influences

The sum total of the media contributed to the internet approximates the attention of society during any given period. Analysis of this media therefore indicates trends in attention and preferences, and those trends create signals about the direction of values for securities and other assets. The frequency and trend direction of keyword usage, the volume of content in certain classifications, and the level and type of media file contributions, each compared against sector and industry pricing trends, are recommended starting points for analytical comparison.
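
A minimal sketch of that comparison, using invented weekly keyword counts and sector prices: correlate the attention series with the price series at a few lags to see whether attention appears to lead price.

```python
# Minimal sketch: correlate a keyword-attention series with a sector price
# series at several lags. Both series below are invented placeholders.
def pearson(xs, ys):
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs) ** 0.5
    vy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (vx * vy) if vx and vy else 0.0

keyword_mentions = [120, 135, 160, 158, 190, 240, 230, 260]   # weekly counts
sector_prices = [50.0, 50.5, 51.2, 52.0, 52.1, 53.5, 55.0, 55.8]

# Shift prices back by `lag` weeks to ask whether attention leads price.
for lag in (0, 1, 2):
    n = len(sector_prices) - lag
    r = pearson(keyword_mentions[:n], sector_prices[lag:lag + n])
    print(f"lag {lag} week(s): correlation {r:+.2f}")
```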

Artificially Intelligent Program Trading System


         Data Sources
        / /  |  |  \ \
 Parallelized Analytics Agents
        \ \  |  |  / /
  Signal Aggregation Analytics
               |
           Webserver
               |
Trading Platforms and Reporting

Arbitrages – even of very small marginal size – could be exploited by a large number of artificially intelligent program trading systems mining historical and currently released information on financial products and correlated factors. Using multiple signalling agents, an aggregated signal can be calculated by weighting each agent based on the statistical strength of historical signal combinations. Using this type of design, it is easy to develop multiple agents that are completely independent and confidential. Multiple agents can be developed concurrently, tested, and introduced to the aggregate system at any rate.
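
A minimal sketch of that aggregation step, assuming each agent emits a signal in [-1, 1] and carries a score for its historical strength; the agent names, signals, and scores are hypothetical placeholders.

```python
# Minimal sketch of signal aggregation: weight each independent agent's
# signal by how far it has historically beaten chance. All values are
# hypothetical placeholders.
agents = {
    # name: (current signal in [-1, 1], historical hit rate)
    "news_keyword_agent":   (+0.6, 0.58),
    "filing_parser_agent":  (-0.2, 0.64),
    "price_momentum_agent": (+0.9, 0.52),
}

def aggregate(agents, baseline=0.5):
    """Weighted average of signals, weights = historical edge over chance."""
    weights = {name: max(strength - baseline, 0.0)
               for name, (_, strength) in agents.items()}
    total = sum(weights.values())
    if total == 0:
        return 0.0  # no agent has shown an edge; stay flat
    return sum(weights[name] * signal
               for name, (signal, _) in agents.items()) / total

score = aggregate(agents)
print(f"aggregate signal: {score:+.2f} ->", "buy" if score > 0 else "sell/hold")
```

Because the agents only meet at the aggregation step, a new agent can be added to the table without touching the others, which matches the idea that agents can be developed concurrently, tested, and introduced at any rate.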

The process of eliminating arbitrage opportunities will create vast concentrations of wealth within the companies that embrace the tools that automate it. As new information sources become available for analysis, new and increasingly complex arbitrages may be identified. The abstraction of trading systems to automatically test and integrate new data sources will mark the last decades of financially advantageous investment in hedge funds. After that time, return will be a stochastic function of expected risk.